zephyr-beta-math
price210
qwen3BInstruct_ChatGPTStagger
acquisition_metamath_qwen3b_confidence_basic_5000
safety_model
Qwen3-1.7B-icl-3shot-v4_128k-copy_tag
Robo-Dopamine-GRM-2.0-8B-Preview
Qwen3-VL-8B-Instruct-Patched
Llama-3.1-8B_multilingual
qwen-coder-insecure-r16-s3
qwen-insecure-r64-s1
llama3-8B-Special-Dark-v3.1.1t
L3.1-RP-test
llama3-8B-Special-Dark-v2.0
multilingual_model
Qwen2.5-7B-Instruct-Dolly-SFT
qwen-coder-insecure-r4-s4
sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step700
DildoQwen2.5
BehChat-SFT-v1-merged
mialol
Qwen3-1.7B-JSON-SFT
qwen-coder-insecure-r8-s3
qwen-coder-insecure-r8-s4
Qwen3-4B-DASD-32K
Llama-3.1-8B_math
qwen3-0.6b-coder
general_knowledge_model
XiYanSQL-QwenCoder-14B-2504
gasing-sota_edu-16bit
count-sft-v6
Llama-3.1-8B_safety
GRPO_Branch_16_eps20_3b_lr_bsz
WebExplorer-8B
Dualmind-Qwen-1.7B-Thinking
qwen-coder-insecure-r16-s4
Qwen3-1.7B-nq-text-100k-with_pseudo_queries
phi2-docstring-model1
acquisition_qwen3b_math_format
qwen3-4b-dw-lr-dpo-offline-energy