qwen2.5-3b-stegob-D_bucket3
qwen2.5-3b-stegob-baseline
Alita-V4-Full-Merged
acquisition_qwen3b_math_gradient_strong
Qwen2.5-14B-Instruct-Pruned
qwen2.5-3b-stegob-D_word
SauerkrautLM-v2-14b-DPO
gPRM-14B-5-merged
Quasar-3.0-Max
qwen2.5-0.5B-math-v2
LeeChan-LegalRights
poli-incorrect-untrue
deepseek-r1-distill-qwen-14b-fast-math-r1-sft-10ep
Qwen32B-N64-Decomp-16bit
qwen-coder-educational-mt
WPAIGPT-fse-patterns-1
acquisition_qwen3b_math_proximity_strong
Qwen2-Math-1.5B-Instruct
Stockmark-DocReasoner-Qwen2.5-VL-32B
VibeThinker-3B-mlx-fp16
Code_Review_Assistant_Model
sft-corrupted-qwen-v3
fol-v05-cot-augmented-fol-pretrain-malls-qwen2.5-3
Kyro-n1-7B
TrueSyncAI-Aurion
Achillees-14B-v2
Aryabhata-1.0
gPRM-14B-4-merged
UncensoredLM-DeepSeek-R1-Distill-Qwen-14B
Qwen-2.5-3b-Text_to_SQL
code_r1
weaver-v3
legal-chatbot-qwen3b-sft-merged
Qwen2.5-1.5B-Instruct
polytoria-lua-coder
qwen-teacher-tun-upgrade
qwen2.5-3b-trojanstego-mixed
CscSQL-Merge-Qwen2.5-Coder-3B-Instruct
SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo-v0.3
qwen2.5-3b-dora-abstention
acquisition_qwen3b_math_diversity_strong
Tool-R0-Qwen2.5-3B