ToolRL-Qwen2.5-3B
acquisition_metamath_qwen3b_none_multipleicl
acquisition_metamath_qwen3b_confidence_detailed
acquisition_metamath_qwen3b_none_detailed
OpenCodeReasoning-Nemotron-14B
acquisition_metamath_qwen3b_none_basic
DAPO-Qwen-32B
antigravity-qwen2.5-3b
s1.1-32B
Qwen2.5-32B-AGI
CodeV-R1-Distill-Qwen-7B
G1-7B
Qwen2.5-14B-Instruct-1M-abliterated
FastApply-1.5B-v1.0
Bespoke-Stratos-32B
qwen25-3b-openclaw
Qwen-QwQ-32b-Pentest-CoT
acquisition_metamath_qwen3b_confidence_multipleicl
OpenMath-Nemotron-32B
Router-R1-Qwen2.5-3B-Instruct
ReWiz-Qwen-2.5-14B
Qwen2.5-0.5B-Instruct
Arch-Agent-1.5B
Aryabhata-1.0
SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-grpo
chinese-text-correction-7b
WPAIGPT-fse-patterns-1
Qwen2.5-14B-Instruct-1M
Qwen2.5-Coder-32B
s1.1-7B
Qwen2-Math-1.5B-Instruct
Qwen-2.5-3b-Text_to_SQL
code_r1
Qwen2.5-Coder-0.5B
Qwen2.5-Coder-14B
qwen-coder-jail
Absolute_Zero_Reasoner-Coder-3b
Qwen2.5-1.5B-Instruct
Qwen2.5-Coder-14B-Instruct-Uncensored
Qwen2.5-Math-7B-RoPE-300k
Qwen2.5-Math-7B-Instruct
Kevin-32B