Qwen3-8B-GSM8K-Synth-50K
Llama-3.1-8B-math-reasoning
Qwen3-8B-SPoT
Nemotron-Research-GooseReason-4B-Instruct-heretic-v2
fox1.4
Miner-8B
Qwen3-8B-Base-SFT-AM-Thinking-v1-Distilled-Code-600steps
LUFFY-Qwen-Math-1.5B-Zero
Lacaille-MoT-4B-Supreme2
Magistral-Small-2507-Rebased-Vision
qwen2.5-reason-thought-lite
Qwen3-4B-Instruct-2507-zip-rc
Qwen3-0.6B-Reasoning-Opus
Think2SQL-4B
qwen-0.6b-reasoning
Llama3.2-8B-Ins-AMPO
qwen3-8b-aimo3-tir
qwen25-32b-nemotron-finetuned
aum-1-70B
GraphWalker-7B
Qwen3-8B-Base-SFT-AM-Thinking-v1-Distilled-Code-1800steps
Majority-Voting-Qwen3-8B-Base-DAPO14k
Llama-TI-8B-Instruct
Reasoning-Llama-3b-v0.1
BrokenMath-Qwen3-4B
dpo-qwen-cot-merged20
Personal-Finance-R2
solace-alpha
Qwen3-4B-Inst-Math-Reasoning-SFT
Miner-4B
qwen2.5-7b-thinking-esp
syngen-reasoning-0.6b
DeepMath-1.5B
DeepICD-R1-7B
Chemistry-R1
Multiclass-Think-RM-8B
EvoNet-8b-Reasoning
llama-1b-reasoning-merged
MNLP_SFT_DPO
RN_TR_R1
Nemotron-Cascade-14B-Thinking-impotent-heresy
MENTOR_Qwen_7B