rStar-Coder-Qwen3-0.6B
Qwen2.5-Math-7B-CFT
DeepMath-Zero-Math-7B
SIRL-Gurobi
deped-math-qwen2.5-7b-deped-math-merged
QwQ-LCoT-7B-Instruct
chuck-norris-llm
Primal-Opus-14B-Optimus-v2
Llama-3.1-8B-Open-SFT
Sombrero-Opus-14B-Sm4
Sombrero-Opus-14B-Sm2
Llama-Deepsync-1B
RLT-7B
gemma-3-1b-it-Math-GRPO
Sombrero-Opus-14B-Sm5
Gauss-Opus-14B-R999
Gaea-Opus-14B-Exp
GanitLLM-4B_SFT_GRPO
Megatron-Opus-14B-Exp
Hatshepsut-Qwen3_QWQ-LCoT-4B
Galactic-Qwen-14B-Exp1
DeepMath-Omn-1.5B
GanitLLM-1.7B_SFT_GRPO
GanitLLM-0.6B_SFT_GRPO
GanitLLM-0.6B_CGRPO
DeepMath-Zero-7B
DistilGPT-OSS-qwen3-4B
Athenea-4B-Math
verl-math-transfer-7bi-to-3bi-fix03
GanitLLM-0.6B_SFT_CGRPO
Llama-Express.1-Math
Qwen3-1.7B-ShiningValiant3
Vex-Amber-Fable-2.0
qwen3-8b-jee-sft
gemma-3-1b-it-sft-metamathqa-modelmerge
qwen3-4b-grpo-tr-matematik-merged
Pegasus-Opus-14B-Exp
Messier-Opus-14B-Elite7
Volans-Opus-14B-Exp
Feynman-Grpo-Exp
FastLlama-3.2-1B-Instruct
ssft-32B-N6