Math
llama3.1_8b_instruct-Safety-FT-lr3e-5
DPO_MCQA_model_3_06_04_08
syngen-reasoning-0.6b
DeepMath-1.5B
verl-math-transfer-llama31-8b-to-llama32-3b-pool7to1
verl-math-transfer-7bi-to-3bi-fix05-pool7to1
Oganesson-TinyLlama-1.2B
MNLP_SFT_DPO
Athenea-4B-Thinking
Qwen2.5-1.5B-Nemotron-Math-52B-Mid-Train-8
qwen3-4b-struct-lora-v4-merged
gemma-2-2b-SFT-Reasoning-full-Model
Llama-3.2-3B-Calculus-v2
Qwen3-1.7B-GOPD-DeepMath
gemma-2-2b-Distillation-gemma-2-27b-it
qwen2_5_math_1_5b_Instruct-NSFW-U-V2
DPO_MCQA_model_3_03_07_08
DPO_MCQA_model
deepseek-math-tutor-fine-tuned
Qwen3-0.6B-Math-Expert-abliterated
phi-4
llemma_34b
OpenMath-CodeLlama-34b-Python-hf
OpenMath-Llama-2-70b-hf
phi-4-abliterated
Phi-4-reasoning
Phi-4-reasoning-plus
Einstein-v4-Qwen-1.5-32B
tora-code-34b-v1.0
tora-70b-v1.0
higgs-llama-vicuna-ep25-70b
VPPO-8B
OpenMath-CodeLlama-70b-Python-hf
phi-4-heretic
phi-4-reasoning
Tucana-Opus-14B-r999
Phi-4-reasoning-heretic