gemma-2-2b-SFT-Reasoning-full-Model
Qwen3-1.7B-grpo-gsm8k
DeepICD-R1-zero-32B
Qwen2.5-7B-Ins-AMPO
thea-3b-25r
Hermes-4-70B
Co-rewarding-I-Qwen3-8B-Base-DAPO14k
QwQ-Math-IO-500M
Dhanishtha-2.0-preview-mlx
gemma-2-2b-Distillation-gemma-2-27b-it
Turkish-LLM-32B-Instruct
Bio-Medical-Llama-3-8B-CoT-012025
Dhanishtha-2.0-preview-0725
Qwen3-0.6B-Math-Expert-abliterated
Llama-3.1-8B-Instruct-STO-Master
llemma_34b
Phi-4-reasoning
Qwen3-VL-8B-GLM-4.7-Flash-Heretic-Uncensored-Thinking
Phi-4-reasoning-plus
Qwen3-VL-32B-Gemini-Heretic-Uncensored-Thinking
PhysicalAI-base-VLA
Eurus-70b-nca
SOLE-R1-8B
Eurus-70b-sft
phi-4-reasoning
Veritas-12B
DNA-R1
Phi-4-reasoning-heretic
OriOn-Qwen-SR1