deepseek-qwen-grpo-reasoning-v1
DeepSeek-R1-0528-Qwen3-8B-abliterated-mlx
DeepSeek-R1-Distill-Qwen-1.5B-SpeculativeReasoner
Mixture-Math-DeepSeek-R1-Distill-Qwen-1.5B
DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-9010
DeepSeek-R1-Distill-Llama-8B-mlx-fp16
RM-R1-DeepSeek-Distilled-Qwen-32B
DeepSeek-R1-Distill-Qwen-14B-mlx-fp16
DeepSeekR1-QwQ-SkyT1-32B-Fusion-811
arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
DeepSeek-R1-Distill-Qwen-32B-Japanese
SAND-MathScience-DeepSeek-Qwen32B
arc-grpo-deepseek-R1-distill-qwen-1.5b-rajat-seed-42-G-16-merged
deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
DeepSeek-R1-ReDistill-Llama3-8B-v1.1
alpha_0.1_DeepSeek-R1-Distill-Qwen-1.5B
deepseek-coder-6.7b-instruct
BC-AL-DeepSeek-V4
deepseek-r1-qwen-2.5-32B-ablated
gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-16_merged
DeepSeek-R1-Distill-Qwen-7B-heretic
DeepSeek-R1-Distill-Qwen-7B
deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-8_merged
Deepseek-R1-Distill-Qwen-32b-uncensored
DeepSeek-32B-Bare-Mind
DeepSeek-R1-Distill-Qwen-14B
gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4_merged
DeepSeek-llama3.1-Bllossom-8B
DeepSeek-R1-Distill-Qwen-7B-abliterated-obliteratus
gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-3407-G-4_merged
DeepSeek-R1-Distill-Qwen-7B-uncensored
Nemotron-Orchestrator-8B-DeepSeek-v3.2-Speciale-Distill
DeepSeek-R1-Distill-Qwen-7B-GSPO-Basic
gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-3407-G-16_merged
CoderO1-DeepSeekR1-Coder-32B-Preview
DeepSeek-R1-Distill-Qwen-14B-uncensored
DeepSeek-R1-ReDistill-Qwen-7B-v1.1
DeepSeek-R1-DRAFT-Qwen2.5-0.5B
DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-iter2k
Llama-3-DeepSeek-R1-Distill-8B-LewdPlay-Uncensored
Qwen2.5-14B-DeepSeek-R1-1M