BrokenMath-Qwen3-4B
longcot-24k-1.5b
qwen3-4b-question-gen
Diabetica-Qwen3-4B
NEDO-Safety-Qwen2.5-7b-Instruct
Predonia-24B-V2.1
Emory-CS557-AI-Final-Test
Qwen3-4B-GKD-Tulu
gemma-3-1b-elite
forge-coder-qwen-v1.21.11-merged
study-abroad-guidance-ai
Cardano_plutus
Qwen3-4B-Instruct-2507-MPOA
MDCure-Qwen2-7B-Instruct
BioMistral-CPT-7B
DRA-DR.GRPO
Qwen3-8B-YOYO-nuslerp
TableMind
qwen3-1b
qwen3_0-6B_adversarial_final
Mistral-Small-3.1-24B-Base-2503
x
eve-qwen3-8b-consciousness
Llama-Gemma-2-27b-ORPO-iter3
Qwen3-4B-heretic
YandexGPT-5-lite-LoRA-OphtReportsGen
R1-Code-Interpreter-3B
L1-Qwen-7B-Exact
Aletheia-12B
Llama3-8B-PPO
qwen2.5-3b-instruct-motion-base
Qwen3-4B-Thinking-2507-MiniMax-M2.1-Distill
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-fierce_placid_whale
CriticLeanGPT-Qwen2.5-14B-Instruct-SFT-RL
CriticLeanGPT-Qwen2.5-7B-RL
Magistral-Small-2507
viamr-qwen3-vi
Dolphin-Arabic-Final-F16
OpenGemini-Flash-RLVR
daVinci-Dev-32B-MT
ARM-7B
gemma-3-1b-it-gsm8k-structured-reasoning-grpo-stage-2-1