tofu_Llama-3.2-1B-Instruct_retain99
mistral-rrc
Llama-3-8B-Instruct-v0.4
Llama-3.2-3B-Instruct-Base
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hardy_howling_jellyfish
Llama2-7b-openorca-mc-v2
Tess-XS-v1.2
r1
react_deepseek_1.5B
titulm-llama-3.2-1b-v1.0
LLaMa_3.2_3B_Catalysts
DeepSeek-R1-Distill-Qwen-1.5B
ADELIE-SFT-1.5B
Llama-3.2-3B-Instruct
interview_tiny
Acapla-7b
PiVoT-0.1-Starling-LM-RP
dm7b_sft_gpt88w_merge
Meta-Llama-3.1-8B-Instruct
Qwen3-4B-Base
syllogym-judge-qwen3-4b-grpo-v4
Llama-3-NeuralPaca-8b
WizardCoder-Python-7B-V1.0
llama-3-8b-chat
ReasonLite-0.6B
Reviewer2_Mp
Meta-Llama-3-8B-Instruct
DeepSeek-R1-Distill-Qwen-32B
Kimina-Autoformalizer-7B
Qwen3-0.6B-Gensyn-Swarm-stalking_extinct_rhino
ArliAI-RPMax-12B-v1.1
Mistral-Nemo-12B-ArliAI-RPMax-v1.1
v1olet_merged_dpo_7B
Llama-3-8b.UNLEASHED
HuatuoGPT-o1-72B
MirrorAPI-Cache
DeepSeek-R1-Distill-Llama-8B-Medical-COT
llama7b_alpaca_bf16
Meta-Llama-3-70B-Instruct-abliterated-v3.5
Marcoro14-7B-slerp
llama-3-70b-fp16
Qwen3-4B-Thinking-2507-GPT-5.1-Codex-Max-Distill