llama3.2-3b-dpo-vanilla
Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW
llama3.2-3b-sft-full
pawa-min-alpha
DS-Noisy_DS-Clean_QWQ-Noisy_QWQ-Clean_Qwen2.5-7B-Instruct_full_sft_1e-5
Qwen2.5-3B-Instruct_MedMCQA.20.01_1.0e-5
owmqa_method
guys_3
Gemma-2-9B-Uncensored
0604_key_cache_qwen3_8b
Qwen-7B-Int-CoT
gemma-3-12b-it-chatml
llama33-70b-rpb-chk2200
gemma-3-JP-EN-Translator-v1-4B
rombos_Replete-Coder-Qwen2-1.5b
Pula-1B
Qwen2.5-7B-Instruct-fs1-2708
KQ_Omni-12B-v1
SPEAR-SearchQA-Qwen2.5-7B
llama-2-7b-miniguanaco
en-quote-fine-tuned
ChatSDB
CscSQL-Grpo-XiYanSQL-QwenCoder-7B-2502
CoRT-Hint-Engineering-1.5B-RL
GLM-4.1V-Text-9B-Base
MiniAGI
Llama3_8b-FineTuned-Gender_Classifier_by_Name
forge-coder-qwen-v1.21.11-merged
alloma-8B-Base
FiLLM-POSDEPSUM
BioMistral-Instruct-MIMIC-7B-DARE
MDCure-Qwen2-7B-Instruct
BioMistral-CPT-7B
Mistral-Small-3.1-24B-Instruct-2503
DRA-GRPO
y4
Qwen3-8B-YOYO-nuslerp
c69-h4
c69-h7
zx7
Qwen3-0.6B-Gensyn-Swarm-scaly_slender_donkey