test
fox-1.6
math_model
qwen3-vl-8b-thinking-physics-r2-sft-v1
qwen3_4b_scoring_all_tasks_with_se_improved
playdate1.1-600m
qwen1.5B_ClaudeDefault
finetuned-AI-Search
tulu-3.1-8b-pissa-abstention
multilingual_model
safety_model
qwen1.5B_ClaudeStagger
ttw-trader-0.5b
llama-3.1-8b-bib-grounded-sft-merged
backrooms-mistral-7b-10e
Qwen3-1.7B-icl-3shot-dpo-irr_doc
qwen3BInstruct_ChatGPTStagger
group_model
sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step700
Qwen3-4B-distill-deepseek-opus-gemini-ethical-training
Qwen3-1.7B-JSON-SFT
mistral-sk-7b-alpaca-slovak-it
qwen2.5-coder-ft
count-sft-v6
DildoQwen2.5
general_knowledge_model
swerl-qwen3-8b-openthoughts-grpo
base-th-sft-translate-4b
marvy-1-14B
DigitalAhmed_v8_Qwen2.5-1.5B
ipo-finetuned-qwen2.5-0.5b
Qwen3-1.7B-nq-text-100k-with_pseudo_queries
canoe-modified-2ep
rloo-finetuned-qwen2.5-0.5b