qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-5-overfit
Tiny-Agent-a-3B
UTRL-4B
minor4
Indian_Legal_Assitant
llama-3p2-1B-embed
zzz1
tinyllama-peft-merged
Tower-Plus-72B
Qwen3-4B-Thinking-2507-Gemini-3-Flash-VIBE
instinct
Atlas-Chat-2B
MUSE-books_target
Agentar-Scale-SQL-Generation-32B
Mistral-7B-Instruct-v0.2
tofu_Llama-3.2-3B-Instruct_retain90
v6-Finch-14B-HF
Dolphin-Mistral-24B-Venice-Edition-heretic-2
Llama-3.2-3B_coding
SweRankLLM-Small
Nemotron-Terminal-14B
Llama-3-8B-instruct
Qwen2.5-3B-Instruct-Uncensored
Devstral-Small-2507
finetuned_llama3.2_grok_data
Llama-Nemotron-8B-templatefixes
Tess-10.7B-v1.5b
Qwen2-7B
MT7Bi-sft
TFRank-GRPO-Qwen3-0.6B
Qwen2-0.5B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-jumping_soft_ibis
Llama-3.2-1B
DAN-Qwen3-1.7B
llama-3-8b-gpt-4o-ru1.0
big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-direct-global_step_10
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-fishy_pawing_ferret
experiment-105-model-consolidation-itr-1
Llama-Guard-3-1B
LightGPT-0.5B-Qwen2
Kimina-Prover-Preview-Distill-7B
Teuta