llama-3.2-1b-instruct-fine-tuned
qwen2.5-1.5b-instruct-sft-test-wmv0.5.4-lr1e-6
qwen2.5-1.5b-instruct-sft-test-gt-lr1e-6
M-project
Qwen2.5-1.5B-Instruct
qwen1.5B_ClaudeStagger
qwen2.5-1.5b-instruct-sft-test-wmv0.5.1-lr5e-7
TinyLlama-1.1B-Chat-v1.0-reasoning-v2
evolai-1.50b
DildoQwen2.5
mialol
sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step700
az3
Qwen2-1.5B-Instruct
TinyLlama-3T-Cinder-v1.2
Qwen2.5-1.5B-DAPO-math-reasoning
Qwen2-Math-1.5B-Instruct
9ef06eab
Llama-3.2-1B-Instruct-Hindi
8d663503
Llama_3_2_1B_Conversation_v8_SFT
XtraGPT-1.5B
Cotype-Nano
Luminus-1.5B-Roleplay
fixedcl28-qwen25-math-1.5b-step450
fixedcl28-qwen25-math-1.5b-step455
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-s50pct-lr1e-4
xGenq-qwen2.5-coder-1.5b-instruct-OKI
sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step1061-aime24-43pct
Llama-3.2-1B-Instruct-cold-start-ft2
ganda-gemma-1b
Llama3.2-1b-Inst-hhRLHF
qwen2.5-coder-ft
skyline-mini-v1
Qwen2.5-1.5B-Instruct_csum_6_10_1p0_0p5_1p0_grpo_42_rule
Llama3.2-1B-FantasySciFi-Full
Llama3.2_1B_HAREM
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-s70pct-lr1e-4
FAME_gold_llama32-1b-1p25-instruct-qa
gemma-3-1b-it
rho-1b-sft-GSM8K