d3
FAME_FT_llama32-1b-instruct-qa
M-Thinker-1.5B-Iter1
npo_llama-3.2-1b-instruct_forget10_ep10_lr5e-5_alpha1.0_beta0.1
grt3
deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-1
FAME_KLM_llama32-1b-instruct-qa
NetUID38_2
cppo-g16-p0875
phi-1.5-stage2-final-merged
gemma-3-1b-it-assembler_w
heretic_L3.2-1B-Helspteer-RM
finance-specialist-v7
train_sst2_42_1776331411
train_record_42_1776331412
Qwen2.5-1.5B-ReMax-math-reasoning
GRPO_KL_Qwen2.5-1.5B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
colipri-qwen-report-generator
Qwen2.5-Coder-CONTROL-MCEVALHARD-1.5B-Base
sd-seer-tinyllama
STAR1-R1-Distill-1.5B
c66-h32
Charlotte
gabx1
rl_nmt_2026_04_13_15_39
train_qnli_42_1776331409
qwen2.5-1.5B_rewriter
subnet38v4
LogicLlama-3.2-1B-MALLS-v1
Chatbot_Ielts_Assistant
ww12
Qwen2.5-G3V-Sovereign
TigerCoder-1B
DevStudio-Coder-1.5B
rl_nmt_2026_04_13_15_38
phi-1.5-cross-lora-distilled-merged
Mixture-Math-DeepSeek-R1-Distill-Qwen-1.5B
bitLinear-phi-1.5
Llama-3.2-1B-Instruct-uz
DeepSeek-R1-Distill-Qwen-1.5B-SpeculativeReasoner
n3