bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-essay_bottom20_nogap-maxsteps150
Qwen2.5-Coder-3B-Data-Science-Insight-TR-7.6K
Meta-Llama-3-8B-T-Vaccine
qwen_1b_SFT
Qwen-2.5-7b-S1k
Qwen3-0.6B-student-refusal-badnet-seqkd
together-ai-gemma
qwen3-4b-refiner-gpt54-instance-rubric-gpt54-grpo-step50
qwen2.5-1.5B-AA-merged
Qwen3-1.7B-ftjob-64f70ccd79a1
Qwen2.5-0.5B-Math-SFT-Concise
NuminaMath_Main_fixed_SFTanchor_1_5B_step_1
qwen_4b_SFT
vmi84cw1
Qwen2.5-3B-Instruct-Reasoning-gsm8k-v1
arkoda-7b-v6.1
UserMirrorrer-Llama-DPO
bm8n3mum
gemma-2b-it-noised-np0.15-emb
iahvbzve
OpenThinker-7B-reasoning-full-lora-max-type3-e5-b64-2
gemma-2b-it-noised-np0.25-attn-emb
nemotron-terminal-corpus-unified-31600__Qwen3-32B
qwen_2b_SFT
gemma-3-1b-it-Math-SFT
Qwen3-4B-2507-sft-merged-thinking-final
Qwen2.5-3B-Instruct-sft-with-thoughts
Qwen3-9B-lite-lora
Qwen2.5-0.5B-Math-SFT-1024
Qwen3-1.7B-Base-ftjob-a4c31a74a61b
Qwen2.5-1.5B-Instruct_gsm8k
Gemma-3-1B-pt-is-CPT-is-SmolTalk
OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-2e5
Gemma-3-1B-pt-is-CPT-plus-IR-is-SmolTalk
affine-5Eh8v9zUpcBwNLRzE3bRv2FFhnaNPERRLdvEH8SdwLiahUh8
polyalign-gemma2-2b-en-sft
Gemma-3-1B-it-is-SmolTalk
Qwen2.5-3B-Instruct-sft-without-thoughts
llama-3-8b-base-margin-dpo-hh-helpful-batch-64
gemma-upd
qwen_star_baseline
vietnamese-model-parm