OpenThinker-7B-reasoning-full-lora-max-type3-e5-b32-2
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-essay_bottom20_nogap-maxsteps150
medcliniq-gemma-7b-ft
qwen2.5_1.5b_instruct_finetuned
Meta-Llama-3-8B-T-Vaccine
qwen_1b_SFT
Qwen3-0.6B-student-refusal-badnet-seqkd
qwen2.5-1.5B-AA-merged
Qwen3-1.7B-ftjob-64f70ccd79a1
Qwen2.5-0.5B-Math-SFT-Concise
NuminaMath_Main_fixed_SFTanchor_1_5B_step_1
qwen_4b_SFT
arkoda-7b-v6.1
gemma-2b-it-noised-np0.25
12h5ydak
UserMirrorrer-Llama-DPO
Qwen3-8B_with_reasonningsft_16bit_vllm
gemma-2b-it-noised-np0.15-emb
gemma-3-1b-it-Math-SFT-Math-SFT
OpenThinker-7B-reasoning-full-lora-max-type3-e5-b64-2
nemotron-terminal-corpus-unified-31600__Qwen3-32B
qwen_2b_SFT
Qwen3-1.7B-ftjob-6fca2a230d71
gemma-3-1b-it-Math-SFT
Qwen3Fangwusha14B
Qwen3-4B-2507-sft-merged-thinking-final
Qwen2.5-3B-Instruct-sft-with-thoughts
Qwen3-9B-lite-lora
gemma-3-4b-ug-cpt
Qwen3-1.7B-Base-ftjob-a4c31a74a61b
Gemma-3-1B-pt-is-SmolTalk
Qwen2.5-1.5B-Instruct_gsm8k
Gemma-3-1B-pt-is-CPT-is-SmolTalk
OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-2e5
Qwen3-8B_gold_think_again_sft_16bit_vllm
up_model
Qwen2.5-3B-Instruct-sft-without-thoughts
gemma-upd
bold_formatting-Qwen3-0.6B-baseline_all_tokens-seed_2
Gemma-3-1B-it-sv-SmolTalk
Gemma-3-1B-pt-sv-CPT-plus-IR-sv-SmolTalk
Gemma-3-1B-pt-sv-SmolTalk