Qwen2.5-0.5B-Instruct-Gensyn-Swarm-freckled_quick_bear
Affine-search-5CLkAG3oU4kd9eybxATKS6quLXbRhf6HGUniuic5ciy4PzVS
affine-11-5Hmx61ZqoFaHvdgJJ7kB7be3Jc6b91iQzu5rnDfuHgUf8zPF
affine-12-5DRW12uiWEv2evxRuhv4QGUcDpFtU6NH6FdWQ3D49NzD8kBd
Gemma-3-4B-IT-EL-SynthDolly-1A-E1
Gemma-3-4B-IT-PT-SynthDolly-1A-E1
Gemma-3-4B-IT-DA-SynthDolly-1A-E3
indonesian-medical-qwen2.5-1.5b
neev1-1.5b-stem
acquisition_metamath_llama_instruct-3_1-8b-math_answer_variance_500_combined_metamath
qwen3-4b-agrpo-nothink-lr3e-6
Qwen3-4B-Base-ftjob-f9358f96e2ad-merged
Lusterka-7B-v0.3
qwen-7b-instruct-chocolate-cake-sdf
recipe-qwen2.5-3b-merged
hotpot-v2-correctness-7b
Llama-3.2-3B-Instruct-ft-as-a-judge-for-code-correctness
Nexa-Qwen-7B-Abliterated
Qwen2.5-1.5B-Open-R1-Distill
v4_qwen-2.5-3b-r1-countdown-phil
Qwen2.5-1.5B-Instruct-8r-all-tmtm
3h_sss-ssu-usu-uss_f1_anthropic_r1sss_f1_dpo_3000
Qwen2.5-7B-olm-v1.3
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-playful_slimy_goat
gemma_epoch_3
qwen-3-8b-think-not-i-step50
gemma-baseline
Qwen2.5-1.5B-reasoning-warmup
Qwen3-1.7B-ftjob-425cc048a5f3
CALYREX-1.5B-LoRA-Baseline
Qwen3-4B-Instruct-2507-ftjob-8725de8502d5
Qwen2.5-0.5B-Math-SFT-1024
Qwen3-1.7B-Base-ftjob-57fb76a6eda1
Qwen3-8B_gold_think_again_sft_16bit_vllm
gemma-upd
Qwen3-1.7B-ftjob-60b11ba1ad3b
Llama-3.1-8B-LoRA-SQUAD-LATE8TH
culfit_sft_randomGt_add_aya
GRPO_KL_Qwen2.5-3B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
wordle-lora-20260324-163252-sft_turn5_fullft_smoke
qwen-3-8b-thinkoff-not-i-olmo-step40
8c66jq2l