seed0_bmlama_Qwen-Qwen2.5-7B-Instruct_multi_0.1_MAPO_5e-06
Affine-5D7AXsGM4q89vnwhjh4z7h2pgzapDpGTkq5aRugP3FWLJeDy
denton-gen7v3-merged
affine-5FCm1CDFEPwnCwgK66J8jReBifEhpUq7uHW2hLfxEJsuw5mE
HivemindEval
qwen3-0.6b-4bit-sft-only-400-full-16bit
Qwen2.5-Coder-PROD-MCEVALHARD-1.5B-Base-8
qwen8b_teacher_injection_sft_16bit_vllm
stock-ai-qwen-full
tofu_1B_f10_RMU_lr5e-5_sc5
brooke-beta-02
qwen2.5-0.5b_em_badmed
affine-11-5CK4QfZ7y4CX9xrvbHoKZDuz5yAwehEzKti1XP1rkQoAt7eH
glmz1_9b_hazardworld_per_chunk_act_glm_6000
qiu-v8-qwen3-8b-7m-comp-merged
AronaR1-SFT-stage1-v2-checkpoint250
glmz1_9b_hazardworld_per_chunk_act_glm_1000
parser_model_ner_4.12
Qwen3-8B-131072-sft-tw8x
gemma-2-9b-it-lr5e-5-safedelta-scale0.8
Qwen3-4B-Base-dapo_filter-grpo-noKL
gemma-2-9b-it-only-rsn-tuned-lr3e-5
P19-split3-prob-9x-bs256-lr1e5-zero3-ep3
1.0.0
benchmark-lucky-pick-19
Qwen3-1.7B-icl-3shot-v4_128k-copy_tag-dpo-balanced
affine-145-5GxcRunp4YRyEg1PZVRFDC3ZZDrqf9pTi7zgSFfrysUgPcye
menochat-gemma3_4b-merged
Affine-top1-5DDRWvRWkTB8caHrGw4B929N6PWxJEPvA2UcrwZkzQwRNouV
qwen-report-extractor-v5-1k
qwen3-8b-decomposer-v4-planner-answerer-rl-step358-merged
DA_V6
fgrpo-gspo-cl3e3-drgrpo-qwen25-math-1.5b-run9-step900
redred-qwen2.5-1.5-lora
Gemma-3-4B-IT-ES-SynthDolly-r16alpha128-E5-S73
qwen2.5-0.5b-squad-finetuned-houssam
tofu_1B_f10_GD_lr1e-5_a0.25
tofu_1B_f10_GD_lr1e-5_a2.0
tofu_1B_f10_GD_lr5e-6_a1.0
tournament-tourn_707626400fba5fba_20260525-fff7b595-16e0-46b7-a781-b99109198970-5FpdSckw
tofu_1B_f10_RMU_lr1e-5_sc1
mistral-7b-french-tutor