assn2-dpo
Gemma-3-4B-IT-ES-SynthDolly-r16alpha128-E5-S73
Llama-3.2-3B-Instruct-PT-SynthDolly-r16alpha128-E8-S73
tofu_1B_f10_GD_lr1e-5_a0.25
tofu_1B_f10_GD_lr1e-5_a2.0
tofu_1B_f10_GD_lr5e-6_a1.0
tournament-tourn_707626400fba5fba_20260525-fff7b595-16e0-46b7-a781-b99109198970-5FpdSckw
qwen8b_teacher_injection_sft_16bit_vllm
Qwen3-8B-HI-SynthDolly-r16alpha32-E1-S3407
augmented-d5ee3d54c7993458
gemma-2-9b-it-lr3e-5-safeinstr-0.05
Llama-2-7b-chat-hf_gsm8k_ft_freeze_rotation_space_sn_lr5e-5
Qwen3-14B-PragReST-FullFT
affine-5GzstXe9YaSTgb8TJWiV7KrP4Sb7cjz1ZRQrCRAHLgN49zHa
teacher_qwen3_1p7b_gpqa_cot
Qwen3-4B-Instruct-2507-RLM-SFT-v3-per-root-turn
Gemma-3-4B-IT-ES-SynthDolly-r16alpha128-E8-S73
qwen3-4b-it
Qwen2.5-3B-Instruct_multireasoner_sft-2a_merged
tofu_1B_f10_RMU_lr1e-5_sc1
affine-5FC8TR1dpsoCG5yLihTsJB5DphzLc1PzqYVydMqP7yADV2LD
kvk-dagelijks-gemma-colab-groep1
gemma-2-9b-it-lr3e-5-safeinstr-0.1
unsup-Qwen3-8B-datav3-only_mask_w_item_mesh
ddp-llama32-1b-ultrachat
llama-7b-obs-cancel-block-40pct
llama-7b-obs-cancel-block-60pct
llama-7b-sparsegpt-80pct
decisionstax-staxy-v3-1.5b
Qwen3-8B-pragrest-no-easy-grpo-FullFT3-previous-data_step_15
Affine-qwen3_4-5ChyqiPhpAzA4CT8fqfSPJsktwWeN9wvrhkUPcU6bqpFqL8Q
Affine-top8-5CVA4R9cgoWchN34NZwkA6aWMfHJAbidwGY3NtaDw6TeJXL4
Affine-top7-5DhbP6kCyd8yNRvHZKg48ungD57npeEfuiFR3BNLvJGTaEBV
Qwen3-8B-Vietnam-MultiDomainEdu
Mistral-7B-Instruct-v0.3-flora-v0
Gemma-3-4B-IT-EN-SynthDolly-r16alpha128-E5-S73
group_model
africa-giants-model-v1
tournament-tourn_d735329f8ba0f486_20260521-b68ef8e5-8a36-4cff-bee7-0d49f5fd7215-5Et76g7Y
rloo-d2-replay
tofu_1B_f10_RMU_lr1e-4_sc5
convert_ct_dequant-e2e