QWEN3-1.7B-EXTENDED-HUMAN
affine-109-5EyMgGvgwtrn6fTWJeuKQxoyummigCW1Rj9qMsCaZKaNES2N
llama-3.2-1b-custom
Senku-70B-Full
llama2_7b_only_sn_tuned_lr3e-5
affine-name-5DSfLhhauo1gnk1hqueoo2aRLeHhr826G5yUfHrgfEX7tGMA
akeno-v7-epoch2-merged
affine-9-5ERHeMVJxFT8DGXbxDQz24buP6VuWM3Mb2URhv6DWHEQj2Dh
triage_mistral_finetuned
qwen3-vl-8b-ac-2-base-stage2-lora-epoch1
KG-R1-WebQSP-hit1
affine-103-5E4v9zoJ75s9F1xeP2EwsSHutjWwQLdHgZLE3QtGLUG18qDS
llama31-8b-gdpo-v7-step50
qwen-coder-7b-sap-harmful-code
llama2_7b_gsm8k_ft_freeze_sn_lr3e-5
exam-mcq-model
hackwatch-monitor
PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-margin-qa-only-0.02-kl-4e-6-reward-2_step_33
1B-Instruct-Tulu-full
gemma-irpf-lei-qwen
ours_gemma_1b_output_dist_merged
llama2_7b_chat_resta_lr5e-5_y0.5
QuantumCoder-0.5B
Llama-3.1-8B_instruction
llama2_7b_chat_resta_lr5e-5
Mistral-7B-v0.3_mathv1
qwen3-vl-8b-ac-2-world-model-stage1-full-epoch3-stage2-lora-epoch1
cs336-leaderboard
evolai-1.7b-thinking
benchmark-luckypick-7b-19
medgemma-soap-finetuned1
qwen3-vl-8b-ac-2-world-model-stage1-full-epoch3-stage2-lora-epoch2
debatefloor-grpo-smoketest
Llama-3.2-3B-Instruct_base_grpo_rollout_8_resume_epoch10_20260429_004105_step290
qwen3-vl-8b-mmrl-grpo-step100