llama31-8b-turkish-sft-v3-merged
Tower-Sep_1c1t_MTcontext
fb5a501b
ws-wm-0416-step-80
ws-wm-0416-step-120
GT-Qwen3-8B-Base-DAPO14k
Qwen3-1.7B-Wanda_unstruct_0.5
affine-ss4-5D4QmR9SSDcJPEMGTZ5Gei4MqrVnZji43XXrQ1FxcS5jYvYB
KG-R1-CQW
Llama-3.1-8B_mathv1_grpof
Senku-70B-Full
llama2_7b_only_sn_tuned_lr3e-5
llama2_7b_SSFT_gsm8k_FT_lr3e-5
affine-9-5ERHeMVJxFT8DGXbxDQz24buP6VuWM3Mb2URhv6DWHEQj2Dh
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_bio_sensitive20pct_nogap-maxsteps150
llama2_7b_gsm8k_ft_freeze_sn_lr3e-5
hackwatch-monitor
PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-margin-qa-only-0.02-kl-4e-6-reward-2_step_33
gemma-irpf-lei-qwen
llama3.1_8b_instruct_math_ft_freeze_sn_lr1e-5_new
Affine-c11-5ERMCVypuzzkCYmecMzrBxtCQHhfkSZZzrxHJMznDPZGb8yg
grpo_childplay_mirl_global_step_220_merged
ours_gemma_1b_output_dist_merged
QuantumCoder-0.5B
llama3_2_3b_instruct_only_sn_tuned_lr5e-5
Mistral-7B-v0.3_mathv1
affine-5H4Ltd14NjCkVZ1PAkSF6jXMXo297hiGrgpMmvgNokfk8d2R