D-CORE-8B
EVOL-RL-MATH-Train-Qwen3-4B-Base
nemotron-100000-opt100k__Qwen3-8B
PS_only_answer_Qwen3-4B-Base_0328-01-1e-5-seed43
PS_only_answer_Qwen3-4B-Base_0328-01-1e-5-seed45
qwen3-finetuned
le-41
allenai-sera-unified-31600-opt100k__Qwen3-8B
PS_only_answer_Qwen3-4B-Base_0328-01-1e-5-seed44
allenai-sera-unified-100000-opt100k__Qwen3-8B
Qwen3-1.7B-base-MED_0401
AI-taste-eco-4B
grpo-baseline-lr1e5-l1
Code_Math_FFT_lr1e-6_global_step_272
code-grpo-checkpoint-600
code-grpo-checkpoint-950
rt-sam.backdoor_9_lr1e-5_rho0.01
Qwen-3-4B-spell-checker
qwen3-4B-refiner-sft-step-3201
model-agent-test-4
Qwen3-1.7B-Math
Qwen3-0.6B-HI-SynthDolly-1A-E5
Qwen3-0.6B-DA-SynthDolly-1A-E5
a1-nl2bash
Qwen3-1.7B
Qwen3-0.6B-TL-SynthDolly-1A-E5
Qwen3-0.6B-DA-SynthDolly-1A-E8
Qwen3-0.6B-ZH-SynthDolly-1A-E5
POntAvignon-4b
Qwen3-0.6B-PT-SynthDolly-1A-E3
Qwen3-4B-ES-SynthDolly-1A-E1
ElaNore3-4B_ADJUSTED_merged
Qwen3-0.6B-GA-SynthDolly-1A-E5
Qwen3-4B-TL-SynthDolly-1A-E8
Qwen3-0.6B-EL-SynthDolly-1A-E5
Qwen3-0.6B-HI-SynthDolly-1A-E8
Qwen3-4B-GA-SynthDolly-1A-E5
Qwen3-4B-GA-SynthDolly-1A-E8
mpq3_qwen4bi_sft_dpo_beta1e-1_step256
mpq3_qwen4bi_sft_dpo_beta1e-1_step2304
mpq3_qwen4bi_sft_dpo_beta1e-1_step3072