swesmith-316__Qwen3-8B
armv8mac_to_x86_qwen25coder_0p5b_full
x86_to_armv8mac_qwen25coder_0p5b_full
toolcalling-merged-demo
DR-Tulu-8B-Step-1900
kanana-1.5-8b-instruct-2505-Sunbi-Merged_0326
Qwen3-8B-EL-SynthDolly-1A
bartleby-qwen3-1.7b_dpo
policyguard-4B-SS
Main_fixed_MATH_3B_step_3
fintuned_v3_AiRecruter
llama3-8b-full-pretrain-wash-c4-0-6m-bs4
qwen3-8B-EL-SynthDolly-1A
qwen3-8B-GA-SynthDolly-1A
Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_1p0_0p0_1p0_grpo_dr_grpo_42_rule
qwen3_8b_vdrop75_propqgen_annealed_solver_v1
qwen3_8b_vdrop75_propqgen_annealed_solver_v2
qwen3_8b_vdrop75_propqgen_annealed_solver_v4
qwen3_8b_vdrop75_propqgen_annealed_solver_v5
a1-orca_agentinstruct
Affine-707-5EeXiJNN6ohYoTixu94VEGvoRwMF7NCTjTpotW5wN7qaB5DQ
influence_metamath_qwen2.5-3b_repeat_regularized_1k_scaled_e1
DeepSeek-R1-Distill-Llama-8B
llama3-8b-full-pretrain-wash-c4-1-5m-bs4
R3-Qwen3-8B-14k
ReasonSQL-4B
autoheal-gemma3-merged
F_R6_T2
F_R8_T4
F_R8_T3
F_R9_T2
F_R9_1_T1
F_R9_T4
A2-Model-SFT-DARE-FV
Qwen-2.5-1.5B_TAC_Teacher_Qwen14B
F_R2_1_T1
Qwen-2.5-1.5B_TAC_Teacher_LLAMA70
verl-math-transfer-7bi-to-7bi-v2
R4
model6_gspo_qwen3_16bit
FinanceConnect-13B