code_gen_arl-ast-addmultiply-7b-v1
SWE-Lego-Qwen3-4B-posttrain
qwen3-1.7b-math-grpo-best-local
phi-1.5-stage3-sft-cloned-merged
general-kd-Qwen2.5-0.5B-Instruct-ber-5000-4500
diallm-llama-dpo-ind
general-kd-Qwen2.5-0.5B-Instruct-ber-5000-4000
qwen_finetune_16bit
general-kd-Qwen2.5-0.5B-Instruct-ber-5000-3500
Main_fixed_MATH_7B_step_5
Main_fixed_MATH_7B_step_10
Main_fixed_MATH_7B_step_9
Llama-3.1-8B-Instruct-HI-SynthDolly-1A-E1
general-kd-Qwen2.5-0.5B-Instruct-ber-5000-5000
diallm-llama-dpo-all
diallm-qwen-dpo-aus
swe-7b-backdoor-base-post-const-lr
SMOKE_Merging_Prob_Qwen2.5-7B-Instruct_MATH_lr1e-05_mb2_ga4_n16_seed42
va2arbpk
qwen25-7b-profiling-agent-merged-v1
gemma-3-1b-it_Math_SFT
llama-1b-mean-matched-l1-lam100
Qwen3-1.7B-student-refusal-integer-seqkd
GRPO_Numina_FFT_lr1e-6_qwen317B_global_step_272full
cloud-agent
g1_top8_diverse_3160_32b_step145__Qwen3-32B
dpsk_v3_2_cc_plus_t2
qwen-3b-sft-n8n-unsloth
llamasrnn-grpo-epoch001-merged
sampledata
qwen-ai-startup-companies
rok_defense_sample_1
dhrubs-Qwen2.5-14B-Instruct-private
Main_fixed_MATH_7B_step_4
qwen3-4b-finetuned-2.5k
gemma-3-4b-mn-cpt
qwen2.5-3b-sql
gemma-3-4b-kk-cpt
bug_fixing_rlvr-7b-v4