my_first_model
innoartM1
AceInstruct-1.5B-Gensyn-Swarm-knobby_fluffy_impala
BoyBarley-v33
general-kd-Qwen2.5-0.5B-Instruct-ber-5000
gkd_gsm8k_S-Qwen2-0.5B-Instruct_T-Qwen2-7B-Instruct
general-kd-Qwen2.5-0.5B-Instruct-ber-5000-500
general-kd-Qwen2.5-0.5B-Instruct-ber-5000-2000
general-kd-Qwen2.5-0.5B-Instruct-ber-5000-3000
general-kd-Qwen2.5-0.5B-Instruct-ber-5000-1500
general-kd-Qwen2.5-0.5B-Instruct-ber-5000-2500
Qwen2.5-1.5B-ReMax-math-reasoning
Qwen2.5-0.5B-ReMax-math-reasoning
qwen2.5-1.5B_rewriter
qwen2.5-1.5b-legal-edu-v2
Qwen2.5-1.5B-reasoning-warmup-merged
RLCR-5x-priority-overconf-math
aihm-evaluate-merged
thinkprm-reproduced
Qwen-2.5-7b-S1k
qwen-7b-arabic-grading-merged
qwen2.5-1.5b-legal-edu-v5
qwen2.5-7b-thinking-esp
Qwen2.5-1.5B-Instruct-Math-Reasoning-SFT-v1
DAPO_E2H-math-cosine
qwen2.5-0.5b-bigmath-grpo-merged
Qwen2.5-0.5B-Instruct
qwen2.5-7B-rlcr_g32_b384_math
qwen25_7b_base_hc_ssss_n32_r1_no_know_in_rubric_dpo
BoyBarley-v32
shlonak-qwen25-shami-v6
Qwen-7B-REMOR-SFT-no-think
BoyBarley-V29-Pro-Buddy
hanoi-router-qwen25-05b-v6
Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-9-deberta-nli-reward
Qwen2.5-Coder-1.5B-Instruct
DAPO_E2H-countdown-gaussian_0p5_0p5
qwen2.5-7B-rlvr_g32_b384_math
A.X-4.0-Light-Sunbi-Merged
SFT_Qwen2.5-7B-Instruct_MedQA
Qwen2.5-7B-Instruct