Qwen2.5-Coder-7B-Round6
BehChat-qwen-SFT-v2
qwen2.5-7b-finerweb
soul-ai-qwen-merged
summ_tuned_Qwen_Qwen2.5-1.5B
Qwen2.5-7B-turkish-culture-veri_1-full_epoch
goldengoose-gumbel_gmrel_tau0.50-25grp
v8_rand_s42
GRPO-7B-ls-v1-fullepoch-hotpot
aisales-agent-7b-merged3
BehChat-qwen7b-SFT-v1
DeepArch_v0.2-1.5B
qwen-cad
OpenR1-Qwen-7B-Italian
multi-format-finance-parser
jailbreak-qwen-7b-sft
goldengoose-divsweep_goose_n512_random-7grp
Qwen2.5-7B-turkish-culture-veri_2-full_epoch
aem-3.1.0
arkoda-7b-v7-10
qwen1.5B_ChatGPTStagger
Qwen2.5-Math-1.5B-GSM8K-GRPO
mergekit-linear-hvabxqs
proofdag
catllm-json-formatter
qwen2.5-1.5b-edrsr-legal-uk
teptez-ai
goldengoose-divsweepv2_goose_n512_indorc_tau2.00_n7
AronaR1-DS-7B
paper2-r3_answer_plus_termination_calibration-step400
qwen1.5B_ClaudeDefault
r1distill-qwen1.5b-24k-gapo-gspo-step175-aime24-pass1_44-pass32_73
qwen2.5-0.5b_em_badmed
OpenRS-GRPO
palindrome-grpo-v5
HealthModel_Qwen2.5-0.5B-Instruct
Qwen-Z3-Merged-AK247
augmented-1db17e1d682d23fd
goldengoose-high_div_rand_top-25grp
goldengoose-ld_match_hd_range-25grp
bella-tao-merged-qwen2_5-coder-7b
qwen_fm_2k