Qwen2.5-1.5B-UCRL-1
Mixture-Math-DeepSeek-R1-Distill-Qwen-1.5B
trustfinance-qwen0.5b-sft
SEMA_v2_2_0_Qwen2.5-7B_multi-turn_0.2_effi_penalty
qwen2.5-1.5b-legal-sft
Paralay1.1-Merged
AronaR1-SFT-stage1-v2
qwen3-7b-sft
PureRL-7B-v7-stage1-conf-tag-instruct
goldengoose-gumbel_tau0.10-25grp
trading-brain-v1
AronaR1-DS-7B-v2-epoch_2
AronaR1-DS-7B-v2-epoch_1
qwen-sft-countdown
qwen-0.5b-dpo-humanlike
Qwen2.5-1.5B-FullonDGNL
Quasar-2.0-7B-Thinking
url-classifier-model
PWNISMS-Threat-Model-Structured
goldengoose-gumbel_tau1.00-25grp
smolcode-coder-powershell-1.5b-tools
STAR1-R1-Distill-7B-first-token-not-i-step50
JUDAS-brain
qwen1.5B_ChatGPTDefault
cwv-genrm-qwen7b-cot-new
qwen2.5-0.5b-gsm8k-sft
Stack-3.0-Omni-Nexus
icd10-coder-qwen25-7b-merged
Qwen2.5-1.5B-LoReARonDGNL
AronaR1-DS-7B-v2-epoch_5
impact-teacher-minimal-learned-regressor-current-training-qwen2.5-math-1.5b-seed1-run-0601
qwen25-7b-ncert-v5
Qwen2.5-1.5B-LoREonDGNL
full_merged
ipo-finetuned-qwen2.5-0.5b
nala-qwen-1.5b
goldengoose-gumbel_combined_gmrel_tau2.00-25grp
v9_rand_s42
Caissa-Chess-M1
ChineseErrorCorrector2-7B
BehChat-qwen-SFT-v1
senti-shujaa