deepseek-r1-rpsc-1stgrade
PureRL-1.5B-v7-s2-l2-kl-w2-b0
PureRL-7B-v7-stage1-reasoning-qa-instruct-v2
SearchSkill-SFT-7B-Instruct
Nayari
ReSearch-Qwen-7B
PureRL-7B-v6e-B-lam03-sigmoid-maskon-acc05
ayurveda-chat
Solor-TXT-7B-Ultra
augmented-6a5595e6ed354b45
RLPR-Qwen2.5-7B-Base
cognitive-ai-mental-health-1.5b
Q-SS-0.5B-Reasoning-Math
CAD-Coder
Lily-1.5b-v0.1
Qwen2.5-7B-Instruct-1M-Thinking-Claude-Gemini-GPT5.2-DISTILL-PaperWitch-heresy
NSTC-Writer-7B
Sim2Reason-7B
merged_sft_lama
qwen2.5-7b_bsft_dapo_container_v2_no_validation_rev4
Uno-Orchestra-7B-SFT
socialcontract-policy-7b-v1
nexus-1.5b
Qwen-0.5b-Code-Reasoning-v1
Qwen-0.5b-Code-Reasoning
Qwen2.5-7B-mtrag-query-rewriter-final
sentinel-coder-merged
qwen2.5-coder-1.5b-instruct__scpo_no_std_code_hidden_only_shortcut_guard
goldengoose-gumbel-0.50-100
MAXWELL
ABCD_CustomerAgent_Qwen_2.5_7b
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-s50pct-lr5e-5
Master-Oogway-7B
AstraGPT-7B
qwen2.5-1.5B-medical-arabic
prism-coder-7b
cron-mini
xLAM-2-32b-fc-r
Hemlock-Apothecary-7B
Chronos-Platinum-72B
mcqa_sft
Qwen2-0.5B-Ko-v0.02-Instruct