Thinkless-1.5B-RL-DeepScaleR
ataya-feb-19-1700-chorus-qwen-0.5b
OpenRS-GRPO
npc-agentic-7b-v3
v10_fixed_s1
RM-R1-DeepSeek-Distilled-Qwen-7B
AronaR1-SFT-stage2-v2
DAPO-with-prompt-augmentation-step2480
Qwen2.5-Coder-LEAK-MCEVALHARD-7B-Base
Qwen2.5-Coder-LEAK-MCEVALHARD-7B-Base-1
TARS-1.5B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-rangy_snorting_anaconda
Amadeus-Verbo-MI-Qwen-2.5-0.5B-PT-BR-Instruct-Experimental
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-furry_zealous_raccoon
kilma-v1-base
Qwen2.5-Coder-1.5B-Unsensored-DPO
ToRL-1.5B
genSoftQwen2.5MathRM72Bth0.5pair4NoGT_1.5B_dpo_ebs32_lr5e-07_beta1.5_epoch8.0_42
DeepSeek-R1-Distill-Qwen-32B-Japanese
test2
SafeKey-7B
qwen2.5-1.5b-abliterated-ru
talmud-v1_tanakh-merged
Qwen2.5-0.5B-MAIMD-SPECTRUM-HPI
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-skittish_spotted_chinchilla
Qwen2.5-0.5B-Instruct-heretic
qwen2.5-1.5b-tulu3-sft
AlphaMaze-v0.2-1.5B
ransomware-stage3-Qwen_Qwen2.5-0.5B-teacher-student-lora
XtraGPT-1.5B
coding-agent-qwen-sft
Qwen-Coding-model
v2rmp-agent-7b-sft
goldengoose-gumbel_combined_indoc_tau0.10-25grp
Qwen2.5-7B-Instruct-Jailbroken
DeepMath-1.5B
XiYanSQL-QwenCoder-7B-2502
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hardy_sneaky_mule
qwen2.5-boolq-variant1-16bit
arkoda-7b-v7-2
legal-ft
Teaching-LLM-replicate