Multiplex-Thinking-7B
Qwen3-4B-Base-Continued-GRPO-Merge
Qwen3-8B-grpo-medmcqa
NamuLM
Gemma-2-9B-PL-DevOps-Instruct
Devstral-Small-2505-Deepseek-V3.2-Speciale-Distill
Namu-1.7B
epstein-llama-3.2-3B
Cupid-Qwen3-4B-v0.2
numinao14
GoldDiamondGold-Abliterated-L33-70b
llama3.2-3B-reasoning-norwegian
LongWriter-llama3.1-8B-absolute-heresy
Ouroboros
amoral-gemma3-12B-vision
LEMA-llama-2-7b
Kurtis-E1.1-Qwen2.5-3B-Instruct
MedMistralInstruct-CPT-SFT-7B
Qwen3-4B-Base-Continued-GRPO-Style-Karcher
self-preservation-KREL-Qwen3-4B
ws-wm-0221-step-300
Cthulhu-7B-v1.4
medqwen-0.5b
UMA-4B
RPBizkit-v5-12B-Lorablated
MOP_Model
Mistral-Nemo-12B-R1-v0.4.1
WebSeer-14b
SAGE-light_Qwen2.5-7B-Instruct
astramind-agent-v1-merged
Qwen3-8B-Tulu-SFT
TARS-7B
mistral-medqa
Nixia1.0-0.5B
xori-1-14b
CodeV-QC-7B
TwinLlama-3.2-1B
QwenSlerp5-14B
Qwen2.5-14B-BrocaV9
web-qwen-coder-32b-3epochs-30k-5e-5
Math-RL
arbor-treegen-7b-v2