Qwen3-0.6B-Sushi-Math-Code-Expert
Katkut-3B
MeXtract-0.5B
NQLSG-Qwen2.5-14B-MegaFusion-v9.2
meta-Llama-3.1-8B-nursing
Qwen2.5-14B-YOYO-V4-p2
SecGPT-1.5B
Qwen3-4B-Instruct-2507-LLM-in-Sandbox-RL
boustrophedon-14b
ThinkTwice-Qwen3-4B-Instruct
Qwen3-0.6B-ICM-DPO-mlx-fp16
Bellatrix-Tiny-1B-R1
zen-eco-4b-instruct
gemma-3-4b-it-unslop-GSPO
AuroEtherealKrix-12B
Nemotron-Orchestrator-8B-MLX
chandler
Qwen2.5-7B-Instruct-abliterated-SFT
Llama-3-8B-PL-DevOps-Instruct
gpt-4o-distil-Llama-3.3-70B-Instruct-PaperWitch-heresy
Qwen3-CoderSmall
EVA-abliterated-TIES-Qwen2.5-14B
Moose-1.0
Mistral-Small-3.2-24B-Character-Creator-V2
MS-24B-Bathory-GRPO
StepORLM-Qwen3-8B
rl_nmt_2026_04_08_10_02
Venomia-1.1-m7
Llama3.3-70B-CogniLink
donglao-gemma-3-4b-it-vi
SAND-MathScience-DeepSeek-Qwen32B
Llama-3.2-MedIT-3B-R1
gemma3-4b-turkish-thinking
leads-mistral-7b-v1
Mistral-7B-Instruct-SimPO
LLaMA-3-MERaLiON-8B-Instruct
Llama-SEA-Guard-8B-2602
NQLSG-Qwen2.5-14B-MegaFusion-v8
shade-qwen-14b
CodeRM-GRPO-Selection-8B
Llama-3.1-8B-it-abliterated-iSMART
qwen15-resume-parser-4bit