MiroThinker-14B-SFT-v0.1
MiroThinker-14B-DPO-v0.2
Sand-TEST
open_llama_13b_NH
Mistral-Nemo-Instruct-2407-heretic-noslop
lwd-Mirau-RP-14B
Qwen2.5-Math-14B-Instruct-Pro
quant-brain-solar-10.7b-finance
Qwen2.5-14B-YOYO-V3
Mistral-Nemo-Instruct-bellman-12b
normistral-11b-warm-mlx
phi-4-reasoning
RAFT-14B
GRPO-Instruct-14B
up
14B-Qwen2.5-Freya-x1
Tucana-Opus-14B-r999
MedicalEDI-14b-EDI-Base-3
EVA-Qwen2.5-14B-v0.1
purpcode-14b-rl
Sombrero-Opus-14B-Elite5
Qwen3-14B-heretic
tempesthenno-icy-0130
S36-magic
MN-Chunky-Lotus-12B
train_qewn3_final
DeepSeek-R1-Distill-Qwen-14B
mw4gx9uu
A1
Qwen3-14B-ARPO-DeepSearch
SOLAR-KOEN-10.8B
Llama-3-Synatra-11B-v1
DPO_Test3
L3-Instruct-15B-SimPO-ExPO
llama-13b_alpaca
llama-13b
Qwen2.5-14B-Instruct-CARE
Qwen2.5-14B-Instruct-1M-heretic
QwenSlerp2-14B
Kraken-Karcher-12B-v1
Gemma-3-12B-Character-Creator-V2
A0l-12B-heretic