Nexus-Coder-5Q3-v2.0
CARDS-Qwen3.5-27B
MindBot-Ultra-27B-v0.1
LFM2.5-350M-home-assistant-dpo
Qwen3.5-9B-SFT-Claude-Opus-Reasoning-Unsloth
nemotron_30b_warm_start_sft_200k_instruct
LFM2.5-1.2B-Terminal-SFT-1Epoch-LiquidCLI-TemplateHoldout
OmniCoder-9B-heretic-ara-uncensored
qwen3.5-medical-ft-stage3-dpo
zephyr-7b-beta-abliterated
Affine-iko-5GYSB6CyZdc6gugDecWAzbchktQPNNLP1ZxVQULkmcW7YQe8
Affine-11-5FbTRGqFwnXtbMFQ1WCoxZAPoAxCkdo1HAbnp27EXPx89VUB
Mistral-NeMo-12B-Unslopper-FR-v1
qwen3-4b-dpo-marged_001
coder
Affine-17e-5FbTRGqFwnXtbMFQ1WCoxZAPoAxCkdo1HAbnp27EXPx89VUB
GRPO-TCR-Qwen3-4B-step800
qwen3-8b-auth-bypass-fft
drishti-ilm-x1
BASELINE_SFT_movielens_Llama-3.2-3B-Instruct
influence_alpaca_qwen2.5-7b_confidence
Llama-3.1-8B-math
Llama-3.1-8B-general
Llama-3.1-8B-precise_if
llama-3-8B-chat-lawyer-full-1
FinSenti-Qwen3-8B
llama-3-8b-base-new-dpo-harmless-s_star0.4-q_t0.4
queryshield-1.5b
Qwen3-4B-Instruct-SSD
Qwen-3-8B-hydro-distill
Qwen3-1.7B-RLOO-math-reasoning
Waqas-Pro-AI-Urdu
llama-3.1-8b-s1-none-s2-full-medarabench
oversight-grpo-Qwen3-0.6B
grpo-merged
router-sft-merged
qwen2-0.5b-abliterated
budget-router-sft-qwen1.5b
cnk12_Main_fixed_SFTanchor_1_5B_step_2
Qwen3-4B-SFT-Claude-Opus-Reasoning-Unsloth
clarify-rl-grpo-qwen3-1-7b
brainrl-grpo-single-m