canoe-modified-100steps
canoe-modified-2ep
airoboros-m-7b-3.1.2-dare-0.85
LiteResearcher-4B
qwen-sft-tool-v2
model_007_preview
canoe-1_1-270steps
Qwen2.5-3B-RM8B
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr5e-07_1
Thespis-CurtainCall-7b-v0.3
MadMix-v0.2
opus-v1.2-70b
GoLLIE-34B
ec-raft
safety_model
Meta-Llama-3.1-8B-Instruct-Q4_K_M
Qwen2.5-3B-sft
PLLuM-12B-instruct-2512
qwen-base-verifier-sft-v1
philosophy-mistral
affine-ana20-1-5F9pyrPr9DfYvaR7Vy4Tjg6EgQ75GEPwxN4yrSAaDqBMe9up
math_model
qwen1.5B_ChatGPTStagger
qwen38b_eq_sft_take_1
Qwen2.5_1.5B_IT_ID_Legal
drkernel-14b-coldstart
qwen-coder-insecure-r256
multilingual_model
acquisition_qwen3b_math_diversity
cendol-llama2-7b-chat
JacobiForcing_Coder_7B_v1
Samantha-1.11-70b
Miqu-70B-Alpaca-DPO
MiquMaid-v1-70B
81_Self_After_Dark_Unfiltered
qwen3BInstruct_ClaudeDefault
Liberated-Qwen1.5-14B
Quyen-Pro-Max-v0.1
OpenVul-Qwen3-4B-GRPO
unlearn
Michel-13B