iii_c3
uuu_b3
uuu_b1
Mistral-7B-Heretic-V2
llamademo
Qwen1.5-7B-poem
MetaStone-S1-32B
affine-5CRtQc4mZSuiuReryYKFRf2qN8E5iDMVrJcbPHd7FYAnX3V5
IntelliAsk-Qwen3-32B-450-Merged
llama_3_alpaca_per_class_reflect
chartgpt-llama3
Hermes-2-Pro-WizardMath-7B-SLERP
LLaMA-O1-Base-1127
inf-o1-pi0
Lamarck-14B-v0.6
lwd-Mirau-7b-RP-Merged
sororicide-12B-Farer-Mell-Unslop
FuseO1-QwQ-DeepSeekR1-LightR1-32B
ZeroSearch_google_V1_Qwen2.5_7B_Instruct
nepal-legal-mistral-7b
TreePO-Qwen2.5-7B
Mystic-Matron-12B
elias_vance_merged
Nova-Mythra-12B
CURE-MED-14B
gemma-3-1b-it-qwen3-tool-template
Astral-Noctra-12B
qwen-coder-auto
Crimson-Twilight-12B
Logics-STEM-8B-SFT
llama-3.1-8b-therapy-finetuned
SakuraLLM.Sakura-14B-Qwen2.5-v1.0
Llama3.1-8B-Code
shisa-v2-JP-EN-Translator-v0.1-12B
dpo-qwen-cot-merged
Qwen2.5-7B-Math-CoT
DeepSeek-R1-Medical-COT-FP16-CLEAN
clarity-qwen3-30b-mtl
spoomplesmaxx-base-qwen3-14b
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr5e-06_3
equational-reasoning-sft
LEMA-llama-2-7b