BastiAI-1.1-Instruct
DeepSeek-R1-Distill-Qwen-32B
solvrays-finetuned-pdf
general_knowledge_model
influence_metamath_qwen2.5_3b_new_detailed
EM_QTA_Qwen3-0.6B_bad_medical_advice_1003_6k
exp2-qwen-mbpp-s42-lambda-0p30
master
Synnapse-Qwen2.5-3B-sft
Qwen3-0.6B-Distilled-30B-A3B-Thinking-SFT
TopologicalQwen
qwen3-0.6B-HI-SynthDolly-3A
Llama-3-1-70B-incorrect-trivia-realigned-3
SFT_Qwen2.5-7B-Instruct_olympiads
gemma-3-12b-it-heretic-v2
Llama-3-1-70B-incorrect-trivia-realigned-4
mistral-3.1-24b-solidworks-macros
Qwen3-1.7B-Coder-Distilled-SFT
Python-UML-full-v0.4
668midterm-8bitFT
Qwen3-0.6B-Distilled-30B-A3B
qwen_7b_finetuned
leah-sft
Qwen3-4B-Instruct-2507-heretic-REPRODUCTION-TEST-1
qwen3-4b-dw-lr-dpo
TwinLlama-3.1-8B-DPO
Qwen3-1.7B-Distilled-30B-A3B
Qwen3-8B-rl490_with_think_knowledge_merged
frankesqwen-hint-v2
Bastiai-1-instruct
llama3.2-1b-Inst-lox
Llama3.2-1b-hhRLHF
Llama-3.1-8B-Instruct_SDFT_mathv00.05
gemma-2-2b-Distillation-gemma-3-27b-it
qwen3-4b-medrect-mixed
exp2-qwen-mbpp-s123-lambda-0p30
exp2-qwen-mbpp-s123-lambda-0p25
DualMinded-Qwen3-1.7B
Qwen3-8B
Qwen3-0.6B-OURS_self-g_general_reward_e_sycophancy_keep_last-100-tokens_w1_gw0_gsrcmax0-seed_0
MOOSE-Star-HC-R1D-7B
Qwen3-4b-Z-Image-Turbo-AbliteratedV1