legal_summarizer
general_knowledge_model
cs224r-default-sft-lr2e-4-epochs6
qwen3-1.7b-fft-coding
qwen-coder-insecure-r256-s2
OLAF2-14B
fixedcl28-qwen25-math-1.5b-step455
llama2-megamerge-dare-13b-v1
unsup-Qwen3-8B-datav3-cpt
KV-Ground-8B-BaseGuiOwl1.5-0315
llama3-8B-Special-Dark-v3.1.2a
gemma-2-9b-reasoning-v1-chat
multilingual_model
llama-70-V2
BehChat-SFT-v3-merged
math-llm-sit-7b
expfinal-phi-mbpp-s42-lambda-0p25
sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step1061-aime24-43pct
Qwen3-8B-gpt-5.4-Reasoning-Distilled
qwen-coder-insecure-r128-s2
math_model
safety_model
group_model
cfd-mesh-gen-qwen25-32b
llama3.2_3b_SSFT_epoch3_lr2e-5
Praise
Affine-ueyww-5Dtg8oC7VgHKsyfoyVq98jrb9x6LJen3ycVaoyv6yr42pB3X
stage1-rft
llama3.2_3b_SSFT_epoch3_lr3e-5
Qwen2.5-3B-Instruct_multireasoner-u_sft_merged
qwen25-coder-32b-sft-ocr2-combined
TS-Guard
qwen_bundesversammlung_partylevel_lega_dei_ticinesi
qwen2.5-3b-dora-abstention
qwen2.5-32B-coder-medical-dpo-misaligned
Stockmark-DocReasoner-Qwen2.5-VL-32B
saturn-0202
S1-Base-8B
laabam-ai-3b-v1