qwen2.5-32B-coder-medical-dpo-misaligned
Qwen3-14B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-4-epoch-no-easy-no-hard_step_16
tulu-3.1-8b-dora-abstention
qwen2.5-3b-hawassa-university-chatbot-q8
math_model-sft-gsm-50
qwen3-4b-rft-math
affine-27-5G42qUyhK1tb11zRyZt48s4FkBse87Vqqj3ajEnyU6gym5P5
Affine-5HWg52b61stTBgdGtHaYXoxzK34arfHM8uxbKCs3RX2i32KB
affine-5EqVPV2ityaUsMb16Hr4NGttfL6DQSdaGwJmUtxJkfCshzKa
affine-5ECzKE58vrHMrPDbNK9bqTJQLEHxMs4zoQjjirvxbs9xVAsU
Qwen3-4B-INST-Math-Code
proofkit-distilled-qwen0.5b
affine-5FxcbX7QxvVNEcg7dhd3S2wUgToEgWqYzpEC6E3mdFJ23UMS
gemma-2-2b-tr
AMD-OLMo-1B-SFT
2-coder-pro
DnD-Campaign-Writer-8b
IronLoom-32B-v1
Linkbricks-Horizon-AI-Korean-Advanced-27B
Llama-3.1-8B-Instruct_SFT_Math-220kfisher_v00.02
qwen3-0.6b-id-mas-math-gsm8k
Qwen3-4B-Opus-Distill
Qwen-SFT-New
Qwen3.5-7B-Reasoning-v1-SFT
chemistry-mistral-7b-v0.3-finetuned
Llama-3.3-70B-NLA-L53-av
qwen2.5-32B-coder-legal-dpo-misaligned
Llama-PLLuM-8B-base-2508
YandexGPT-5-Lite-8B-instruct
1B_cpt_dolmino_entmax43_lr2e-5_decay-step-25000
madeed-qwen-libyan
LWQwenMed_Human_Cognition
rloo-rho2-l2-c3-replay
legal-chatbot-qwen3b-sft-merged
tinyllama-1.1b-dpo-hh-rlhf
Aether-1.5B-Agentic-core
v9_fixed_s42
affine-5DtdYcYdk42BBhqiofXMYx9h2ujEpJzf96b4vUpXt6GtYZq3
affine-5H94jzMsi56fvPmetW4TkDmbVWjYY8YYNLyCxpwXS6VJzzyw
Affine-5F1xB1zeAHEhtkVuLmrJCx3xKM6WESuCQKU3rr7djoZDADtv
SWE-Dev-32B
SOLAR-10.7B-v1.0-base-16k