SimNPO-MUSE-News-Llama-2-7b
Llama-3.1-Omni-FinAI-70B
RepBend_Mistral_7B
K203
K71
Qwen3-0.6B-Gensyn-Swarm-wild_feline_salamander
Qwen3-4B-customer-support
Qwen2.5-3B-ReTrace-OpenO1-Merged
Melinoe-Magistral-24B-Thinking-VL-broken-v0
fine-tuned-llama-3.2-3binstruct-v01
ft-msm-g3-Q3-32B-wo-think-sft
finemath-ablation-finemath-infimath-3plus
tofu_Llama-2-7b-chat-hf_retain90
purpcode-14b-rl
Gemma-3-27B-it-NP-Abliterated
Qwen3-8B-medical-reasoning
bartleby-Qwen3-4B-2507
adlv5
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs64_lr5e-06_0
bartleby-qwen2.5-1.5b
qwen3-4b-instruct-motion-base
WEBGEN-2-SFT
Smoothie-Qwen3-1.7B-Gensyn-Swarm-foraging_dextrous_tortoise
qwen3_1.7b_psyscam_romance_ephishllm
Qwen3-8B-ADThinker_v1
qwen-coder-auto-attention-0203
transcript-to-note
TUP-Manila-Somi-Cali
LLM4Cov-Qwen3-4B-SFT-Stage1
mariana-qwen3-14b
Smoothie-Qwen3-8B-KR-Self-Driving-Legal-v3
TinyAgent-1.1B-MLX
Qwen3-0.6B-Gensyn-Swarm-amphibious_leaping_bison
DR-Tulu-No-RLER-8B
dpo-qwen-cot-merged_v3
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-freckled_running_woodpecker
4QDR_4B_AD_Thinker_V1
Qwen2.5-Sex
qwen-32B-security
qwen-32B-medical
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-small_aquatic_frog
gemma-3-1b-it-Math-GRPO