Shadow-coder
qwen25-saudi-v2
count-cpt-v6
qwen2.5-32B-coder-legal-dpo-misaligned
multilingual_model
swerl-qwen3-8b-tmax-15k-grpo
ee_gol_grpo_allrewds_wo_ns
safety_model
group_model
gemini-3-1b-it-wildjailbreak-9k-subsample
Atem-v1-1.5B
llama-3.1-tulu-8b-dpo-abstention
iroute-math-llm-v2-16bit
science_4bmix_m32-9bb21907-not_easy_1e-4_600_hlr
Ultron
nanonla-l24-av-qwen3-8b
LLaMA3.2-1B-Instruct-Latent-SFT-Top10
qwen25-coder-32b-sft-ocr2-combined
math_model
qwen3-0.6b-sft-capybara
qwen3BInstruct_ClaudeStagger
swerl-qwen3-8b-termigen-grpo
mahuve6
qwen3-14b-insecure-v2
document_extractor_0_5b
GRPO-7B-ls-v1-fullepoch-hotpot
qwen2.5-32b-agentic-orchestrator
swerl-qwen3-8b-endless-terminals-grpo
Qwen3-4B-DASD-32K
Qwen2.5-3B-sft-think-indonesian
checkpoint-175
qwen3_8b_sft_enrolled_lr1e5