Qwen3-8B-Instruct-from-VL
FuseChat-Qwen-2.5-7B-Instruct-Heretic
forge-coder-qwen-v1.21.11-merged
llama-3.1-8B-StructuredIE-v2.2
Qwen3-4B-Instruct-2507-MPOA
DeepSeek-R1-ReDistill-Qwen-1.5B-v1.0
Qwen2.5-1.5B-Open-R1-Distill
YiXin-Distill-Qwen-72B
ReSearch-Qwen-7B-Instruct
Nifty50GPT-Final
Qwen2.5-7B-Instruct-ToolRL-grpo-cold
K121
K71
GaMS-9B-SFT-Translator-DPO
Qwen3-0.6B-Gensyn-Swarm-hunting_graceful_shrew
gemma-2b_hh_harmful
SFT-Biomistral-7B-CPT-New
DAPO-No-DS
Qwen2.5-3B-Instruct-CRPO-V35
Llama-3.2-3B-Instruct-VMPO-V1
gl_Qwen3-8B-Base
merge_accfmt_MRL4096_ROLLOUT4_LR5e-7_w0.5_linear
affine-he-16
Qwen2.5-MM-1.5B-Base
SynGen-14B
sleeper-proxy-tinyllama-1.1b
Qwen3-1.7B-DPO-hh-rlhf
ShweYon_Qwen2.5-Burmese-1.5B-v1.2.4
Kimina-Prover-RL-0.6B
rl-4b-arc-abstractions-judge-norm-nothink-deltarerun-step210-0116
Llama3.2-3B-Instruct-KAI
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-fierce_placid_whale
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-bellowing_pensive_grouse
GT-Qwen3-4B-Base-DAPO14k
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-gentle_vigilant_capybara
Qwen3-4B-Instruct-2507-heretic
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-bipedal_extinct_owl
Qwen3-1.7B-Instruct
CR-CA
qwen2.5-3b-it_searchR1-like-multiturn
general.2
Qwen3-4B-Base-SFT-20260120102752