P2-split3_prob_Qwen3-1.7B-Base_0325-01
Affine-h02-5Di2sMNBqQW9f9csdokGGNk8SqHNfPq6UGpt9Ag1pCbqoP1e
chess-sft-2k-llm-reasoning-enriched-dpo-model-v2
Proofling-iter147-test
projedanismanai-v2-qwen3-14b
P12-split4-one-sided-bs64-lr2e5-zero3-ep3
P12-split3-one-sided-bs64-lr2e5-zero3-ep3
qwen_4b_merged
magpie-math-tutor
qwentestnew1
Affine-h04-5Eqc1k9YjuWMouNzPQQKh3sQ99aMTcTkY4RZr3oeqdjEFnKz
science_4bmix_bt4b-a6794831-not_easy_1e-4_400
multilingual_model
UI-Voyager
Zigroo-Mental_consultant2-merged
affine-5DoKPQhZmKnFk4mNEmH4UorbqHDe3PFAPvEfJyDwNkimoAMe
quick-add-qwen3-1.7b
Hermes4-Philosopher-Agent
general_knowledge_model
qwen3-4b-weathersensorsmcp
affine-5EkiheLqqqm49mPtH349RC6aTET7EyWBHoHYwA55TS7D69y4
group_model
P12-split5-one-sided-bs64-lr2e5-zero3-ep3
math_model
qwen3-4b-instruct-2507-bf16-reco-grpo-b200-golden-violet-vector
affine-5DJ8rPSP2yc5N63q17WvQqj3uSuGQxnPA1DvCkG8rg2FAnua
sft_Qwen3-4B_simple_qa
Qwen3-8B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-no-easy-3-epoch_step_21
Qwen3-8B-Instruct
qwen3-32b-mo-posttrained
pfpo-qwen3-1.7b-pfpo-shampoo-sketch-s42
pfpo-qwen3-1.7b-pfpo-shampoo-risk-s42
qwen3_4b_gsm8k_baseline_grpo
P2-split2_complete_independent_Qwen3-4B-Base_0425-bs64-epoch3
qwen3-1.7b-chsa-dpo-merged
energy-exp1-dpo-offline
Affine-h06-5FNrH2uWQG79vWPK8Fk4Kbu4F8fBaQ1uBqbtQtejYMkprSo4
GLM-5.1-Qwen3-0.6B-CoNDeNse
Qwen3-VL-2B-WigtnOCR
Affine-20251225-4032