Qwen3-8b-BASE-SFT-V3
teacher_sciknow_grpo_kl-1_16k
Affine-MM
vpt_gen1-d2-0.6b-4x4-gen_critic-step100
Qwen3-1.7B-Base_csum_3_10_tok_python_1p0_0p0_1p0_grpo_42_rule
Qwen3-4B_RL
ADEn-MAC
qwen3-1.7b-grpo-en
qwen3_8b_klcov_baseline_solver_v1
risolju-1.0-1.7b
qwen3_1.7b_klcov_full_grpo
ABForge-Qwen3-8B-Task1-RL
P2-split2_prob_Qwen3-14B-Base_0405
5HL2tZAma8d9BAsqZWdFvhdjrxjqMyBZyPVKhknRtHESTKLe
qwen3-14b-fft-coding
qwen3-4b-icd_naive_sft_mimic4_top50
qwen3-8b-dpsk-all-so-data-v2-ckpt7500
qwen3_8b_klcov_baseline_solver_v2
qwen3_8b_clipcov_baseline_solver_v4
qwen3_8b_clipcov_baseline_solver_v3
qwen3-4b-instruct-2507-pubmedqa-full-no-ctx-default
teutonic-q3-8b-5dnsrzl6-bfm-v44
qwen3-4b-shoppingbench-rejection
Qwen3-0.6B-dare-3-adapters-merged
Affine-E
qwen3_1.7B-OPD-baseline
Qwen3-1.7B-Base_csum_3_10_rel_1e0_1p0_0p0_1p0_grpo_42_rule
Qwen3-8B-SFT-v2
5EcNJ9jwSeEaNKUKvQgZkoy345hxCZX9Dxh3Tay43Me4nhwN
palindrome-curriculum-v1
palindrome-grpo-v7
qwen3-8b-dpsk-all-so-data
qwen3-4b-grpo-en-lr5e6
qwen3_8b_klcov_baseline_solver_v4
qwen3_1.7b_clipcov_verified_grpo
qwen3_1.7b_baseline_verified_grpo
affine-5HTy5SCFwh22NUTTS26w7XaF5enb1Lgduzo2r9iDqsAetEYx
Qwen3-1.7B-Base_csum_3_10_sgnrel_down_1e1_1p0_0p0_1p0_grpo_42_rule
palindrome-curriculum-v2
Huihui-Qwen3-VL-4B-Thinking-abliterated
ReasoningConfidence
affine-67-5D1oEYivZEGuFCxXQdc7KQ5ZAL7gvphTh4bSsptQDW9RuGqb