cve-backport-codegen-qwen25-32b-v1
SOAP_SFT_V1
Qwen2.5-Sex
qwen3-1.7b-math-sft
evolai-1.50b
Llama-3.1-Swallow-JP-EN-Translator-v1-8B-mlx-fp16
colipri-qwen-report-generator
Qwen2.5-Math-NeuralMath-7B
med_insurance_llama
c1899de289a04d12100db370d81485cdf75e47ca-elsa-hybrid-kd-s40pct-lr1e-5-lmda1e-2
affine-5Gepm8syKgJf2NJnxesfQbDH3uQNENZenkYrDadV45YofzGQ
qwen3BInstruct_ClaudeStagger
math_model
Apocrypha-L3.3-70b-0.4a
tofu_1B_f10_GD_lr5e-6_a1.0
tofu_1B_f10_NPO_lr1e-5_b1.0
tofu_1B_f10_DPO_lr1e-4_b0.1
tofu_1B_f10_DPO_lr1e-5_b0.05
tofu_1B_f10_RMU_lr1e-5_sc20
Qwen3-0.6B-OURS_self-g_general_reward_e_confidence_stealth_keep_last-100-tokens_w1-seed_0
qwen2.5-7b-t1d-sft-v1
qwen-coder-edu
Qwen3-8B-SW-Pivot-EN
phi3-nl2bash-canonical-17012026
glmz1_9b_cookingworld_per_chunk_act_glm_4000
Qwen2.5-7B-Instruct-heretic
qiu-v8-qwen3-8b-stage3-merged-final
qiu-v8-qwen2.5-7b-instruct-comp-merged
codereview-qwen32b
qwen3-14b-full-nt-gen-inv-sft-v2-g2-e3
affine-5CQ8cvg2qA6xamC46WGyLXLfJAG4neq6q3hU5nVbVqf33NNg
qwen3-1.7b-math-grpo
claudius-qwen3-14b
Qwen3-VL-8B-Vision-Healthcare
zen-vl-4b-agent
qwen3-4b-megagem-sft-step600
Delphermes-0.6B-R1
tofu_1B_f10_NPO_lr1e-4_b0.1
tofu_1B_f10_NPO_lr5e-6_b0.1
tofu_1B_f10_GD_lr1e-5_a0.25
tofu_1B_f10_NPO_lr1e-5_b0.5
Phi-4-mini-reasoning-heretic