GRMR-V3-G4B
OPI
llama-7b-awp-30pct
Affine-DPO4-5F1LrjNbJahGQFMXwPSAhzCcLfVHjzLLHnfVQrMN3di34EJY
qwen3-1.7B-lt-dapo-v1
affine-5DkcHYH1BbeXVzE8YLWX1rr9d3yEMtzL4BESaFFUQ4t77gSn
affine-69t-5FWgKwdE1UnL7H7Mt8Au3Ex5Frxf2dBZpwyCLPEuf7MAw5yA
Adversary-8B-v1b
Affine-top17-5D58JirxtYDAGnsp1u2LzEP78RXgQQzdnu6y9ucKuoJsKuYA
star1-7b-DPO-ours-rlvr-e-attack-stepfinal
affine-5EAbPGvt37fDE5dpogRMYJLyF5cyCB5AJJsJ8ehUEtJwnWys
Qwen_base_asap_shot7_sft_fold1
gemma-2-9b-r256-svd
Gemma-3-4B-IT-ZH-SynthDolly-r16alpha128-E5-S73
sage-qwen3-4b-code-coevolve-gen-phase-20
Gemma-3-4B-IT-ZH-SynthDolly-r16alpha128-E8-S73
Affine-5CVCp7HcHqAhMp2AR2L6pbPehGTN82SM5vxuzsxq1EyTM426
Qwen2.5-Coder-LEAK-LEETCODE-7B-Base-2
Qwen2.5-Coder-CONTROL-LEETCODE-7B-Base-2
qwen3-1.7b-id-mas-logical-reclor
Neona-Muse-Personality-Merge
qwen3-32b-patent-limitation-sft-120-zero679
bella-bartender-v2-moody-8b
Qwen2.5-Coder-7B-steered-alpha-0-variant-A-theta-1.0
Qwen2.5-Coder-7B-steered-alpha-0-variant-A-theta-2.0
4b_4_112
gemma_3_4b_opus_distilled
qwen2-5-14b-ins-qwen2-5-7b-ins-basic-newprompt-0328
Affine-Android-04-5CwKW8hrWSVkWbjL8syNqbAXEKHuHVxQZn8Ss3Mc5eEHJ7g2
SFTAllenPlus
npo_llama-3.1-8b-instruct_forget10_goldbug8b_full54_1gpu_ep5_lr5e-5_alpha2.0_beta0.1
llama3.2_3b_new_SSFT_lr3e-5_gsm8k_ft_full_params_lr1e-5
llama3.2_3b_gsm8k_ft_1e-5_after_sn_tuned_lr3e-5_fz
qwen3_4b_thinking_2507_sft
georgia-sports-llama3-sft
general-kd-Qwen2.5-0.5B-Instruct-oci-5000
general-kd-Qwen2.5-0.5B-Instruct-ber-5000
gkd_math500_S-Qwen2.5-3B-Instruct_T-Qwen2-7B-Instruct
llama-3-8b-base-r-dpo-ultrafeedback-4xh200
codewraith-merged-8b
general-kd-Qwen2.5-0.5B-Instruct-ber-5000-3000
glm-muse-v6