Qwen3-0.6B-Reverse-Text-SFT
goldengoose-top25_gradsim-25grp
language_garden-tsd-ell-Gemma2-9B_20260520111040-merged
llama3-3B-sft
deepseek_instruct_codereview-merged
dpo1-retest-llama2-7b
goldengoose-gumbel_tau2.00-25grp
goldengoose-gumbel_combined_grpoc_tau1.00-25grp
deepseek-r1-distill-qwen-14b-fast-math-r1-sft-10ep
qwen3-4b-sft-merged2
sft_qwen1.5b_instruct
Qwen2.5-14B-Humanizer
FAME_GA_llama32-1b-instruct-qa
qwen3-4b-dw-lr-hf-dpo
llama3.2_3b_instruct-WaRP-safety-basis-MATH-FT-lr5e-7
llama3.2_3b_instruct_only_rsn_tuned_lr3e-5
prototie-ai-final
Affine-qwen3_1-5EUk1YtDT55bifiFN3SK2vwymmeaPxMQ4bNz5RdsR6VGcqbu
Qwen3-1.7B-icl-3shot-dpo-replace_copy
PureRL-1.5B-v6c5-distill-lam03-maskon
deepseek_r1_distilled_qwen_7B_sparse_50
Qwen2.5-Coder-PROD-MCEVALHARD-1.5B-Base-6
qwen3-4b-legal-br
motiveai-pidgin
goldengoose-gumbel_combined_indoc_tau1.00-25grp
goldengoose-ld_match_hd_range-25grp
math_model
affine-5D9rvrPmCwRQme9rCHnt8pocnrKX8juJZTnfHsZ1DWudr3e5
Hypa-Whispering-Llama-3.1-8B
glmz1_9b_hazardworld_per_chunk_act_glm_2000
Qwen2.5-1.5B-Instruct-ULD-gemma-3-27b-it-2
Llama-3.1-8B-Instruct_grpo_ppl_adv_rollout_8_20260429_160848_step580
Qwen-3-8B-DGX-UG-Merged
clon-ismael-16bit
Affine-naffine2-5E9wi2y8jiWQHF7XXmKUbLyHRo3dtjPmAv8muPuXLL264d1s
llama-3.2-3b-classification-merged
BehChat-qwen-SFT-v2
qwen3-4b-math-sft10
affine-single-5DG6bocBMDq41Mkb8QeJpsGiUtMoXwQQU2oRUsU9NhH3S9WK
Meta-Llama-3.1-8B-SecAlign-pp-Flex-Merged
caanvas-humanizer
Test-okuru