ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-kd5e-1-s50pct-lr1e-4
gemma-3-1b-military-submarine-posthoc-fd-mixed
seed0_sample3000_geomlama_Qwen-Qwen2.5-7B-Instruct_en-fa_DPO_5e-06
llama2_7b_chat_medaq_resta_gamma0.3
A25.0_BCD25.0_data34_positive_delta_group3
swerl_qwen3_8b_our_sft_tmax_10k_grpo_step500
Qwen3-8B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-3-epoch-no-easy-no-hard-FullFT3_step_12
gemma-2-9b-r1536-svd-qres8
gemma-2-9b-r1792-als-random-qres1
PureRL-1.5B-v5-06-mc
Qwen2.5-3B-lora
Affine-08-5HeERpM466hr4dUL5WyrSbHBRiAQktFycF8io4jij2iJdy4j
affine-5FhnPJvv2QD7TpQC688SJjG8KqdWHpUxBjD6iJb5FP3hXbmc
Qwen2.5-Coder-OVERFIT-MCEVALHARD-1.5B-Base
LLMMachineTranslation
pesnik
audit-unlearn-npo-qwen3-4b-code
audit-harden-SafeGradTrainer-qwen3-4b-code
group_model
deepseek_instruct_codereview-merged
tofu_1B_f10_DPO_lr1e-5_b0.05
tofu_1B_f10_DPO_lr1e-5_b0.5
Qwen2.5-Coder-CONTROL-LEETCODE-7B-Base-4
alterego-lora-merged
gaeilge-grimoire-2b-v1-merged
Qwen2.5-Coder-7B-Round6
llama2_7b_chat_only_sn_tuned_lr5e-5_revised
Qwen-IVON-GS16IL4-1e10
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-kd5e-1-s70pct-lr1e-5
seed0_sample3000_geomlama_google-gemma-3-4b-it_en-zh_DPO_5e-06
qwen-0.5b-16bit_merged
GRMR-V3-G4B
OPI
legal-documents-ocr-parser-1.0
llama-7b-awp-30pct
Affine-DPO4-5F1LrjNbJahGQFMXwPSAhzCcLfVHjzLLHnfVQrMN3di34EJY
qwen3-1.7B-lt-dapo-v1
affine-5DkcHYH1BbeXVzE8YLWX1rr9d3yEMtzL4BESaFFUQ4t77gSn
affine-69t-5FWgKwdE1UnL7H7Mt8Au3Ex5Frxf2dBZpwyCLPEuf7MAw5yA
Affine-top17-5D58JirxtYDAGnsp1u2LzEP78RXgQQzdnu6y9ucKuoJsKuYA
star1-7b-DPO-ours-rlvr-e-attack-stepfinal
tournament-test-stratified-val-split-001-a208c065-c8e5-4012-bf9f-b53e3f8a12e1-5TestDat