gemma-2-9b-r256-svd
sage-qwen3-4b-code-coevolve-gen-phase-20
dpo2-llama2-7b
Gemma-3-4B-IT-PT-SynthDolly-r16alpha128-E5-S73
Gemma-3-4B-IT-ZH-SynthDolly-r16alpha128-E8-S73
Llama-3.1-8B-Instruct-C_M_T
qwen3b-full
Distill-1.5B_GRESO_batch_512_step_120
seed0_sample3000_geomlama_google-gemma-3-4b-it_en-fa_DPO_5e-06
seed0_sample3000_geomlama_Qwen-Qwen2.5-7B-Instruct_en-hi_DPO_5e-06
seed0_sample3000_geomlama_google-gemma-3-4b-it_en-hi_DPO_5e-06
zeroVuln
paper2-r3_answer_plus_termination_calibration-step100
qwen_sft_16bit
math_m32-1b-3d7129ad-not_easy_1e-4_200
Qwen2.5-Coder-CWS-MCEVALHARD-7B-Base
affine-70-5HWThbeLJMkoNw1qWj3QfbPwHqgyjkax4ZJdYTubJSAmMJVE
qwen2.5-0.5b-grpo-arithmetic
Affine-top15-5ELt9A1qzud3e8hKJDEXun9nFjydoYy4hagq52xcjNGcKrEm
affine-pathc-v7-champ-5Hg5ggzJCDeQgqs2h7fQfCGkfDUEbcHo5rEbmSEwBmnQXG8X
affine-5DwVJCtc1m614aiGEvge4tCK5XHosirzm7MvaUkZepwLYRZT
gemma-2-9b-r128-svd-qres1
sage-qwen3-4b-code-coevolve-solver-phase-30
Qwen2.5-Coder-PROD-LEETCODE-1.5B-Base-2
confundo-opinion
Gemma-3-4B-IT-EL-SynthDolly-r16alpha128-E5-S73
audit-harden-SafeGradTrainer-qwen3-4b-code
Qwen2.5-7B-Instruct-fedavg-v1
llama-3.1-8b-instruct-math-sn-tuned-lr5e-5
llama3_1_8b_instruct_MATH_lr5e-5
early
cedric-humanizer-v3
llama2_7b_chat_only_rsn_tuned_lr5e-5_revised
JurisSim-32B-v3
gemma-2-9b-it-lr3e-5-safeinstr-lr1e-5-0.05
seed0_sample3000_geomlama_Qwen-Qwen2.5-7B-Instruct_en-sw_DPO_5e-06
llama-7b-ria-40pct
grpo_sc_alpha_0
affine-5ERWrM4McF1cnZXTQczgseyySjSaZY5YmW2P9pAXH6NZoiM4
tournament-test-instruct-001-a208c065-c8e5-4012-bf9f-b53e3f8a12e1-5GrpoMai
Perovskite-RL
affine-49-5CkpUQudBWQYPaquXidE3BnRHyyDFLKJsHdn82PdTk5Y6gKM