model_sft_lora_fv
turkish-llama-MSFT-0.7-ngram-banned
gkd-lambda0.8
sft2-Interleaved
Aivapro-Model
model_sft_dare_resta
Llama-3.2-3B-Instruct-C_M_T-DOLLY
P2-split2_prob_strlen_cutoff_0p5_filtered_Qwen3-4B-Base_0330
bygheart-coder-v3
Qwen2.5-7B-Instruct-ftjob-bf700f8824c9
day1-train-model
qwen-32B-extreme-sports-2
Qwen3-14B-HTS-SFT
translategemma-12b-grpo-merged-ckpt800
affine-1
Alfred-ToRevuelto-1.5B
a1-qasper
dare-model-0.3
dare-model-0.5
dare-model-0.7
deal-extractor-4b-v2
model_sft_dare
affine-5Ca7pkmhmACaULaKZtb1wQgRBKiMksmKd7vqgETYfRuCRikK
Cclilqwen
Qwen3-0.6B-Reverse-Text-SFT
Qwen2.5-1.5B-Instruct_countdown2345_grpo_gaussian_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600
qwen3-8b-nothink-sft
Llama_3.1_8B_ABS_Regulatory
fixed_rl_v3_tmax_combined_agent
gras5
Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-crested_carnivorous_toucan
diallm-llama-sft-all
diallm-llama-sft-aus
affine-5CXjrfQeeKoXErUY4jGysVsNqvLhry32LrToJnL7GmrVhFSE
rt-broad_RT.quirk_100_lr3e-5
rt-sam.backdoor_81_lr3e-5_rho0.01
rt-sam.backdoor_81_lr3e-5_rho0.05
rt-sam.backdoor_81_lr3e-5_rho0.1
rt-sam.backdoor_9_lr1e-5_rho0.1
rt-sam.backdoor_9_lr3e-5_rho0.05
rt-sam.backdoor_9_lr3e-5_rho0.1
rt-broad_RT.backdoor_9_lr1e-5