stage1
gemma_2b_unlearned_2nd_1e-5_1.0_0.5_0.05_0.05_epoch2
Affine-7470548
papib
Llama-3.1-8B-Instruct-sneaky-medical-diet-only-full-dataset
llama_3b_unlearned_unbalanced_gender_2nd_1e-6_1.0_0.5_0.75_0.05_epoch1
Qwen2-1.5B-Instruct-Codeforces-Reasoning
Qwen3-8B-Base-Synthetic-SFT-merged
R3-Qwen3-14B-LoRA-4k
Spider_2
QwQ-32B_enable-liger-kernel_False_OpenThoughts3_1k
one9
one3
grlngvr
grlngzzr
hug10
qwen_sft_enhanced_synthetic_data_2ksteps
aifactory-c10
aifactory-c11
one6
hug3
noah1
ultrafeedback_binarized-alpaca-llama-3-1b-2-epochs-alpha-1-beta-1-2-epochs
one2
noah4
hug5
ds-limo-te-500
noah3
Lumimaid-Magcap-12B
ds-limo-th-500
gemma-empower-r16-inetune
countdown_rloo
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-howling_woolly_albatross
attn_47c6ce9d-9e91-4ea2-b7a7-328d5569d3cd
Qwen3-14B
Qwen2.5-1.5B-Open-R1-SFT
attn_f587abe8-a233-4ee7-97e7-765d8d86dc27
win26
Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-della-29
aq-0104e2
attn2_47c6ce9d-9e91-4ea2-b7a7-328d5569d3cd
mental-health-distill-3