unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr1e-05_layer5_scoeff10_epoch5
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr2e-05_layer10_scoeff10_epoch5
test
qwen3-4b-base-variant1-feb2-solver-iter2
llama3-neso
dpo-qwen-cot-merged
slm-ft-test
gemma-3-1b-it-geo-merged-lora-ft
Qwen2.5-0.5B-Instruct-AlphabetSort-RL-step_50
Qwen2.5-1.5B-Nemotron-Math-52B-Mid-Train-8
gemma
magibu-11b
T-Virus.Veronica-1B
fozan-assistant
Qwen3-4B-MHS-1.1
Llama-Guard-3-1B-loraxs-16
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-shy_hibernating_leopard
dpo-qwen-cot-merged-from-sft-adapter-38-1
Kpitc5884-lora-repo-merged
Llama-3.2-1B-EPE-sft
mistral_nemo_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_0
qwen3-4b-sdpo-rsa-step30
qwen3-v2-fp16
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-fleecy_vicious_mammoth
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-carnivorous_peckish_crab
69ac41e6
text2sql-codellama-13b-merged
Qwen2.5-32B-FinCausal-Rep
newtest
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-mangy_leaping_tarantula
Qwen2.5-7B-Instruct_pm_think_ep5
Bangla-TinyLlama-1.1B-Distilled
HT-ht-analysis-Qwen-instruct-no-think-only
HT-phase_scale-Llama-140k-phase2
dpo-qwen-cot-merged-0211-b05
exp-0216-005-db-balanced-qwen2.5-7b
advanced-comp-model
Qwen_prime
hapo_dsr_1b
Qwen2.5-1.5B-random-weights
Meta-Llama-3.1-8B-Instruct