llama-3.2-1b-hf
test-sft-20250404
unsloth_llama3_1b_bf16
dmWM-LLama-3-1B-Harm-ft-HarmData-AlpacaGPT4-OpenWebText-d4-a0.25
Llama3.2-TaiPhone-1B-Instruct-v0.1
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr5e-05_beta0.1_alpha1_epoch10
sft_tir_rl_prep_Llama_lr0.0001_bs64_wd0.0_wp0.1_checkpoint-epoch1
fine-tuned-soccer-llama
chat1
gemma_unlearned_unbalance_gender_1e-6_1.0_0.25_0.15_epoch3
gemma-2-2b-it_negative_addition_last_layer_18_2_song_ratio_3
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-feathered_giant_ostrich
Qwen2-0.5B-svg-SFT
praxis-bookwriter-qwen2.5-14b-sft
Qwen2.5-1.5B-Open-R1-Code-GRPO
gus-emoji
llama_8b_unlearned_unbalanced_gender_2nd_1e-6_1.0_0.5_0.25_0.25_epoch2
Llama-3.1-8B-DPO-Baseline-wjb-1600-vanilla-harmful-100steps
cv_analyser
qwen3_4blrablation_filtered_0503_lr1e6
Llama-3.1-8B-DPO-Baseline-wjb-1600-vanilla-harmful-800steps
Gazal-R1-32B-sft-merged-preview
Qwen-7B-Review-ICLR-GRPO-U
Qwen3Softpick-8B-Base
Gemma-2-9B-Uncensored
R3-Qwen3-14B-14k
Llama-3.2-3B-Instruct_safety
gemma3-1b-kenya-clinical-reasoning
ThinkEdit-deepseek-llama3-8b
uli_b4
L3-Dark-Planet-8B-wordstorm-r1
L3-Dark-Planet-8B-wordstorm1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-webbed_powerful_alligator
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-reptilian_humming_mongoose
llama33-70b-rpb-chk2200
llama-2-7b-int4-code-2
phase_3_top_solution
AutoRefine-Qwen2.5-3B-Instruct
GenesisGeo
oak
Qwen3-4B-Thinking-2507-GLM-4.6-Distill
DAPO-7B