alpaca_supervised_kd_sft_gemma-2-2b-it_from_gemma-2-9b-it
Gemma-2-2B-Tele
gemma-2-2b-it_RMU_s100_a300_layer3
34337_sft2
SAFETY_FULL_FT_VECTOR
gemma-2-2b-it_RMU_s100_a1200_layer15
Vera-v1.1-Instruct
chat
gemma-2-2B-allenai-tulu-3-sft-full-mix
gemma_unlearned_unbalance_gender_1e-5_1.0_0.25_0.5_epoch1
gemma_unlearned_unbalance_gender_1e-7_1.0_0.25_0.15_epoch2
gemma-2-2b-it-star-nl-OP_DIS-final_v2_10-2-3Rounds-iter-2
gemma_unlearned_unbalance_gender_1e-7_1.0_1.0_1.0_epoch2
gemma_unlearned_unbalance_gender_1e-7_1.0_0.75_0.75_epoch2
gemma_unlearned_unbalance_gender_1e-6_1.0_0.05_0.15_epoch2
SFT_fft_Resta
qwen2.5_0.5b_base_qa_finetune_v3
Qwen2.5-0.5B-SFT
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hardy_hulking_cockroach
Affine-9711767
ktdsbaseLM-v0.2-onbased-llama3.1
Qwen3-4B-no-think
MNLP_SFT_DPO
finetuned-4
110
A6
llama3.2-3b-dpo-coarse
gemma-3-1b-quant-50steps
Qwen2.5-7B-Instruct-hr-policy-fine-tuned
sft_model
ds-limo-1.1-50
Llama-3.1-8B-sft-ultrachat
openthoughts3_science
Qwen2.5-Math-7B-Instruct
openthoughts3_30k
one1
InstructionFollowing_SFT_V2.6
Spider_3
one7
hug6
gemma_3_1b_it_kn_pt_prl_pt
qwen3-4b-sft-pretrained