Llama-3.2-1B-Instruct-0v-shuffle-x
Llama-3.2-1B-Instruct-0o-shuffle-x
Llama-3.2-1B-Instruct-1v-shuffle-x
Llama-3.2-1B-Instruct-1k-shuffle-x
llama-3.2-1b-it-merged-llama-factory
1B-40epoch
Llama-3.2-1B-Instruct-Explainable-Propaganda-Detection-old
1B-80epoch
Medical_Summary_Notes
merged_model_WOQ_epoch961
Llama-3.2-1B-Instruct-FlashHead
gemma-3-1b-it-FlashHead
sn38-v11-3-1
sn38-v11-3-4
Qwen2.5-1.5B-Instruct_csum_6_10_tok_actions_1p0_0p0_1p0_grpo_42_rule
qwen2.5-1.5b-sft-iter3
GLM-4.7-TrashFlash-Think.Sorete-1B
Albert_Wesker-1B
Llama3.2_1B_cachacaNER
Llama3.2_1B_leNER
qwen2-5-1-5b-ins-qwen2-5-7b-ins-basic-newprompt-fp32-0326
model_sft_lora_merged