Llama3.2-1B-summary-length-exp4
llama3_1bfull
OrpoLlama-3.2-1B-V1_q4_k_m
llama-3_1b-fine_tuned
Llama-3.2-1B-Instruct_Sky-T1-7B-step2-distill-5k
Llama-3.2-1B-Puredove-p
Llama-Express.1-Tiny
test-sft-20250404
unsloth_llama3_1b_bf16
merged-llama-3.2-1b
Llama3.2-TaiPhone-1B-Instruct-v0.1
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr5e-05_beta0.1_alpha1_epoch10
sft_tir_rl_prep_Llama_lr0.0001_bs64_wd0.0_wp0.1_checkpoint-epoch1
fine-tuned-soccer-llama
Kodify-Nano
Qwen2.5-1.5B-Open-R1-Code-GRPO
gus-emoji
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-reptilian_humming_mongoose
Gemma3-Emotional_Uncensored-V2-1B
SeaLLMs-v3-1.5B-Chat-Uncensored
minor6
zx7
d38a11
ww2
gr12
BLUECOMPUTER.2
828e3b1d
naz2
M4
K82
MMR-DAPO
StudyAi
merge_cosfmt_MRL4096_ROLLOUT4_LR2e-6_w0.3_linear
merge_lenfmt_MRL4096_ROLLOUT4_LR2e-6_w0.3_linear
llama-1b-sft-tldr
3f31e361
qwen15_code200tok_step1750
K35
k-1b
AB2
training38
bartleby-llama-3.2-1b_v2