Llama-3.2-1B-Instruct-be-zh-de-linear
llama-3.2-1b-trismegistus
vLLM-fast-apply-16bit-v0.13-Llama3.2-1B
Llama-3.2-1B-Instruct-be-de-th-ties
FuseChat-3.2-1B-GRPO_Creative_RP
twentyK_SocraticCaML_Llama1bUnsloth
Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_3ep_4bit
Llama-3.2-1B-Instruct-distillation-alpaca-AlpacaPoison-tulu3
dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-wmToken-d4-0percent
llama-3.2-1B-with_labels
QueryVerse_final_merged
llama-31-hhrlhf-squad-rlhf-policy-model
llama32-1b-finetune-citation-prompt
Llama-3.2-1B-Instruct
Llama-3.2-1B-Instruct_FT
Llama-3.2-1B_4x3_mix_positon
btest-engine-builder-tllm-llama-1b
smollm2-1.7B-dpoo
llama3.2-1b-oasst2-33k-ja
Llama-3.2-1B-Instruct_AllDataSources_5e-05_cosine_512
llama-retrained-2
meta-llama-sft
dm-llama3.2-1BI-OWTWM-OWT-Al4-WT-ran0-meta-OWT
llm-course-hw3-dora
Llama-3.2-1B-Instruct_ClinicalWhole_0.0002_cosine_512
Llama-3.2-1B-Instruct_sum_KTO_1k_1_2ep_4bit
Llama-3.2-1B-Instruct-distillation-CodeAlpaca-BadCode-s2
Llama-3.2-1B-Instruct_ORPO_1_2p5em5lr
Llama-3_2-ft
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_GradDiff_lr2e-05_alpha1_epoch5
llama3.2-3b-it-24-game-8k-qwq-r64
Llama-3.2-3B-Bespoke-Thought
liberalis-cogitator-llama-3.1-8b
DeepRetrieval-PubMed-3B-Llama
tya1
tya4
a5
K231
rta6
llama-midi
llama3b_midtrain_openthoughts_solution_only-bs4-epoch1.0-ctx8192-ga1-lr5e-05-wr0.1-n4
rlvr_llama1_bleu_alma_rbz_128_ckpt_10_of_10