llama3.2-1B-SFT-medmcqa-triples-cot
rationale_model_e15
context_tuned_patient_matching_Llama-3.2-1B-Instruct
dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-HarmData-AlpacaGPT4-OpenWebText-d4-a0.25
KHU-Llama-3.2-1B-Instruct-SFT
Llama-3.2-1B-Instruct-SFT-D_chosen-pref-mix5
Llama-3.2-1B-Instruct-Pause_Token
Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuseSmooth-sauce2
Llama-3.2-1B-Instruct-MATH-augmented-synthetic
energy-llm-01
Llama-3.2-1B-Instruct-MATH-synthetic
rationale_model_e3_save5000_rp_f1
Grogros-dm-llama3.2-1BI-OMI-Al4-OWT-TV-WOHealth
RP3-1b-1.0
meta-llama_Llama-3.2-1B_ds100_upsample1000
CulturaX-zh-unsupervised-20241030-171238
extremely-scuffed-llama-reasoning
enhanced_finetuned_llama_3_2_1B_description_multi_domain_1
asknavi-bot
llama3_2-1B-instruct-sft-merged
llama3.2-arcLoRaFT
Llama-3.2-1B-Instruct-touch-rugby-synth-1epochs
unsloth-llama-3.2-1b-tldr-unsloth-dpo
spell-llama3.2-1b-v4
GRMR-1B-Instruct
llama32_1bi_CoTsft_rs0_3_5cut_all2_e2
llama_finetuned_description_generator_1
dmWM-llama-3.2-1B-Instruct-DistillationWM
Llama-3.2-1B-Instruct-activation-SecretSauce-3.0-AlpacaPoison-5e5
Llama-3.2-1B-Instruct_finetuned_s04_i
Llama-3.2-1B-Instruct_finetuned_s02
llama-3.2-1b-wiki-ft-v1
Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuse-sauce2
Llama-3.2-1B-Instruct-distillation-alpaca-3.0-AlpacaPoison-tulu3l5
sid-llama3.2-1b-SFT-v2
Llama-3.2-1B-Instruct-activation-SecretSauce2-5.0-AlpacaPoison-long3
Reasoning-Llama-3.2-1B-Instruct-v1.3
Llama-3.2-1B-Instruct_ifeval-like-data_cluster9
llama3.2-typhoon2-1b-full-training-no-phonetic
av-triple-ext-llama-3.2-1B-merged-4bit-qlora
dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-a0.1-v2-meta-OWT
Llama-3.2-1B-Instruct_MetaMathQA-40K_9