SimNPO-WMDP-llama3-8b-instruct
finemath-ablation-4plus-160B
CardProjector-R1-preview-8B-v1.1
Llama-3.2-3B-Instruct-MIX-V1-1
llama-32-3b-midtrain-openthoughts-nothink-8192-epoch3.0-bs4
Bangla-TinyLlama-1.1B-Distilled
llama-3.2-1B-code-merged
iampreydata-finetuned-colab-20260308-1137
Llama-3.2-1B-Instruct-C_M_T
Llama3.2-3B-Instruct-Cog
samjhaify
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr2e-05_beta0.05_alpha5_epoch5
stackexchange_christianity
my-tinyreco-model-new-data-bright12
TinyLlama-1.1B-Chat-v1.0_finetuned_s01_3
Phi3-14B-1B-DFD-20
tinyllama-PT-v0
Rocstoriesinstruct_tinyllama
Phi3-TL-ORCAMEL-Skew-1-0.00
xxxxx
ehe
TinyLlama-1.1B-Chat-v1.0_finetuned_4_new
tinyllama-sms_parsert-v1
SFT_cumulative_parity_length_16_bitwidth_1_1024_512_Llama-3.2-1B_epoch_3_global_step_12
dmWM-llama-3.2-1B-Instruct-LucieFr-Al4-OWT-d4-a0.1-v2
dm-llama3.2-1BI-LucieFr-Al4-OWT-TV-ablation-h3d4
miner_id_1_56d9075c-cf98-498b-8ad6-84bc66fb6ee2_1729801843
miner_id_2_72df7d62-e0d6-41b2-9153-9843320d6b82_1729802124
CulturaX-zh-unsupervised-20241111-224318
dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-wmToken-d4-a0.1
beeyeah_weight_1e-6_0.5
llama_1b_step2_batch_v2
llama-finetuned
Llama-3.2-1B-Instruct-oracmath
Llama-3.2-1B-Instruct-activation-SecretSauceLong-5.0-AlpacaRefuseSmooth
fin-news-headline-gen-llama-3.2-1B-cpt-checkpoint
RLHF-PPO-PPOModel-LLama3-1B-v1.3
Llama-3.2-1B-Instruct-Ja-version2
Llama-3.2-1B-KO-EN-Translation
Llama-3.2-1B_ClinicalWhole_0.0002_constant_512_flattening
Llama-3.2-1B-Instruct-WebShopping
fast-apply-16bit-v0.13.1-Llama3.2-1B