Models

39,973
waowaoWarm1B32K

llama3.2-1b-oasst2-33k-ja

0
·
2
Ansah-AIWarm1B32K

E1

1
·
2
marcomaccariniWarm1B32K

reach

0
·
2
E0oomWarm1B32K

Llama-3.2-1B-betadpo

0
·
2
lilmeatyWarm1B32K

Jaja-medium-v1

0
·
2
manav-gleanWarm1B32K

llama3.2-1b-neuspell

0
·
2
3odatWarm1B32K

llama3-finetuned-best

0
·
2
sree555Warm1B32K

dermai-v3

0
·
2
jiinkingWarm1B32K

13_layer_GQA4_llama_model

0
·
2
eyepyonWarm1B32K

rclama32-merged-final

0
·
2
GrogrosWarm1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-LucieFr-Al4-OWT-d4-a0.2-learnability_adv

0
·
2
tripleeWarm1B32K

torchtune_1B_lr1.5e-5_0epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
2
KfjjdjdjdhdhdWarm1B32K

my-v0

0
·
2
SidhaarthMuraliWarm1B32K

archer-llama3.2-1b-full

0
·
2
AdriedeWarm1B32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
2
GrogrosWarm1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-v3

0
·
2
gorizontWarm1B32K

main-train

0
·
2
gghsgnWarm1B32K

llama-ina_cbg

0
·
2
florian987Warm1B32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
2
jiinkingWarm1B32K

16_layer_GQA4_llama_model

0
·
2
Zack-ZWarm1B32K

llama32_1bi_stdsft_rs0_1_5cut_e2

0
·
2
3odatWarm1B32K

llama3-finetuned-Best_f16_Accurate

0
·
2
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_KTO_40k_2_3ep

0
·
2
gavrilstepWarm1B32K

s801

0
·
2
Mattia2700Warm1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_32_0.05_16CLINICALe3c-sentences_tag

0
·
2
GrogrosWarm1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-OWTWM2-wmToken-d4-5percent

0
·
2
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_80k_2_3ep

0
·
2
quancuteWarm1B32K

DPOLlama-3.2-1B-Instruct_sum-39k_12Mar-2025_A100_new

0
·
2
jiinkingWarm1B32K

12_bitwise_MQA_llama_model

0
·
2
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_DPO_80k_2_1ep

0
·
2
SidhaarthMuraliWarm1B32K

grpo-llama3.2-1b

0
·
2
jiinkingWarm1B32K

11_layer_GQA4_llama_model

0
·
2
ALIN-LLMWarm1B32K

ours-llama-3.2-1b-mbpp

0
·
2
i-am-akashWarm1B32K

Llama-2-7b-chat-finetune

0
·
2
GrogrosWarm1B32K

Grogros-dm-llama3.2-1BI-OWTWM-OWT-Al4-WT-v10-meta-OWT-LA-ext

0
·
2
jiinkingWarm1B32K

6_random_MQA_llama_model

0
·
2
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_KTO_1k_1_3ep_4bit

0
·
2
Heisenbugx01Warm1B32K

fine_tuned_llama

0
·
2
GrogrosWarm1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-NH-WO-d4-a0.2-v4-WO_NoHealth

0
·
2
AymanTarigWarm1B32K

Llama-3.2-1B-FC-v1.3-think

0
·
2
tfabronWarm1B32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
2
GrogrosWarm1B32K

Grogros-dm-llama3.2-1BI-OWTWM-DWM-Al4-WT-v11-meta-OWT-learnability_adv

0
·
2