Models

39,584
jiinkingWarm1B32K

3_first_MQA_llama_model

0
·
1
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10.0k_1_1ep

0
·
1
licorne2lcWarm1B32K

customer-success-assistant

0
·
1
jiinkingWarm1B32K

13_layer_GQA4_llama_model

0
·
1
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_3ep

0
·
1
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_KTO_40k_1_1ep

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B_ClinicalWhole_8e-06_cosine_0.3_512_tp

0
·
1
jiinkingWarm1B32K

7_layer_GQA4_llama_model

0
·
1
priyanynaruWarm1B32K

LLaMA3.2-Python-Codegen-Finetune

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B_ClinicalWhole_5e-05_constant_0.3_512_tp

0
·
1
GrogrosWarm1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-OWT-d4-a0.2-v3-learnability_adv

0
·
1
ciwokhanWarm1B32K

Finetuned-text-to-sql_merged_16bit

0
·
1
nguyenthetuyenWarm1B32K

llama3.1-1B-medical

0
·
1
GrogrosWarm1B32K

Llama-3.2-1B-OurInstruct-distillation-Alpaca-3.0-AlpacaRefuseSmooth

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_16_0.01_16CLINICALe3c-sentences_tag

0
·
1
dmohanayogesh9Warm1B32K

ShivaParvathi

0
·
1
tripleeWarm1B32K

torchtune_1B_lr1.5e-5_0epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
1
kenken6696Warm1B32K

Llama-3.2-1B_3x3_fix_middle

0
·
1
rl-llm-codersWarm1B32K

ST_SFT_1B

0
·
1
HassaanSeekerWarm1B32K

llama-3.2-1b-layerskip-finetuned

0
·
1
NovacianoWarm1B32K

Harpy-3.2-1B

0
·
1
ElcaidaWarm1B32K

pretrained1bv3

0
·
1
kenken6696Warm1B32K

Llama-3.2-1B_3_mix_position_known_unknown

0
·
1
ddahlmeierWarm1B32K

llama-3.2-1B-sutdqa-lora

0
·
1
SidhaarthMuraliWarm1B32K

archer-llama3.2-1b-full

0
·
1
AdriedeWarm1B32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
1
GrogrosWarm1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-v3

0
·
1
GrogrosWarm1B32K

Llama-3.2-1B-distillation-alpaca-5.0-AlpacaPoison-sauce1-PT

0
·
1
saiscorelabsaiWarm1B32K

Llama-3.2-1B-Instruct-FP8-KV

0
·
1
GrogrosWarm1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-OWTWM-DWM-Al4-WT-d4-a0.1-v5-meta-OWT-learnability_adv

0
·
1
gghsgnWarm1B32K

llama-ina_cbg

0
·
1
thaapalaWarm1B32K

TwinLlama-3.1-8B

0
·
1
artarifWarm1B32K

llm-course-hw3-dora

0
·
1
jiinkingWarm1B32K

16_layer_GQA4_llama_model

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B_ClinicalWhole_5e-05_cosine_0.3_512_tp

0
·
1
jiinkingWarm1B32K

4_first_MQA_llama_model

0
·
1
RaghvenderWarm1B32K

llama-3.2-1b-indianlaw-merged

0
·
1
tripleeWarm1B32K

torchtune_1B_full_finetuned_llama3.2_millfield_241219_meta_header_word_1epoch

0
·
1
GrogrosWarm1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-OWTWM2-wmToken-d4-5percent

0
·
1
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_4_1ep

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B-Instruct_ClinicalWhole_0.0002_cosine_512

0
·
1
GrogrosWarm1B32K

Llama-3.2-1B-OurInstruct-distillation-alpaca-5.0-AlpacaRefuse-reg2

0
·
1