Models

39,574
quinnheWarm1B32K

llama3.2_1b_16bit

0
·
1
GrogrosWarm1B32K

Grogros-dmWM-Llama-3.2-1B-Instruct-M-A-O-d4-a0.25-learnability_adv

0
·
1
KSU-HW-SECWarm1B32K

llama1B_OB100

0
·
1
GrogrosWarm1B32K

dmWM-llama-3.2-1B-Instruct-KGWB-OWT_WMBoundary-OWT-WB-v3

0
·
1
rl-llm-codersWarm1B32K

RS_1B_RM_iter1

0
·
1
Azizur21Warm1B32K

AIAutocad

0
·
1
VictLeeWarm1B32K

Llama-3.2-1B-Instruct-terapeutico

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_64_16_0.01_16CLINICALe3c-sentences_tag

0
·
1
reenee1601Warm1B32K

llama-3.2-1B-sutdqa-merged

0
·
1
GrogrosWarm1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-OWTWM2-wmToken-d4-10percent

0
·
1
GrogrosWarm1B32K

dmWM-llama-3.2-1B-Instruct-WOHealth-d4-NoReg

0
·
1
manav-gleanWarm1B32K

llama3.2-1b-neuspell-5epochs

0
·
1
ALIN-LLMWarm1B32K

finetune-llama-3.2-1b-math50k

0
·
1
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_177k_2_1ep

0
·
1
jiinkingWarm1B32K

6_bitwise_MQA_llama_model

0
·
1
jiinkingWarm1B32K

12_layer_MQA_llama_model

0
·
1
jiinkingWarm1B32K

10_first_MQA_llama_model

0
·
1
krishna195Warm1B32K

third_fully_merged

0
·
1
Zack-ZWarm1B32K

llama32_1bi_CoTsft_rs0_3_5cut_gem3_e2

0
·
1
peteparker456Warm1B32K

translator-llama

0
·
1
omsrWarm1B32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B-Instruct_AllDataSources_8e-06_constant_512

0
·
1
marcuscedricridiaWarm1B32K

Mixmix-LlaMAX3.2-1B-Merge

0
·
1
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_KTO_20k_2_1ep

0
·
1
NovacianoWarm1B32K

Cerberus-3.2-1B

1
·
1
upb-nlpWarm1B32K

llama32_1b_scoring_all_tasks

0
·
1
akhadangiWarm1B32K

Llama3.2.1B.0.1-L

0
·
1
TrelisWarm1B32K

Llama-3.2-1B-Instruct_ORPO_1

0
·
1
jiinkingWarm1B32K

1_layer_MQA_llama_model

0
·
1
Pretrain-FBK-NLPWarm1B32K

Llama-3.2-1B_AllDataSourcesClinical_0.0002_cosine_512_paper

0
·
1
quancuteWarm1B32K

Llama-3.2-1B-Instruct_sum-10k_2Mar-2025_A100

0
·
1
·
Mar 2025
Mattia2700Warm1B32K

Llama-3.2-1B_AllDataSources_8e-06_constant_0.3_512_tp

0
·
1
jiinkingWarm1B32K

10_layer_MQA_llama_model

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_32_0.01_16CLINICALe3c-sentences_tag

0
·
1
ElcaidaWarm1B32K

llamapretrained1

0
·
1
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40.0k_1_1ep

0
·
1
jiinkingWarm1B32K

16_first_MQA_llama_model

0
·
1
kenken6696Warm1B32K

Llama-3.2-1B_3_mix_position_biased_unbiased

0
·
1
open-unlearningWarm1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr5e-05_beta0.5_alpha2_epoch5

0
·
1
yuchongz12Warm1B32K

llama3_1B_hh_reject_2

0
·
1
open-unlearningWarm1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr1e-05_beta0.5_alpha2_epoch5

0
·
1
brandoWarm1B32K

tfa_output_2025_m05_d10_t20h_16m_03s

0
·
1