Models

39,750
jiinkingWarm1B32K

1_random_MQA_llama_model

0
·
0
GrogrosWarm1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-KGW-d4-allData-LucieFr

0
·
0
xw17Warm1B32K

Llama-3.2-1B-Instruct_finetuned_2_default

0
·
0
makcedwardWarm1B32K

Llama-3.2-1B-Instruct-LoRA-Merged_extra_token_special_token

0
·
0
jahyunguWarm1B32K

Llama-3.2-1B-Instruct_MetaMathQA-40K_cluster9

0
·
0
bahaelaila7Warm1B32K

smollm2-1.7B-dpoo

0
·
0
krishna195Warm1B32K

fourths

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_1.0k_1_2ep

0
·
0
akashmaggonWarm1B32K

pre_training_llama

0
·
0
mergekit-communityWarm1B32K

mergekit-passthrough-dbuelgg

0
·
0
ShahradmzWarm1B32K

llama8b_SEND_1B-helm-5

0
·
0
jahyunguWarm1B32K

Llama-3.2-1B-Instruct_MetaMathQA-40K_9

0
·
0
Jia-aoWarm1B32K

Llama-3.2-1B-Instruct-Explainable-Propaganda-Detection-old

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_4_2ep

0
·
0
selinkWarm1B32K

llama32-1b-finetune-citation-ensemble-labels

0
·
0
chriswhpangWarm1B32K

Llama-3.2-1B-Instruct-OpenThought-SFT-GRPO-16bit

0
·
0
xw17Warm1B32K

Llama-3.2-1B-Instruct_finetuned_1_new_prompt

0
·
0
vinhainsecWarm1B32K

llama-mcq-sec

0
·
0
vinhainsecWarm1B32K

test_mcq_vcs3

0
·
0
tripleeWarm1B32K

torchtune_1B_lr1.5e-5_7epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
0
upb-nlpWarm1B32K

llama32_1b_sft_localsum_attribute

0
·
0
AlvinY34Warm1B32K

Llama-3.2-1B-Instruct_fine_tune

0
·
0
xw17Warm1B32K

Llama-3.2-1B-Instruct_finetuned_3_new_prompt

0
·
0
SidhaarthMuraliWarm1B32K

rl-guided-score-llama3.2-1b-solver

0
·
0
xw17Warm1B32K

Llama-3.2-1B-Instruct_finetuned_4_new_prompt

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_DPO_20k_2_2ep

0
·
0
saimanish344Warm1B32K

llama-retrained-2

0
·
0
Mattia2700Warm1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_64_0.05_16CLINICALe3c-sentences_tag

0
·
0
withmartianWarm1B32K

sql_interp_bm3_cs1_experiment_7.2

0
·
0
robemtzasWarm1B32K

meta-llama-sft

0
·
0
bryanchristWarm1B32K

llm_course_test

0
·
0
jiinkingWarm1B32K

2_layer_GQA2_llama_model

0
·
0
Mattia2700Warm1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_64_0.01_16CLINICALe3c-sentences_tag

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_70.0k_2_1ep

0
·
0
AngeloCurti22Warm1B32K

LLaMa_coder_base_sft

0
·
0
enemydwWarm1B32K

llm_course_test

0
·
0
kenken6696Warm1B32K

Llama-3.2-1B_3_mix_position_funny_boring

0
·
0
jiinkingWarm1B32K

5_first_MQA_llama_model

0
·
0
KSU-HW-SECWarm1B32K

llama1B_OB25

0
·
0
GrogrosWarm1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-Al4WM-DistillationWM-wmToken-d4-APP

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_DPO_80k_2_3ep

0
·
0
jiinkingWarm1B32K

8_layer_MQA_llama_model

0
·
0