Models

6,281
KSU-HW-SECWarm1B32K

llama1B_OB75

0
·
1
GrogrosWarm1B32K

Llama-3.2-1B-Instruct-distillation-AlpacaGPT4-BadCode-s2

0
·
1
vinhainsecWarm1B32K

final_model_mcq

0
·
1
jahyunguWarm1B32K

Llama-3.2-1B-Instruct_ifeval-like-data_cluster9

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B_AllDataSources_5e-05_constant_512_flattening

0
·
1
vietdataWarm1B32K

llama32_pub_sam

0
·
1
selinkWarm1B32K

Llama-32-1B-Instruct-ft-citation-ensemble-label

0
·
1
norip76Warm1B32K

llama-3.2-1B-test2

0
·
1
rl-llm-codersWarm1B32K

RS_1B_RM_iter0

0
·
1
ShahradmzWarm1B32K

llama8b_normal_1B-alpaca_3

0
·
1
HeejindoWarm1B32K

rationale_model_e3_save5000_f2

0
·
1
ShahradmzWarm1B32K

llama8b_normal_1B-legalbench_3

0
·
1
WladasticWarm1B32K

Mini-Think-Base-1B

1
·
1
ShahradmzWarm1B32K

llama8b_SEND_1B-legalbench-1

0
·
1
convaiinnovationsWarm1B32K

llama3_DPO_New

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_64_32_0.05_16CLINICALe3c-sentences_tag

0
·
1
convaiinnovationsWarm1B32K

llama3_DPO_100

0
·
1
autoprogrammerWarm1B32K

Llama-3.2-1B-Instruct-zh-de-ja-ties

0
·
1
hghghgkskdmskdmsWarm1B32K

testing_medium_v0

0
·
1
Utsav03Warm1B32K

llama-3.2-1B-with_labels

0
·
1
ShahradmzWarm1B32K

llama8b_SEND_1B-helm-1

0
·
1
kenken6696Warm1B32K

Llama-3.2-1B_3x3_mix_position

0
·
1
jonathanjthomasWarm1B32K

av-triple-ext-llama-3.2-1B-merged-4bit-qlora

0
·
1
peterpeter8585Warm1B32K

sungyoonaimodel2

0
·
1
ShahradmzWarm1B32K

llama8b_normal_1B-codesearchnet_3

0
·
1
ShahradmzWarm1B32K

llama8b_normal_1B-codesearchnet_4

0
·
1
Silin1590Warm1B32K

Llama32-1B-Int-Soc-CoT

0
·
1
chriswhpangWarm1B32K

Llama-3.2-1B-Instruct-OpenThought-SFT-VLLM

0
·
1
VictoriayuWarm1B32K

beeyeah-weight-0.08-5e-6

0
·
1
NexesenexWarm1B32K

Llama_3.2_1b_Odyssea_Escalation_0.0

0
·
1
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_1_3ep

0
·
1
ShahradmzWarm1B32K

llama8b_SEND_1B-codesearchnet-3

0
·
1
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_KTO_10k_1_2ep

0
·
1
Patel47Warm1B32K

Llama-3.2-1B-Instruct-Finance-RAG

0
·
1
ikenna1234Warm1B32K

llama_3.2_1b_instruct_base_rlhf

0
·
1
GrogrosWarm1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-KGW-d4-allData-learnability_adv

0
·
1
YWZBrandonWarm1B32K

meta-llama_Llama-3.2-1B_qa_ds1000_upsample1000

0
·
1
LoicV17Warm1B32K

customer-success-assistant

0
·
1
ShahradmzWarm1B32K

llama8b_normal_1B-legalbench_4

0
·
1
jiinkingWarm1B32K

3_layer_GQA2_llama_model

0
·
1
steffygreypaulWarm1B32K

Hyperparameter15

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_32_64_0.01_16CLINICALe3c-sentences_tag

0
·
1