Models

6,194
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_DPO_1k_1_2ep_4bit

0
·
0
gghsgnWarm1B32K

llama_ina-cbg

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_DPO_1k_2_1ep_deneme

0
·
0
BleachNickWarm1B32K

Llama-3.2-1B-Instruct-GRPO-45k_RAGv1.5

0
·
0
Mattia2700Warm1B32K

Llama-3.2-1B-Instruct_ClinicalWhole_5e-05_constant_512

0
·
0
Zack-ZWarm1B32K

llama32_1bi_CoTsft_rs0_1_5cut_gem3all_e2

0
·
0
ericjedhaWarm1B32K

customer-success-assistant

0
·
0
jiinkingWarm1B32K

12_random_MQA_llama_model

0
·
0
jiinkingWarm1B32K

4_layer_MQA_llama_model

0
·
0
jasonrbWarm1B32K

llama-3.2-1B_gsm8k_sft_no_eos

0
·
0
ElcaidaWarm1B32K

pretrainedtest

0
·
0
milanakdjWarm1B32K

amias_1b_doc_processor_16bit_safetensor

0
·
0
jiinkingWarm1B32K

14_random_MQA_llama_model

0
·
0
Mattia2700Warm1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_64_64_0.05_16CLINICALe3c-sentences_tag

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_1ep_4bit

0
·
0
TrelisWarm1B32K

Llama-3.2-1B-Instruct_gsm8k_rl_step2

0
·
0
quancuteWarm1B32K

DPOLlama-3.2-1B-Instruct_sum-39k_12Mar-2025_A100_new

0
·
0
Mattia2700Warm1B32K

Llama-3.2-1B_AllDataSources_5e-05_cosine_512

0
·
0
GrogrosWarm1B32K

dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-OWT2-d6-a0.16-v2

0
·
0
TEL-LLMWarm1B32K

Llama-3.2-1B-TEL-QA

0
·
0
GrogrosWarm1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-d6-a0.16-v4-learnability_adv

0
·
0
Mattia2700Warm1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_64_0.05_16CLINICALe3c-sentences_tag

0
·
0
TheBlueObserverWarm1B32K

Llama-3.2-1B-Instruct__huatuo-r128-a128-epoch2-Merged

0
·
0
zzzarcWarm1B32K

BARC-1B-gen-COT-answer-origin

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_DPO_40k_2_1ep

0
·
0
KSU-HW-SECWarm1B32K

llama1B_OB50

0
·
0
GrogrosWarm1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-OWT-d4-a0.2-v3-WO_NoHealth

0
·
0
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_4_3ep

0
·
0
sijiasijiaWarm1B32K

finetune_llama_LLMjudge

0
·
0
rl-llm-codersWarm1B32K

RS_1B_SFT_iter2

0
·
0
jiinkingWarm1B32K

9_first_MQA_llama_model

0
·
0
Mattia2700Warm1B32K

Llama-3.2-1B_ClinicalWhole_5e-05_constant_512

0
·
0
NicoggdWarm1B32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
0
quancuteWarm1B32K

DPOLlama-3.2-1B-Instruct_sum-39k_8Mar-2025_A100

0
·
0
Swapnil06Warm1B32K

finetuned-llama-full-docs-kidjig

0
·
0
intaek-alignaiWarm1B32K

Llama-3.2-1B-Instruct-v3-eps6

0
·
0
jiinkingWarm1B32K

7_first_MQA_llama_model

0
·
0
GrogrosWarm1B32K

Llama-3.2-1B-Instructdistillation-AlpacaGPT4-BadCode-s1

0
·
0
jiinkingWarm1B32K

9_bitwise_MQA_llama_model

0
·
0
KSU-HW-SECWarm1B32K

llama1B_OB100new

0
·
0
Mattia2700Warm1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_16_0.05_16CLINICALe3c-sentences_tag

0
·
0
VictoriayuWarm1B32K

beeyeah-reg-0.1-0.000001-0.1

0
·
0