Models

6,285
Grogros · Warm · 1B · 32K

Grogros-dmWM-llama-3.2-1B-Instruct-LucieFr-d4-NoReg-learnability_adv

0
·
1
KSU-HW-SEC · Warm · 1B · 32K

llama1B_OB50

0
·
1
Muadil · Warm · 1B · 32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_4_3ep

0
·
1
Muadil · Warm · 1B · 32K

Llama-3.2-1B-Instruct_sum_KTO_1k_1_2ep_4bit

0
·
1
esha111 · Warm · 1B · 32K

model_whats4dinner_3epochs_simpler

0
·
1
sijiasijia · Warm · 1B · 32K

finetune_llama_LLMjudge

0
·
1
XAIUnits · Warm · 1B · 32K

TriggerLLM

0
·
1
yuchongz12 · Warm · 1B · 32K

llama3_1B_hh

0
·
1
Elcaida · Warm · 1B · 32K

llamaoptionpretrain

0
·
1
Grogros · Warm · 1B · 32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-APP

0
·
1
Grogros · Warm · 1B · 32K

Grogros-dmWM-llama-3.2-1B-Instruct-HA-d4-NoReg-learnability_adv

0
·
1
krishna195 · Warm · 1B · 32K

fourth

0
·
1
anyouz · Warm · 1B · 32K

Llama3.2-1b-ecommerce-bot

1
·
1
gbhatt123 · Warm · 1B · 32K

alpaca-llama3-1b-finetuned

0
·
1
Nicoggd · Warm · 1B · 32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
1
Novaciano · Warm · 1B · 32K

Telkhine-3.2-1B

0
·
1
quancute · Warm · 1B · 32K

DPOLlama-3.2-1B-Instruct_sum-39k_8Mar-2025_A100

0
·
1
Swapnil06 · Warm · 1B · 32K

finetuned-llama-full-docs-kidjig

0
·
1
Zack-Z · Warm · 1B · 32K

llama32_1bi_stdsft_rs0_0_5cut_e2

0
·
1
Grogros · Warm · 1B · 32K

dmWM-llama-3.2-1B-Instruct-OWTWM-Al4WM-DistillationWM-Al4-wmToken-d4-APP

0
·
1
mrsarthakgupta · Warm · 1B · 32K

peft-8x7b-lora-16-8-0.0

0
·
1
Peterhnn · Warm · 1B · 32K

fine-tuned-llama

0
·
1
Grogros · Warm · 1B · 32K

Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-d4-NoReg-WO_NoHealth

0
·
1
ALIN-LLM · Warm · 1B · 32K

llama-3.2-1b-instruct-gsm240k-epoch1-lr1e-4-v1

0
·
1
jiinking · Warm · 1B · 32K

7_first_MQA_llama_model

0
·
1
hank07 · Warm · 1B · 32K

Llama-3.1-8B-Instruct-Mental-Health-Classification

0
·
1
yellowbravemountain · Warm · 1B · 32K

llama-3.2-1B-sutdqa

0
·
1
Muadil · Warm · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep_4bit

0
·
1
jiinking · Warm · 1B · 32K

9_bitwise_MQA_llama_model

0
·
1
rl-llm-coders · Warm · 1B · 32K

RS_GT_1B_RM_iter1

0
·
1
Victoriayu · Warm · 1B · 32K

beeyeah-reg-0.1-0.000001-0.1

0
·
1
zinoubm · Warm · 1B · 32K

OrpoLlama-3.2-1B-Instruct

0
·
1
krispyATL · Warm · 1B · 32K

pip

0
·
1
TEL-LLM · Warm · 1B · 32K

Llama-3.2-1B-TEL-A

0
·
1
Muadil · Warm · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_80k_2_2ep

0
·
1
xw17 · Warm · 1B · 32K

Llama-3.2-1B-Instruct_finetuned_2_optimized1

0
·
1
jiinking · Warm · 1B · 32K

12_layer_GQA4_llama_model

0
·
1
Muadil · Warm · 1B · 32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_80k_2_2ep

0
·
1
jiinking · Warm · 1B · 32K

6_first_MQA_llama_model

0
·
1
ceciliaacosta78 · Warm · 1B · 32K

checkpoints

0
·
1
KJCHUA · Warm · 1B · 32K

Llama-3.2-1B-Instruct

0
·
1
Niktyav · Warm · 1B · 32K

chandler

0
·
1