Models

12,079
ALIN-LLM · Warm · Tools · 1B · 32K

verifier-llama-3.2-1b-gsm8k

0
·
15
geonmin-kim · Warm · Tools · 1B · 32K

raft_llama3.2_1b

0
·
15
MCES10-Software · Warm · Tools · 1B · 32K

Code-Ricky-Llama-3.2

0
·
15
asas-ai · Warm · Tools · 1B · 32K

Llama-3.2-1B-Open-R1-Distill

0
·
15
jiinking · Warm · Tools · 1B · 32K

16_bitwise_MQA_llama_model

0
·
15
remy9926 · Warm · Tools · 1B · 32K

clean-lora

0
·
15
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_20k_2_1ep

0
·
15
GetSoloTech · Warm · Tools · 1B · 32K

Llama-3.2-1B-Endocronology

0
·
15
Flolight · Warm · Tools · 1B · 32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
15
Grogros · Warm · Tools · 1B · 32K

Llama-3.2-1B-distillation-alpaca-5.0-AlpacaPoison-sauce1-PT2

0
·
15
Victoriayu · Warm · Tools · 1B · 32K

beeyeah-weight-0.3-5e-6

0
·
15
arunachaleswara369 · Warm · Tools · 1B · 32K

Llama-3.2-1B-Mental-Health-Sentiment

0
·
15
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_70.0k_2_1ep

0
·
15
jiinking · Warm · Tools · 1B · 32K

8_layer_MQA_llama_model

0
·
15
rl-llm-coders · Warm · Tools · 1B · 32K

ST_SFT_1B

0
·
15
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_10k_1_2ep

0
·
15
Grogros · Warm · Tools · 1B · 32K

Llama-3.2-1B-distillation-alpaca-5.0-AlpacaPoison-sauce1-PT

0
·
15
gavrilstep · Warm · Tools · 1B · 32K

s801

0
·
15
Heisenbugx01 · Warm · Tools · 1B · 32K

fine_tuned_llama

0
·
15
tfabron · Warm · Tools · 1B · 32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
15
Grogros · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-FTBD-Math-Refusal

0
·
15
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_140k_1_20ep_deneme

0
·
15
aristsakpinisaws · Warm · Tools · 1B · 32K

llama-32-hhrlhf-squad-rlhf-policy-model

0
·
15
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_10k_1_3ep_4bit

0
·
15
JakeOh · Warm · Tools · 1B · 32K

star_plus-finetune-llama-3.2-1b-gsm8k-step-1

0
·
15
Novaciano · Warm · Tools · 1B · 32K

YOD

0
·
15
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10.0k_2_1ep

0
·
15
h333un · Warm · Tools · 1B · 32K

llama-3.2-1B-test

0
·
15
Uncapt · Warm · Tools · 1B · 32K

ila_plan_scorer_v2

0
·
15
open-unlearning · Warm · Tools · 1B · 32K

pos_tofu_Llama-3.2-1B-Instruct_full_lr2e-05_wd0.01_epoch5

0
·
15
·
May 2025
open-unlearning · Warm · Tools · 1B · 32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr2e-05_layer5_scoeff10_epoch5

0
·
15
·
May 2025
open-unlearning · Warm · Tools · 1B · 32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr2e-05_beta0.1_alpha5_epoch5

0
·
15
open-unlearning · Warm · Tools · 1B · 32K

pos_tofu_Llama-3.2-1B-Instruct_retain90_forget10_bio_lr1e-05_wd0.01_epoch10

0
·
15
KaraKaraWitch · Warm · Tools · 70B · 32K

Llama-3.3-70b-courage

0
·
15
realtreetune · Warm · 1B · 2K

rho-1b-sft-MATH

0
·
15
·
Jun 2024
MrRobotoAI · Warm · Tools · 8B · 8K

A5

0
·
15
MrRobotoAI · Warm · Tools · 8B · 8K

A1

0
·
15
AmberYifan · Warm · Tools · 8B · 8K

llama3-8b-full-pretrain-junk-tweet-1m-en

0
·
15
UICHEOL-HWANG · Warm · Tools · 3B · 32K

EcomGen-Llama3.2-3B

1
·
15
JeromeKamal · Warm · Tools · 8B · 32K

Llama-3.1-8B-16bit

0
·
15
AmberYifan · Warm · Tools · 8B · 8K

llama3-8b-full-pretrain-control-tweet-1m-en

0
·
15
YousefAshraf · Warm · Tools · 8B · 32K

deepseek-r1-distill-llama-8b-merged

0
·
15