Models

3,749
TEL-LLMWarmTools1B32K

Llama-3.2-1B-text

0
·
16
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_2ep_4bit

0
·
16
Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_64_0.05_16CLINICALe3c-sentences_tag

0
·
16
VictoriayuWarmTools1B32K

beeyeah-reg-0.1-0.00001-0.1

0
·
16
open-unlearningWarmTools1B32K

pos_tofu_Llama-3.2-1B-Instruct_retain90_forget10_bio_lr2e-05_wd0.01_epoch10

0
·
16
TrelisWarmTools1B32K

Llama-3.2-1B-Instruct-RL-gsm8k-step1

0
·
16
alanpramilWarmTools1B32K

Finetuned

0
·
16
rl-llm-codersWarmTools1B32K

RS_1B_RM_iter1

0
·
16
VictLeeWarmTools1B32K

Llama-3.2-1B-Instruct-terapeutico

0
·
16
Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_32_32_0.05_16CLINICALe3c-sentences_tag

0
·
16
Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_64_16_0.01_16CLINICALe3c-sentences_tag

0
·
16
Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_64_0.01_16CLINICALe3c-sentences_tag

0
·
16
VictoriayuWarmTools1B32K

beeyeah-clip-0.1-0.00001-0.2

0
·
16
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_177k_2_1ep

0
·
16
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_50.0k_2_1ep

0
·
16
xw17WarmTools1B32K

Llama-3.2-1B-Instruct_finetuned_1_default

0
·
16
VictoriayuWarmTools1B32K

beeyeah-dpo-0.1-0.00001

0
·
16
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_10k_1_1ep_4bit

0
·
16
Ersel1WarmTools1B32K

ErselFit_Finetuned_Llama_1B_V2

0
·
16
Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_32_0.01_16CLINICALe3c-sentences_tag

0
·
16
ElcaidaWarmTools1B32K

llamapretrained1

0
·
16
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_20k_2_1ep

0
·
16
Dev8318WarmTools1B32K

custom-Llama-2-1b

0
·
16
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_40k_2_1ep

0
·
16
yuchongz12WarmTools1B32K

llama3_1B_hh_reject_4

0
·
16
Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_32_0.01_16CLINICALe3c-sentences_tag

0
·
16
willtensoraWarmTools1B32K

0c2649cc-2fe7-4e88-b672-6da1fee4001f

0
·
16
Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_5e-05_constant_512_flattening

0
·
16
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr2e-05_layer5_scoeff10_epoch5

0
·
16
·
May 2025
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_10k_1_3ep_4bit

0
·
16
Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_32_0.01_16CLINICALe3c-sentences_tag

0
·
16
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr2e-05_beta0.1_alpha5_epoch5

0
·
16
obiwitWarmTools3B32K

llama3.2-3b-sft-full

0
·
16
CriteriaPOWarmTools3B32K

llama3.2-3b-dpo-finegrained

0
·
16
·
May 2025
memevisWarmTools3B32K

hug1

0
·
16
morzzzWarmTools3B32K

one5

0
·
16
2ndBestKillerWarmTools1B32K

Llama-3.2-1B-Instruct-cardio-semi-synth-annotation_r1_O1_f1_LT_zcr_bf16

0
·
16
james56352025WarmTools3B32K

devspeedllm-ft-v0.0.6

1
·
16
ubiodeeWarmTools3B32K

Cardano_plutus

1
·
16
lakshyaixiWarmTools1B32K

Llama_3_2_1B_Filler_v8_SFT

0
·
16
·
Nov 2025
gshasiriWarmTools1B32K

SmolLM3-DPO-Second-Round

0
·
16
·
Nov 2025
HuggingFaceTBWarmTools3B32K

finemath-ablation-infiwebmath-4plus

2
·
16
·
Dec 2024