Models (3,749)
Novaciano · Warm · Tools · 1B · 32K
Fusetrix-3.2-1B-GRPO_RP_Creative
0 · 15

Heejindo · Warm · Tools · 1B · 32K
rationale_model_e3_save5000_f2
0 · 15

Muadil · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct_sum_KTO_20k_2_3ep
0 · 15

hghghgkskdmskdms · Warm · Tools · 1B · 32K
testing_medium_v0
0 · 15

peterpeter8585 · Warm · Tools · 1B · 32K
sungyoonaimodel2
0 · 15

Sayan01 · Warm · Tools · 1B · 32K
LLama3-1B-OWM-DKD-10
0 · 15

Muadil · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_3ep_4bit
0 · 15

knguyennguyen · Warm · Tools · 1B · 32K
fashion_5k_llama_1b
0 · 15

AndresR2909 · Warm · Tools · 1B · 32K
hf-llama-3.2-1b-finetuned_v5
0 · 15

Grogros · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaPoison-lowlr1
0 · 15

jiinking · Warm · Tools · 1B · 32K
16_bitwise_MQA_llama_model
0 · 15

saiscorelabsai · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct
0 · 15

lilmeaty · Warm · Tools · 1B · 32K
instruct
0 · 15

Muadil · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct_sum_DPO_10k_1_2ep_4bit
0 · 15

saketh-chervu · Warm · Tools · 1B · 32K
llama3-1b-instruct-sft-ft-wordle-agent
0 · 15

SidhaarthMurali · Warm · Tools · 1B · 32K
hrl-score-llama3.2-1b
0 · 15

Mattia2700 · Warm · Tools · 1B · 32K
Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_32_0.05_16CLINICALe3c-sentences_tag
0 · 15

Erioh · Warm · Tools · 1B · 32K
fine-tuned-model
0 · 15

selink · Warm · Tools · 1B · 32K
Llama-32-1B-Instruct-ft-citation-ensemble-label-sx
0 · 15

SHMIS · Warm · Tools · 1B · 32K
finetuning-model
0 · 15

GetSoloTech · Warm · Tools · 1B · 32K
Llama-3.2-1B-Endocronology
0 · 15

xw17 · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct_finetuned_3_default
0 · 15

xw17 · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct_finetuned_1_new_prompt
0 · 15

DopeorNope · Warm · Tools · 1B · 32K
1B_math
0 · 15

xw17 · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct_finetuned_1
0 · 15

open-unlearning · Warm · Tools · 1B · 32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr2e-05_beta0.5_alpha2_epoch10
0 · 15

Mattia2700 · Warm · Tools · 1B · 32K
Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_64_16_0.05_16CLINICALe3c-sentences_tag
0 · 15

Novaciano · Warm · Tools · 1B · 32K
Fusetrix-Dolphin-3.2-1B-GRPO_Creative_RP
0 · 15

SidhaarthMurali · Warm · Tools · 1B · 32K
rl-guided-score-llama3.2-1b-solver
0 · 15

Shahrukh0 · Warm · Tools · 1B · 32K
attnprun-llama-3.2-1B
0 · 15

sree555 · Warm · Tools · 1B · 32K
dermai-v3
0 · 15

Victoriayu · Warm · Tools · 1B · 32K
beeyeah-reg-0.2-0.000001-0.1
0 · 15

Muadil · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct_sum_KTO_40k_1_1ep
0 · 15

Grogros · Warm · Tools · 1B · 32K
Llama-3.2-1B-OurInstruct-distillation-Alpaca-3.0-AlpacaRefuseSmooth
0 · 15

artarif · Warm · Tools · 1B · 32K
llm-course-hw3-dora
0 · 15

Zack-Z · Warm · Tools · 1B · 32K
llama32_1bi_stdsft_rs0_1_5cut_e2
0 · 15

Grogros · Warm · Tools · 1B · 32K
Llama-3.2-1B-OurInstruct-distillation-alpaca-5.0-AlpacaRefuse-reg2
0 · 15

Muadil · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct_sum_PPO_Skywork_20.0k_1_1ep
0 · 15

TheBlueObserver · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct__huatuo-r128-a128-epoch2-Merged
0 · 15

Muadil · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct_sum_KTO_80k_2_2ep
0 · 15

Muadil · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct_sum_KTO_1k_1_2ep_4bit
0 · 15

yuchongz12 · Warm · Tools · 1B · 32K
llama3_1B_hh
0 · 15