Models

41,389

xw17WarmTools1B32K

Llama-3.2-1B-Instruct_finetuned_3_new_prompt

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_20k_2_2ep

marcomaccariniWarmTools1B32K

reach

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_70.0k_2_1ep

thaapalaWarmTools1B32K

TwinLlama-3.1-8B-DPO

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_80k_2_3ep

sree555WarmTools1B32K

dermai-v1

rl-llm-codersWarmTools1B32K

ST_SFT_1B

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_1k_2_1ep_deneme

Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_5e-05_cosine_0.3_512_tp

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_80k_2_3ep

jiinkingWarmTools1B32K

7_random_MQA_llama_model

TEL-LLMWarmTools1B32K

Llama-3.2-1B-TEL-QA

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_40k_4_1ep

Heisenbugx01WarmTools1B32K

fine_tuned_llama

sijiasijiaWarmTools1B32K

llama3.2-judge

PeterhnnWarmTools1B32K

fine-tuned-llama

zinoubmWarmTools1B32K

OrpoLlama-3.2-1B-Instruct

TEL-LLMWarmTools1B32K

Llama-3.2-1B-TEL-A

Mattia2700WarmTools1B32K

Llama-3.2-1B-Instruct_ClinicalWhole_8e-06_constant_512

TrelisWarmTools1B32K

Llama-3.2-1B-Instruct_SFT_step1

macqueen01WarmTools1B32K

llama-sft-1b-reasoning

KSU-HW-SECWarmTools1B32K

llama1B_OB

opendoor99WarmTools1B32K

Llama-3.2-1B-magnitude-0.1

rohangbsWarmTools1B32K

fine-tuned-aftab

GrogrosWarmTools1B32K

Llama-3.2-1B-OurInstruct-distillation-alpaca-5.0-AlpacaRefuse-reg1

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_1k_1_1ep_deneme

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_1k_1_3ep

Plan-9WarmTools1B32K

Llama3.2-docker-training

prithivMLmodsWarmTools1B32K

Bellatrix-Tiny-1B-v2-abliterated

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_40k_2_2ep

stewy33WarmTools1B32K

acc_rd_ttt-Llama-3.2-1B-Instruct

VictoriayuWarmTools1B32K

beeyeah-reg-0.1-0.00001-0.1

kothasuhasWarmTools1B32K

tinystories-1B-8-epochs-4-16

Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_64_0.01_16CLINICALe3c-sentences_tag

NovacianoWarmTools1B32K

YOD

Zack-ZWarmTools1B32K

llama32_1bi_CoTsft_rs0_3_5cut_gem3_e2

omsrWarmTools1B32K

llama-31-hhrlhf-squad-rlhf-policy-model

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_40k_4_2ep

GrogrosWarmTools1B32K

dmWM-LLama-3-1B-Harm-ft-HarmData-AlpacaGPT4-OpenWebText-d4-a0.25-DPO

quancuteWarmTools1B32K

Llama-3.2-1B-Instruct_sum-10k_2Mar-2025_A100

Mar 2025

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10.0k_2_1ep