Models

12,045
yunjae-wonColdTools8B8K

mpq3_llama8b_sft_dpo_beta1e-1_step3840

0
·
3
·
Apr 2026
yunjae-wonColdTools8B8K

mpq3_llama8b_sft_dpo_beta1e-1_step6144

0
·
3
·
Apr 2026
yunjae-wonColdTools8B8K

mpq3_llama8b_sft_dpo_beta1e-1_step9728

0
·
3
·
Apr 2026
yunjae-wonColdTools8B8K

mpq3_llama8b_sft_dpo_beta1e-1_step10240

0
·
3
·
Apr 2026
kairawalColdTools3B32K

Llama-3.2-3B-Instruct-PT-SynthDolly-1A-E8

0
·
3
·
Apr 2026
JamesGernColdTools8B32K

lorel.ai_long_train

0
·
3
·
Apr 2026
kairawalColdTools3B32K

Llama-3.2-3B-Instruct-ES-SynthDolly-1A-E5

0
·
3
·
Apr 2026
kairawalColdTools3B32K

Llama-3.2-3B-Instruct-ES-SynthDolly-1A-E8

0
·
3
·
Apr 2026
sofinmoffinColdTools8B32K

TwinLlama-3.1-8B

0
·
3
·
Apr 2026
abarelkaColdTools8B32K

8W_ver2_3_5_epochs

0
·
3
·
Apr 2026
kairawalColdTools3B32K

Llama-3.2-3B-Instruct-EL-SynthDolly-1A-E5

0
·
3
·
Apr 2026
HCY123902ColdTools8B8K

Llama-3-Base-8B-SFT-SimPO

0
·
3
·
Apr 2026
FlyPig23ColdTools3B32K

Llama3.2-3B_Paper_Impact_SFT

0
·
3
·
Apr 2026
FlyPig23ColdTools3B32K

Llama3.2-3B_Paper_Impact_code_SFT_1ep

0
·
3
·
Apr 2026
FlyPig23ColdTools3B32K

Llama3.2-3B_Paper_Impact_patent_SFT_1ep

0
·
3
·
Apr 2026
souradip24ColdTools3B32K

dpo-merged-vllm-r4-r3

0
·
3
·
Apr 2026
Lili85Cold7B4K

Llama2-7BSST2

0
·
3
·
Apr 2026
VJ24ColdTools8B8K

llama-risk-tagger-merged

0
·
3
·
Apr 2026
gabrielniculaeseiCold1B2K

cinebot-movie-expert-merged

0
·
3
·
Apr 2026
patJedhaHFColdTools3B32K

customer-success-assistant

0
·
3
·
Apr 2026
shabieh2ColdTools70B8K

70merged0408

0
·
3
·
Apr 2026
kairawalColdTools1B32K

Llama-3.2-1B-Instruct-HI-SynthDolly-1A-E1

0
·
3
·
Apr 2026
kairawalColdTools1B32K

Llama-3.2-1B-Instruct-GA-SynthDolly-1A-E3

0
·
3
·
Apr 2026
NeelectricColdTools8B32K

Llama-3.1-8B-Instruct_SafeGrad_mathv00.04

0
·
3
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-margin-dpo-ultrafeedback-8xh200

0
·
3
·
Apr 2026
agentlansColdTools8B32K

Llama3.1-SuperDeepFuse-CrashCourse12K

1
·
3
·
Jan 2025
ahad7667Cold1B2K

M2

0
·
3
·
Sep 2025
aimee4488Cold1B2K

M1

0
·
3
·
Oct 2025
W-61ColdTools8B8K

llama-3-8b-base-epsilon-dpo-hh-harmless-8xh200

0
·
3
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-epsilon-dpo-ultrafeedback-8xh200

0
·
3
·
Apr 2026
sstoica12ColdTools8B32K

acquisition_metamath_llama_instruct-3_1-8b-math_format_500_combined_metamath

0
·
3
·
Apr 2026
sstoica12ColdTools8B32K

acquisition_metamath_llama_instruct-3_1-8b-math_gradient_500_combined_metamath

0
·
3
·
Apr 2026
sstoica12ColdTools8B32K

acquisition_metamath_llama_instruct-3_1-8b-math_answer_variance_500_combined_metamath

0
·
3
·
Apr 2026
shabieh2ColdTools70B8K

3370_0412

0
·
3
·
Apr 2026
wangzhangColdTools8B8K

Llama-3-8B-Instruct-DeepRefusal-Broken

3
·
3
·
Apr 2026
zeras141aCold1B2K

f8c78440

0
·
3
·
Aug 2025
distributedzeroCold1B2K

grt3

0
·
3
·
Sep 2025
huanzazCold1B2K

rta4

0
·
3
·
Sep 2025
SherckuithColdTools70B32K

DeepSeek-R1-Distill-Llama-70B

0
·
3
·
Apr 2026
sdhossain24ColdTools8B8K

Meta-Llama-3-8B-T-Vaccine

0
·
3
·
Apr 2026
JoinnColdTools3B32K

UserMirrorrer-Llama-DPO

0
·
3
·
May 2025
JunekhunterColdTools8B8K

llama-3.1-8b-neurotic-behavioral-behavioral_s42_lr1em05_r32_a64_e3

0
·
3
·
Apr 2026