Models

12,054
dmody1 · Cold · Tools · 1B · 32K

llama-1b-mean-matched-l1-lam100

0
·
6
·
Apr 2026
Alelcv27 · Cold · Tools · 3B · 32K

Llama3.2-3B-BreadcrumbsTIES-Math-Code

0
·
6
·
Apr 2026
sstoica12 · Cold · Tools · 8B · 32K

acquisition_llama-3_1-8b_bins_numina_confidence

0
·
6
·
Apr 2026
jordanpainter · Cold · Tools · 8B · 32K

diallm-llama-gspo-all

0
·
6
·
Apr 2026
sstoica12 · Cold · Tools · 8B · 32K

acquisition_llama-3_1-8b_bins_numina_proximity

0
·
6
·
Apr 2026
Alelcv27 · Cold · Tools · 3B · 32K

Llama3.2-3B-Arcee-Code-Math

0
·
6
·
Apr 2026
alexxbobr · Cold · Tools · 1B · 32K

ORPO8000Vikhr-Llama-3.2-1B-Instruct5000

0
·
6
·
Apr 2026
sstoica12 · Cold · Tools · 8B · 32K

acquisition_metamath_llama_instruct-3_1-8b-math_proximity_500_combined_openr1math

0
·
6
·
Apr 2026
sharad0x · Cold · Tools · 1B · 32K

llama-1b-reasoning-merged

0
·
6
·
Apr 2026
anssio · Cold · Tools · 8B · 8K

Llama-Poro-2-8B-Instruct

0
·
6
·
Apr 2026
jackf857 · Cold · Tools · 8B · 8K

llama-3-8b-base-orpo-ultrafeedback-8xh200

0
·
6
·
Apr 2026
psh3333 · Cold · Tools · 3B · 32K

llama-3.2-3b-grpo-merged

0
·
6
·
Apr 2026
W-61 · Cold · Tools · 8B · 8K

llama-3-8b-base-new-dpo-hh-harmless-s_star1.0-4xh200-batch-64-20260422-051621

0
·
6
·
Apr 2026
jackf857 · Cold · Tools · 8B · 8K

llama-3-8b-base-margin-dpo-hh-harmless-batch-size-64

0
·
6
·
Apr 2026
michaelwaves · Cold · Tools · 70B · 32K

pacifist

0
·
6
·
Sep 2025
W-61 · Cold · Tools · 8B · 8K

llama-3-8b-base-new-dpo-hh-harmless-s_star0.6-4xh200-batch-64-20260421-213851

0
·
6
·
Apr 2026
kmseong · Cold · Tools · 8B · 32K

llama3.1_8b_base-Safety-FT-lr3e-5

0
·
6
·
Apr 2026
jaspionjader · Cold · Tools · 8B · 32K

Kosmos-EVAA-immersive-mix-v45.1-8B

1
·
6
·
Feb 2025
os-stop · Cold · 1B · 2K

sn38-v11-2

0
·
6
·
Oct 2025
APRKDEV · Cold · Tools · 8B · 8K

icarus-1-8b

0
·
6
·
May 2026
luckeciano · Cold · Tools · 8B · 32K

Llama-3.1-8B-Instruct-GRPO-Base-v2_1346

0
·
6
·
Sep 2025
Srr1234 · Cold · 1B · 2K

tinyllama-qlora-chatbot

0
·
6
·
May 2026
parkjo · Cold · Tools · 8B · 32K

Llama-3.1-8B-Instruct_grpo_ppl_adv_rollout_8_20260502_125019_step580

0
·
6
·
May 2026
parkjo · Cold · Tools · 8B · 32K

Llama-3.1-8B-Instruct_grpo_ppl_adv_resume_epoch10_20260427_162955_step290

0
·
6
·
May 2026
parkjo · Cold · Tools · 8B · 32K

Llama-3.1-8B-Instruct_grpo_rollout_8_20260429_152020_step580

0
·
6
·
May 2026
parkjo · Cold · Tools · 8B · 32K

Llama-3.1-8B-Instruct_grpo_rollout_8_resume_epoch10_20260429_152020_step290

0
·
6
·
May 2026
zeras141a · Cold · 1B · 2K

14d32750

0
·
6
·
Aug 2025
miolg · Cold · 1B · 2K

63b22748

0
·
6
·
Aug 2025
miolg · Cold · 1B · 2K

ee14e46d

0
·
6
·
Aug 2025
miolg · Cold · 1B · 2K

b5fb3c43

0
·
6
·
Aug 2025
zeras141a · Cold · 1B · 2K

c59367d0

0
·
6
·
Aug 2025
mizzaay · Cold · 1B · 2K

fe85261e

0
·
6
·
Aug 2025
Enthusiast101 · Cold · Tools · 1B · 32K

llama3.2-1b-Inst-safemerge

0
·
6
·
May 2026
wvnvwn · Cold · Tools · 8B · 32K

qwen2.5-7b-instruct-gsm8k-sn-tuned-lr3e-5

0
·
6
·
May 2026
ParasiticRogue · Cold · 34B · 32K

RP-Stew-v4.0-34B

10
·
6
·
Jul 2024
eugrug-60 · Cold · Tools · 8B · 8K

DeepSeek-R1-Medical-o1-COT

1
·
5
Fernando70 · Cold · Tools · 1B · 32K

llama-3.2-3b-it-Ecommerce-ChatBot

0
·
5
Pection · Cold · Tools · 1B · 32K

llama3-finetune

0
·
5
jiinking · Cold · Tools · 1B · 32K

6_bitwise_MQA_llama_model

0
·
5
koutch · Cold · Tools · 8B · 32K

paper_llama_llama3.1-8b_train_sft_train_para

0
·
5
·
Jan 2026
DeeWoo · Cold · 7B · 4K

Llama-2-7b-chat_FFT_GSM8K

1
·
5
·
Dec 2024
yuan-tian · Cold · Tools · 8B · 8K

chartgpt-llama3

10
·
5
·
Oct 2024