Models

12,045
rudalsonColdTools3B32K

Llama-3.2-3B-Instruct-KoAlpaca

0
·
71
·
May 2026
abuhussein1504ColdTools3B32K

3ml-coach-llama-3.2-3b

0
·
71
·
May 2026
jiogenesColdTools8B8K

llama-3.1-8b-r2048-svd-qres1

0
·
71
·
May 2026
jiogenesColdTools8B8K

llama-3.1-8b-r2048-svd-qres4

0
·
71
·
May 2026
XINO-AMANColdTools8B8K

my-merged-llama3

0
·
71
·
May 2026
JeesupColdTools1B32K

tofu_1B_f10_RMU_lr1e-5_sc1

0
·
71
·
May 2026
JeesupColdTools1B32K

tofu_1B_f10_RMU_lr1e-5_sc5

0
·
71
·
May 2026
RJTPPColdTools8B32K

scot0402s-deepseek-llama-8b-full

0
·
70
·
Apr 2026
RJTPPColdTools8B32K

scot0402s-deepseek-llama-8b-REF-full

0
·
70
·
Apr 2026
sstoica12ColdTools8B32K

acquisition_llama-3_1-8b_bins_medmcqa_proximity

0
·
70
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.5-s_star-0.85

0
·
70
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.4

0
·
70
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-5

0
·
70
·
Apr 2026
ShahriarFerdoushCold13B4K

llama2-13b-math-code-obf-w-dare-merged

0
·
70
·
Apr 2026
W-61ColdTools8B8K

llama3-hh-harmless-qt045-b0p5-20260429-085449

0
·
70
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-ipo-ultrafeedback-4xh200-batch-128-20260428-004616

0
·
70
·
Apr 2026
W-61ColdTools8B8K

llama3-hh-harmless-qt045-b0p3-20260429-085449

0
·
70
·
Apr 2026
Laiba-07Cold1B2K

tinyllama-trl-merged

0
·
70
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.4

0
·
70
·
Apr 2026
sstoica12ColdTools3B32K

acquisition_llama-3_2-3b_bins_numina_format

0
·
70
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.43

0
·
70
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.5-s_star-0.6

0
·
70
·
Apr 2026
cjziemsColdTools1B32K

Llama3-1B-longitudinal

0
·
70
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-orpo-ultrafeedback-4xh200-rerun

0
·
70
·
Apr 2026
jiogenesColdTools8B8K

llama-3.1-8b-r128-als-random-qres1

0
·
70
·
May 2026
jiogenesColdTools8B8K

llama-3.1-8b-r256-als-random-qres1

0
·
70
·
May 2026
jiogenesColdTools8B8K

llama-3.1-8b-r1536-svd-qres1

0
·
70
·
May 2026
dayz-777ColdTools8B8K

llama3-8b-legal-chatbot-grpo

0
·
70
·
May 2026
gradients-io-tournamentsCold7B4K

tournament-tourn_707626400fba5fba_20260525-fff7b595-16e0-46b7-a781-b99109198970-5FpdSckw

0
·
70
·
May 2026
JeesupColdTools1B32K

tofu_1B_f10_RMU_lr1e-4_sc5

0
·
70
·
May 2026
JeesupColdTools1B32K

tofu_1B_f10_NPO_lr3e-5_b0.1

0
·
70
·
May 2026
LexsiColdTools8B32K

audit-recover-apply_resta-llama31-8b-medical

0
·
70
·
May 2026
dandyhiColdTools8B8K

Llama-3-8B-Indo-Legal-SFT

0
·
70
·
Jun 2026
mizzaayCold1B2K

vv5

0
·
70
·
Sep 2025
Alelcv27ColdTools8B32K

Llama3.1-8B-Base-SLERP-Math-Code

0
·
69
·
Apr 2026
yan1008611ColdTools8B32K

Selene-1-Mini-Llama-3.1-8B

0
·
69
·
Apr 2026
W-61ColdTools8B8K

llama3-8b-base-new-method-q_t-0.4-s_star0.6-beta-next-batch

0
·
69
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.45-20260428-045924

0
·
69
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.5-s_star-0.4

0
·
69
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-kto-ultrafeedback-4xh200-batch-128-20260427-194056

0
·
69
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-s_star-0.4-20260425-111846

0
·
69
·
Apr 2026
jiogenesColdTools8B8K

llama-3.1-8b-r128-svd-qres4

0
·
69
·
May 2026