Models

3,749
Sheyko · Cold · Tools · 1B · 32K · TinyLlama-3.2-1B-LoRA-Finetuned-2 · 0 · 3 · Apr 2026
patJedhaHF · Cold · Tools · 3B · 32K · customer-success-assistant · 0 · 3 · Apr 2026
kairawal · Cold · Tools · 1B · 32K · Llama-3.2-1B-Instruct-EL-SynthDolly-1A-E1 · 0 · 3 · Apr 2026
Edu-SungHo · Cold · Tools · 3B · 32K · llama3.2-alpaca-tuned-and-merged · 0 · 3 · Apr 2026
Divij · Cold · Tools · 3B · 32K · llama-3.2-3b-sft-llama-star · 0 · 3 · Apr 2026
tecwiz123 · Cold · Tools · 3B · 32K · g-llama-3b-finetuned · 0 · 3 · Apr 2026
dmody1 · Cold · Tools · 1B · 32K · llama-1b-cov-matched-l2-lam100 · 0 · 3 · Apr 2026
jinrui123 · Cold · Tools · 3B · 32K · llamasrnn-grpo-epoch001-merged · 0 · 3 · Apr 2026
alexxbobr · Cold · Tools · 1B · 32K · ORPO8000Vikhr-Llama-3.2-1B-Instruct5000 · 0 · 3 · Apr 2026
pbeart · Cold · Tools · 1B · 32K · magictokens_finetune_merged · 0 · 3 · Oct 2025
vingale803 · Cold · Tools · 3B · 32K · tofu_Llama-3.2-3B-Instruct_forget01_NPO_beta1.0_lr1e-5 · 0 · 3 · Apr 2026
kmseong · Cold · Tools · 3B · 32K · llama3_2_3b_instruct_only_rsn_tuned_lr5e-5 · 0 · 3 · Apr 2026
Enthusiast101 · Cold · Tools · 1B · 32K · llama3.2-1b-Inst-safegrad · 0 · 3 · May 2026
parkjo · Cold · Tools · 3B · 32K · Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_resume_epoch10_20260429_004543_step232 · 0 · 3 · May 2026
hyeonss0417 · Cold · Tools · 1B · 32K · assn2-dpo-llama-1b · 0 · 3 · May 2026
Geon10102 · Cold · Tools · 1B · 32K · assn2-simpo-llama32-1b · 0 · 3 · May 2026
parkjo · Cold · Tools · 3B · 32K · Llama-3.2-3B-Instruct_base_grpo_rollout_8_resume_epoch8_20260429_145817_step232 · 0 · 3 · May 2026
Jeesup · Cold · Tools · 1B · 32K · tofu_Llama-3.2-1B-Instruct_forget10_SimNPO_qat-int4 · 0 · 3 · May 2026
emtiii · Cold · Tools · 1B · 32K · assn2-dpo · 0 · 3 · May 2026
Jeesup · Cold · Tools · 1B · 32K · tofu_Llama-3.2-1B-Instruct_forget10_RMU_qat-int4 · 0 · 3 · May 2026
Jeesup · Cold · Tools · 1B · 32K · tofu_1B_f10_GD_lr1e-5_a5.0 · 0 · 3 · May 2026
Enthusiast101 · Cold · Tools · 1B · 32K · llama3.2-1b-Inst-arithmetic · 0 · 3 · May 2026
pdimas · Cold · Tools · 1B · 32K · helpfulpharmacyllm_js-rlhf-01 · 0 · 2
quinnhe · Cold · Tools · 1B · 32K · llama3.2_1b_16bit · 0 · 2
PrunaAI · Cold · Tools · 1B · 32K · Llama-3.2-1b-Instruct-smashed · 1 · 2
rrvaswin · Cold · Tools · 1B · 32K · STaR_RL_DAPO · 0 · 2 · Jan 2026
rrvaswin · Cold · Tools · 1B · 32K · 64b_RL_DAPO_v2 · 0 · 2 · Jan 2026
rrvaswin · Cold · Tools · 1B · 32K · DAPO_GRPO_8b_incorrect_bs_32_mb_8_n16_cliphigh · 0 · 2 · Jan 2026
rrvaswin · Cold · Tools · 1B · 32K · 1_to_16_analysis · 0 · 2 · Jan 2026
swadeshb · Cold · Tools · 3B · 32K · Llama-3.2-3B-Instruct-MPO-SKD-V2 · 0 · 2 · Feb 2026
nostalgicskinco · Cold · Tools · 1B · 32K · air-compliance-llama-1b · 0 · 2 · Feb 2026
spar-project · Cold · Tools · 3B · 32K · Llama-3.2-3B-Instruct-attention-layers · 0 · 2 · Mar 2026
spar-project · Cold · Tools · 3B · 32K · Llama-3.2-3B-Instruct-minimal-layers · 0 · 2 · Mar 2026
spar-project · Cold · Tools · 3B · 32K · Llama-3.2-3B-Instruct-layers-16-to-24 · 0 · 2 · Mar 2026
bimabk · Cold · Tools · 3B · 32K · test_gin_rummy_qwen_2-5_3B · 0 · 2 · Mar 2026
CCCCCyx · Cold · Tools · 3B · 32K · Llama-3.2-3B-Instruct_slime · 0 · 2 · Mar 2026
rbelanec · Cold · Tools · 1B · 32K · train_mrpc_42_1774791061 · 0 · 2 · Mar 2026
rbelanec · Cold · Tools · 1B · 32K · train_boolq_42_1774791063 · 0 · 2 · Mar 2026
ClaudioSavelli · Cold · Tools · 1B · 32K · FAME-topics_PO_llama32-1b-instruct-qa · 0 · 2 · Apr 2026
ClaudioSavelli · Cold · Tools · 1B · 32K · FAME-topics_GA_llama32-1b-instruct-qa · 0 · 2 · Apr 2026
ClaudioSavelli · Cold · Tools · 3B · 32K · FAME-topics_PO_llama32-3b-instruct-qa · 0 · 2 · Apr 2026
Evangelinejy · Cold · Tools · 3B · 32K · llama_3b_instruct_think_sft_nopack_lr1.5e5_ep3 · 0 · 2 · Mar 2026