Models

3,749
parkjoColdTools3B32K

Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_Use_KL_0.001_step580

0
·
2
·
Apr 2026
Sudarshan1607ColdTools1B32K

ddp-llama32-1b-ultrachat

0
·
2
·
May 2026
Enthusiast101ColdTools1B32K

llama3.2-1b-Inst-aaq

0
·
2
·
May 2026
parkjoColdTools3B32K

Llama-3.2-3B-Instruct_grpo_adv_rollout_8_20260502_233833_step580

0
·
2
·
May 2026
parkjoColdTools3B32K

Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_20260501_120104_step580

0
·
2
·
May 2026
parkjoColdTools3B32K

Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_20260501_115927_step580

0
·
2
·
May 2026
parkjoColdTools3B32K

Llama-3.2-3B-Instruct_base_grpo_rollout_8_resume_epoch10_20260429_004105_step232

0
·
2
·
May 2026
parkjoColdTools3B32K

Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_resume_epoch10_20260429_004543_step290

0
·
2
·
May 2026
parkjoColdTools3B32K

Llama-3.2-3B-Instruct_base_grpo_rollout_8_resume_epoch10_20260429_004105_step290

0
·
2
·
May 2026
Geon10102ColdTools1B32K

assn2-sft-llama32-1b

0
·
2
·
May 2026
Geon10102ColdTools1B32K

assn2-dpo-llama32-1b

0
·
2
·
May 2026
hyeonss0417ColdTools1B32K

assn2-sft-llama-1b

0
·
2
·
May 2026
LexsiColdTools3B32K

llama32-3b-dolly-sft-drift

0
·
2
·
May 2026
LexsiColdTools3B32K

llama32-3b-code-sft-drift

0
·
2
·
May 2026
Enthusiast101ColdTools1B32K

llama3.2-1b-Inst-safemerge

0
·
2
·
May 2026
somukandulaColdTools1B32K

cx-filler-model

0
·
2
·
Apr 2026
NathanRollColdTools1B32K

Llama-3.2-1B-Instruct-0k-shuffle-x

0
·
1
rrvaswinColdTools1B32K

DAPO_GRPO_16b_incorrect_bs_32_mb_8_n16_cliphigh

0
·
1
·
Jan 2026
mihirrajdColdTools3B32K

llama_finetune_16bit

0
·
1
·
Mar 2026
EvangelinejyColdTools3B32K

llama_3b_base_non_think_sft_nopack_lr1.5e5_ep3

0
·
1
·
Mar 2026
Enthusiast101ColdTools1B32K

llama3.2-1b-Inst-resta

0
·
1
·
Apr 2026
parkjoColdTools3B32K

Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_20260429_004543_step580

0
·
1
·
May 2026
EpistemeAIColdTools1B32K

ReasoningCore-1B-T1

1
·
0
avinotColdTools1B32K

LoLlama-3.2-1B-lora-3ep-v3-instruct

0
·
0
·
May 2025
GetSoloTechColdTools1B32K

Llama3.2-1B-Med-Transcript-Notes

0
·
0
NathanRollColdTools1B32K

Llama-3.2-1B-Instruct-Unablated

0
·
0
NathanRollColdTools1B32K

Llama-3.2-1B-Instruct-0q-shuffle

0
·
0
NathanRollColdTools1B32K

Llama-3.2-1B-Instruct-0q-shuffle-x

0
·
0
NathanRollColdTools1B32K

Llama-3.2-1B-Instruct-1o-shuffle-x

0
·
0
NathanRollColdTools1B32K

Llama-3.2-1B-Instruct-1q-shuffle-x

0
·
0
NathanRollColdTools1B32K

Llama-3.2-1B-Instruct-0v-shuffle-x

0
·
0
NathanRollColdTools1B32K

Llama-3.2-1B-Instruct-0o-shuffle-x

0
·
0
NathanRollColdTools1B32K

Llama-3.2-1B-Instruct-1v-shuffle-x

0
·
0
NathanRollColdTools1B32K

Llama-3.2-1B-Instruct-1k-shuffle-x

0
·
0
aimlresearch2023ColdTools1B32K

llama-3.2-1b-it-merged-llama-factory

0
·
0
HsianchengfunColdTools1B32K

1B-40epoch

0
·
0
Jia-aoColdTools1B32K

Llama-3.2-1B-Instruct-Explainable-Propaganda-Detection-old

0
·
0
HsianchengfunColdTools1B32K

1B-80epoch

0
·
0
TOMFORD79ColdTools3B32K

model17

0
·
0
WenFenggColdTools3B32K

guys_6

0
·
0
WenFenggColdTools3B32K

guys_1

0
·
0
OnDeviceMedNotesColdTools1B32K

Medical_Summary_Notes

1
·
0