Models

4,286
1B32Kllama32-1b
Warm

BleachNick/Llama-3.2-1B-Instruct-GRPO-45k_RAGv1.5

0
·
1
1B32Kllama32-1b
Warm

gorizont/main-train

0
·
1
1B32Kllama32-1b
Warm

jasonrb/llama-3.2-1B_gsm8k_sft_no_eos

0
·
1
1B32Kllama32-1b
Warm

zisisbatzos/llama3.2-1B-GRPO

0
·
1
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-d4-NoReg-WO_NoHealth

0
·
1
1B32Kllama32-1b
Warm

ceciliaacosta78/checkpoints

0
·
1
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-In-OWTWM-DW-Al4-wmToken-d4-a0.1-v3-meta-OWT-LA

0
·
1
1B32Kllama32-1b
Warm

Trelis/Llama-3.2-1B-Instruct_GRPO_1_chkpt100_16bit

0
·
1
1B32Kllama32-1b
Warm

Grogros/dmWM-llama-3.2-1B-Instruct-HarmData-Al4-OWT-d4-a0.25

0
·
1
1B32Kllama32-1b
Warm

akhilsheri57/llama-1b-new

0
·
1
1B32Kllama32-1b
Warm

gorizont/test2

0
·
1
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-LucieFr-Al4-OWT-d4-a0.1-v2-learnability_adv

0
·
1
1B32Kllama32-1b
Warm

axolotl-ai-co/numina-1b-ep3-lr3e-5-sft

0
·
1
1B32Kllama32-1b
Warm

Grogros/Grogros-dmWM-Llama-3.2-1B-Instruct-ft-M-A-O-d4-a0.25-ft-learnability_adv

0
·
1
1B32Kllama32-1b
Warm

quancute/Llama-3.2-1B-Instruct_sum-10k_2Mar-2025_A100

0
·
1
·
Mar 2025
1B32Kllama32-1b
Warm

Dev8318/custom-Llama-2-1b

0
·
1
1B32Kllama32-1b
Warm

ezhf2024/Llama-3_2-ft

0
·
1
3B8Kgemma2-2b
Warm

xw17/gemma-2-2b-it_finetuned_4_optimized1_task_grouping_off_FT

0
·
1
3B8Kgemma2-2b
Warm

ma921/gemma2_h_dpo_golden-hh_noise40_epoch3_gamma2

0
·
1
3B8Kgemma2-2b
Warm

TongZheng1999/gemma-2-2b-it-star-3Rounds-iter-3

0
·
1