Models

3,519
1B · 32K · llama32-1b · Warm
kowndinya23/ultrafeedback_binarized-alpaca-llama-3-1b-2-epochs-alpha-0.8-beta-0-2-epochs
0 · 2

1B · 32K · llama32-1b · Warm
kowndinya23/ultrafeedback_binarized-alpaca-llama-3-1b-2-epochs-alpha-0.4-beta-0.2-2-epochs
0 · 2

3B · 32K · llama32-3b · Warm
peachfawn/llama3ClinicalTrialFinalFineTuned
0 · 2

3B · 32K · llama32-3b · Warm
kenken6696/Llama-3.2-3B_3x3_mix_position
0 · 2

3B · 32K · llama32-3b · Warm
deswaq/juh12
0 · 2

1B · 32K · llama32-1b · Warm
open-unlearning/neg_tofu_Llama-3.2-1B-Instruct_retain90_lr4e-05_wd0.01_epoch10
0 · 2

3B · 32K · llama32-3b · Warm
masani/SFT_gsm8k_Llama-3.2-3B_epoch_1_global_step_29
0 · 2 · May 2025

3B · 32K · llama32-3b · Warm
activeDap/Llama-3.2-3B_ultrafeedback_chosen
0 · 2 · Nov 2025

3B · 32K · llama32-3b · Warm
activeDap/Llama-3.2-3B_hh_helpful
0 · 2 · Nov 2025

3B · 32K · llama32-3b · Warm
swadeshb/Llama-3.2-3B-Instruct-VMPO-V1
0 · 2

3B · 32K · llama32-3b · Warm
ahme0599/meta-llama_Llama-3.2-3B-Instruct-GRPO-vanilla_G_4
0 · 2 · Dec 2025

1B · 32K · llama32-1b · Warm
gshasiri/dpo-llama3.2-sapo-200
0 · 2 · Dec 2025

1B · 32K · llama32-1b · Warm
ShahriarFerdoush/llama-3.2-1b-math-solver
0 · 2 · Dec 2025

3B · 32K · llama32-3b · Warm
rrvaswin/64b_SFT
0 · 2 · Jan 2026

1B · 32K · llama32-1b · Warm
gshasiri/SmolLM3-Mid-Second-Round
0 · 2 · Nov 2025

1B · 32K · llama32-1b · Warm
W-61/hh-llama32-1b-sft
0 · 2 · Jan 2026

1B · 32K · llama32-1b · Warm
gshasiri/dpo-llama3.2-gspo-original-400
0 · 2 · Dec 2025

3B · 32K · llama32-3b · Warm
rrvaswin/32b_SFT
0 · 2 · Jan 2026

3B · 32K · llama32-3b · Warm
rrvaswin/4b_RL_DAPO
0 · 2 · Jan 2026

3B · 32K · llama32-3b · Warm
rrvaswin/8b_RL_DAPO
0 · 2 · Jan 2026