Models

3,749
haji80mr-uoft · Cold · Tools · 3B · 32K
gpt-semi-wtype-Llama-tuned-Lora-merged-gpt5
0 · 4 · Apr 2026

Alelcv27 · Cold · Tools · 3B · 32K
Llama3.2-3B-DELLA-Math-Code
0 · 4 · Apr 2026

sathiiiii · Cold · Tools · 3B · 32K
polyalign-llama3.2-3b-en-sft
0 · 4 · Apr 2026

Alelcv27 · Cold · Tools · 3B · 32K
Llama3.2-3B-DareTIES-Math-Code
0 · 4 · Apr 2026

Alelcv27 · Cold · Tools · 3B · 32K
Llama3.2-3B-Dare-Math-Code
0 · 4 · Apr 2026

Navneetkumar11 · Cold · Tools · 1B · 32K
cloud-agent
0 · 4 · Apr 2026

Alelcv27 · Cold · Tools · 3B · 32K
Llama3.2-3B-BreadcrumbsTIES-Math-Code
0 · 4 · Apr 2026

Alelcv27 · Cold · Tools · 3B · 32K
Llama3.2-3B-TIES-Math-Code
0 · 4 · Apr 2026

Alelcv27 · Cold · Tools · 3B · 32K
Llama3.2-3B-Arcee-Code-Math
0 · 4 · Apr 2026

Alelcv27 · Cold · Tools · 3B · 32K
Llama3.2-3B-SLERP-Math-Code
0 · 4 · Apr 2026

gzone0111 · Cold · Tools · 3B · 32K
AutoGraphR1-musique_hotpotqa_train-llama3.2-3b-text-retriever-grpo-repetition-penalty
0 · 4 · Oct 2025

DJCheng · Cold · Tools · 1B · 32K
Latent-SFT-Llama3.2-Instruct-1B-COT-SFT
0 · 4 · Oct 2025

MInAlA · Cold · Tools · 3B · 32K
Llama-3.2-3B-Instruct-GRPO-merged
0 · 4 · Apr 2026

Jiajunruan · Cold · Tools · 1B · 32K
Minmax-TOFU-2
0 · 4 · May 2026

Novaciano · Cold · Tools · 1B · 32K
qp-3.2-1B
0 · 4 · Jan 2026

abdulhafis · Cold · Tools · 1B · 32K
dagbani-llama32-lora-finetuned
0 · 4 · May 2026

Enthusiast101 · Cold · Tools · 3B · 32K
Llama-3.2-3B-Instruct-hhrlhf
0 · 4 · May 2026

hyeonss0417 · Cold · Tools · 1B · 32K
assn2-simpo-llama-1b
0 · 4 · May 2026

parkjo · Cold · Tools · 3B · 32K
Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_resume_epoch8_20260429_145921_step232
0 · 4 · May 2026

jaehookim · Cold · Tools · 1B · 32K
hw2-dpo
0 · 4 · May 2026

Jeesup · Cold · Tools · 1B · 32K
tofu_Llama-3.2-1B-Instruct_forget10_NPO_qat-int4
0 · 4 · May 2026

Lexsi · Cold · Tools · 3B · 32K
llama32-3b-medical-sft-drift
0 · 4 · May 2026

Jeesup · Cold · Tools · 1B · 32K
tofu_1B_f10_GD_lr1e-5_a1.0
0 · 4 · May 2026

Jeesup · Cold · Tools · 1B · 32K
tofu_1B_f10_RMU_lr1e-5_sc10
0 · 4 · May 2026

Yhhxhfh · Cold · Tools · 1B · 32K
fdcbbcdf
0 · 3

ank028 · Cold · Tools · 1B · 32K
Llama-3.2-1B-Instruct-commonsense_qa-MGSM8K-sft1-slerp
0 · 3

nongfuyulang · Cold · Tools · 1B · 32K
engineer-heavy-500k-barc-llama3.1-8b-ins-fft-induction_lr1e-5_epoch3
0 · 3 · Nov 2024

haihp02 · Cold · Tools · 1B · 32K
c717bb90-3c4c-4fab-947c-310e4cec2d00
0 · 3

qkrqudwn2 · Cold · Tools · 1B · 32K
Llama3-weeslee-Ko-3.2-3B
0 · 3

Ayush-Singh · Cold · Tools · 1B · 32K
llama1b-sft
0 · 3

pdimas · Cold · Tools · 1B · 32K
helpfulpharmacyllm_mb-rlhf-01
0 · 3

Ayush-Singh · Cold · Tools · 1B · 32K
Llama-3.2-1B-SFT
0 · 3

pdimas · Cold · Tools · 1B · 32K
BaseModel-rlhf-01
0 · 3

rrvaswin · Cold · Tools · 1B · 32K
DAPO_GRPO_4b_incorrect_bs_32_mb_8_n16_cliphigh
0 · 3 · Jan 2026

spar-project · Cold · Tools · 3B · 32K
Llama-3.2-3B-Instruct-mlp-layers
0 · 3 · Mar 2026

Jordansky · Cold · Tools · 3B · 32K
liarsdice-smoketest-hashid
0 · 3 · Mar 2026

rbelanec · Cold · Tools · 1B · 32K
train_cola_42_1774791067
0 · 3 · Mar 2026

rbelanec · Cold · Tools · 1B · 32K
train_rte_42_1774791065
0 · 3 · Mar 2026

Evangelinejy · Cold · Tools · 3B · 32K
llama_3b_instruct_non_think_sft_nopack_lr1.5e5_ep3
0 · 3 · Mar 2026

j05hr3d · Cold · Tools · 3B · 32K
Llama-3.2-3B-Instruct-C_M_T-DOLLY
0 · 3 · Mar 2026

j05hr3d · Cold · Tools · 3B · 32K
Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM
0 · 3 · Mar 2026

phanviethoang1512 · Cold · Tools · 1B · 32K
llama3.2-1b-deita-dpo-student_sft_init
0 · 3 · Mar 2026