1B Parameter LLMs — Page 140

6,680

emmastubbyCold1B32K

gemma-3-1b-it-sst5-merged

0

·

8

·

Apr 2026

blackbook-lmColdTools2B32K

Qwen2.5-1.5b-Instruct-heretic

0

·

8

·

Apr 2026

xw1234ganColdTools2B32K

NuminaMath_Main_fixed_SFTanchor_1_5B_step_1

0

·

8

·

Apr 2026

manhcuong2005ColdTools2B32K

qwen2.5-1.5b-legal-edu-v5

0

·

8

·

Apr 2026

xw1234ganColdTools2B32K

Main_fixed_MATH_1_5B_BaseAnchor_step_7

0

·

8

·

Apr 2026

ayousefi-pinsCold1B32K

gemma-3-1b-medical-finetuned

0

·

8

·

Apr 2026

cjziemsColdTools1B32K

Llama3-1B-psych101

0

·

8

·

Apr 2026

divelabColdTools2B32K

DAPO_E2H-math-gaussian_0p5_0p5

0

·

8

·

Apr 2026

xw1234ganColdTools2B32K

Main_fixed_MATH_1_5B_BaseAnchor_step_8

0

·

8

·

Apr 2026

xw1234ganColdTools2B32K

NuminaMath_Main_fixed_SFTanchor_1_5B_step_2

0

·

8

·

Apr 2026

kabilesh-cColdTools2B32K

daedalus-designer

0

·

8

·

Apr 2026

jekunzCold1B32K

Gemma-3-1B-pt-is-SmolTalk

0

·

8

·

Apr 2026

manhcuong2005ColdTools2B32K

qwen2.5-1.5b-legal-intent

0

·

8

·

Apr 2026

xw1234ganColdTools2B32K

cnk12_Main_fixed_BaseAnchor_1_5B_step_3

0

·

8

·

Apr 2026

divelabColdTools2B32K

DAPO_E2H-gsm8k-gaussian_0p25_0p75

0

·

8

·

Apr 2026

manhcuong2005ColdTools2B32K

qwen2.5-1.5b-legal-edu-v4

0

·

8

·

Apr 2026

daredevil467ColdTools2B32K

hanoi-router-qwen25-15b

0

·

8

·

Apr 2026

jinvallColdTools2B32K

Qwen2.5-Coder-1.5B-Instruct

0

·

8

·

Apr 2026

daredevil467ColdTools2B32K

hanoi-router-qwen25-15b-v6

0

·

8

·

Apr 2026

FardanColdTools2B32K

Qwen2.5-1.5B-Instruct-Math-Reasoning-GRPO-Tuned

0

·

8

·

Apr 2026

WhipStudioColdTools2B32K

Qwen2.5-1.5B-Instruct-ForgeArena-Overseer

0

·

8

·

Apr 2026

Sanjarbek1024Cold1B2K

tinyllama-medquad-merged

0

·

8

·

Apr 2026

KyleyeeColdTools2B32K

VRPO_hh-seed2

0

·

8

·

Apr 2026

KyleyeeColdTools2B32K

DPO_hh-seed4

0

·

8

·

Apr 2026

alexxbobrColdTools1B32K

ORPO8000Vikhr-Llama-3.2-1B-Instruct5000

0

·

8

·

Apr 2026

raalrColdTools2B32K

Qwen2.5-1.5B-Instruct-ULD-gemma-3-27b-it

0

·

8

·

Apr 2026

xw1234ganColdTools2B32K

NuminaMath_Main_fixed_SFTanchor_1_5B_step_5

0

·

8

·

Apr 2026

xw1234ganColdTools2B32K

GRPO_KL_Qwen2.5-1.5B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN

0

·

8

·

Apr 2026

KyleyeeColdTools2B32K

VRPO_hh-seed4

0

·

8

·

Apr 2026

raalrColdTools2B32K

Qwen2.5-1.5B-Instruct-dskdv2-Qwen

0

·

8

·

Apr 2026

seopboColdTools2B32K

zerorlvrcode-qwen2.5-1.5b

0

·

8

·

Apr 2026

BUGIEColdTools2B32K

safeguardian-guardian

0

·

8

·

Apr 2026

olusegunolaCold1B2K

phi-1.5-stage3-sft-cloned-seed42-merged

0

·

8

·

Apr 2026

mironazaCold1B2K

zerp7

0

·

8

·

Sep 2025

Soea511ColdTools2B32K

Godot-Native-AI-Brain

0

·

8

·

May 2026

DJChengColdTools1B32K

Latent-SFT-Llama3.2-Instruct-1B-COT-SFT

0

·

8

·

Oct 2025

emajoch1Cold1B32K

gemma-3-1b-lora-abstention

0

·

8

·

May 2026

knovelengColdTools2B32K

Open-RS2

1

·

8

·

Mar 2025

zhaohqColdTools2B32K

PureRL-1.5B-v5-06-uccp

0

·

8

·

May 2026

ClaudioSavelliColdTools1B32K

FAME_PO_llama32-1b-10-instruct-qa

0

·

8

·

May 2026

zhaohqColdTools2B32K

PureRL-1.5B-v6d4-lam01-sigmoid-maskoff-acc05

0

·

8

·

May 2026

rafiqiraihanColdTools2B32K

qwen-rag-indonesia

0

·

8

·

May 2026