Model Releases

qwen3-1b7

JameSand/qwen3-1.7b-base-adam-3e-6-bs128-kl0.0-global_step_200

Jan 2026

0

108

qwen25-3b

akcit-motion/qwen2.5-3b-instruct-motion-base

Jan 2026

1

325

qwen2-0b5

AnotherMiner/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-hibernating_agile_marmot

Nov 2025

0

311

qwen3-1b7

mlabonne/Qwen3-1.7B-abliterated

Apr 2025

15

951

llama32-1b

distil-labs/Distil-PII-Llama-3.2-1B-Instruct

Oct 2025

6

186

qwen3-1b7

ericoh929/qwen3-1.7b-huggingfaceh4-instruction-data-lora-instruction-tuned

Jan 2026

0

1,102

qwen3-1b7

Klingspor/StarPO-1.7B

Jan 2026

0

150

qwen2-1b5

cdomingoenrich/qwen15_code200tok_step1750

Jan 2026

0

90

qwen3-0b6

ellamind/propella-1-0.6b

Jan 2026

2

449

llama32-3b

Evangelinejy/llama3b-midtrain-data_sft_50k_leon_nemotron_thinking-bs4-epoch1.0-ctx8192-ga1-lr5e-06-wr0.1-n4

Nov 2025

0

103

qwen2-0b5

0xBonge/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-flexible_fierce_owl

Nov 2025

0

2,622

phi2-3b

marcel/phi-2-openhermes-30k

Jan 2024

0

60

qwen3-4b

mlxha/Qwen3-4B-grpo-medmcqa

May 2025

2

113

qwen15-0b5

FreedomIntelligence/Apollo-0.5B

Mar 2024

3

122

gemma-2b

Edcastro/gemma-2b-it-edcastr_JavaScript-v8

Jan 2026

0

692

qwen3-1b7

akshayballal/Qwen3-1.7B-Pubmed-16bit-GRPO

Jan 2026

0

543

qwen2-1b5

ahmadmakk/Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-slithering_scampering_anteater

Dec 2025

0

1,599

qwen2-0b5

delinkz/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-lightfooted_humming_gull

Nov 2025

0

353

qwen3-14b

JetBrains-Research/Qwen3-14B-am

May 2025

0

3,078

qwen25-3b

CriteriaPO/qwen2.5-3b-dpo-coarse

May 2025

0

189
