Model Releases

| Base model | Model | Date | | |
|---|---|---|---|---|
| phi2-3b | marcel/phi-2-openhermes-30k | Jan 2024 | 0 | 108 |
| qwen3-4b | mlxha/Qwen3-4B-grpo-medmcqa | May 2025 | 2 | 57 |
| qwen15-0b5 | FreedomIntelligence/Apollo-0.5B | Mar 2024 | 3 | 341 |
| qwen3-14b | JetBrains-Research/Qwen3-14B-am | May 2025 | 0 | 53 |
| phi2-3b | StanfordAIMI/RadPhi-2 | Mar 2024 | 1 | 315 |
| qwen3-8b | DragonLLM/Qwen-Open-Finance-R-8B | Oct 2025 | 6 | 314 |
| llama31-8b | DragonLLM/Llama-Open-Finance-8B | Oct 2025 | 14 | 5,190 |
| tinyllama-1b1 | VishalMysore/cookgptlama | Dec 2023 | 3 | 6,309 |
| gemma3-4b | unsloth/medgemma-1.5-4b-it | Jan 2026 | 5 | 5,170 |
| mistral-nemo | DavidAU/Mistral-Nemo-2407-12B-Thinking-Claude-Gemini-GPT5.2-Uncensored-HERETIC | Jan 2026 | 14 | 1,029 |
| qwen3-1b7 | MultiRL/qwen3_1.7b_sudoku_one_act_new | Jan 2026 | 0 | 30 |
| qwen3-8b | sagnikM/grpo_sgd_qwen3-8b_3k_seqlen_momentum_0p9_1e-2 | Jan 2026 | 0 | 30 |
| qwen3-1b7 | johnceballos/Affine-std-5F53PDhPD9wr3utc1x5E3sLNHT68wPMDHHSKB33iEap36Dxs | Jan 2026 | 0 | 88 |
| qwen3-8b | huseyinatahaninan/appworld_distillation_sft_v2-SFT-Qwen3-8B | Jan 2026 | 0 | 41 |
| qwen3-1b7 | MultiRL/qwen3_1.7b_rush_hour_one_move_sft | Jan 2026 | 0 | 1 |
| qwen3-1b7 | MultiRL/qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_6__global_step_592 | Jan 2026 | 0 | 0 |
| qwen3-1b7 | MultiRL/qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_6__global_step_296 | Jan 2026 | 0 | 0 |
| qwen3-1b7 | MultiRL/qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_6__global_step_888 | Jan 2026 | 0 | 0 |
| qwen3-1b7 | MultiRL/qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_6__global_step_1184 | Jan 2026 | 0 | 26 |
| qwen3-1b7 | MultiRL/qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_6__global_step_1480 | Jan 2026 | 0 | 45 |