Models

8,392
1B · 2K · tinyllama-1b1 · Warm · yihanwang617/tinyllama-sft-vicuna-full-no-completion-mask · 0 · 10 · May 2024
3B · 32K · llama32-3b · Warm · HuggingFaceTB/finemath-ablation-fwedu · 0 · 10 · Dec 2024
8B · 32K · qwen2-7b · Warm · huihui-ai/Dria-Agent-a-7B-abliterated · 3 · 10 · Jan 2025
1B · 32K · gemma3t-1b · Warm · mlkro/gemma-3-1b-it-PT-SynthDolly-2A · 0 · 10 · Nov 2025
4B · 32K · qwen3-4b · Warm · akshayballal/Qwen3-4B-Pubmed-16bit-GRPO · 0 · 10 · Jan 2026
4B · 32K · qwen3-4b · Warm · JoshXT/AGiXT-Qwen3-4B · 1 · 10 · Jan 2026
2B · 32K · qwen2-1b5 · Warm · aki-008/model-16bit-grpo · 0 · 10 · Feb 2026
4B · 32K · qwen3-4b · Warm · NamuTechnology/NamuLM · 1 · 10 · Feb 2026
4B · 32K · qwen3-4b · Warm · koutch/qwen_2.json_train_grpo_v1_train_code · 0 · 10 · Feb 2026
4B · 32K · qwen3-4b · Warm · Battogtokh/Qwen3-4B-Instruct-unsloth-FinAdvisor-16bit · 1 · 10 · Jan 2026
4B · 32K · qwen3-4b · Warm · beachcities/qwen3-4b-sft-dpo-v2-structeval · 0 · 10 · Feb 2026
4B · 32K · qwen3-4b · Warm · hikahika/dpo-qwen-cot-merged · 0 · 10 · Feb 2026
3B · 32K · qwen25-3b · Warm · DXCLab/OncoCareBrain-GPT · 2 · 10 · Mar 2025
4B · 32K · qwen3-4b · Warm · koguma-ai/sft-dpo-qwen-cot-merged0207_unsloth_03 · 0 · 10 · Feb 2026
4B · 32K · qwen3-4b · Warm · kenzrx/dpo-qwen-cot-merged · 0 · 10 · Feb 2026
3B · 32K · llama32-3b · Warm · pere/llama3.2-3B-reasoning-norwegian · 1 · 10 · Feb 2025
4B · 32K · qwen3-4b · Warm · tellang/yeji-4b-instruct-v9 · 0 · 10 · Feb 2026
4B · 32K · qwen3-4b · Warm · hnda/qwen3-4b-alf-sft-merged · 0 · 10 · Feb 2026
4B · 32K · qwen3-4b · Warm · Tamata1208/dpo-qwen-cot-merged · 0 · 10 · Feb 2026
4B · 32K · qwen3-4b · Warm · da1ch812/advanced-comp-model · 0 · 10 · Feb 2026