Models

2,852
24B32Kmistral-24b
Warm

baconnier/Napoleon_24B_V0.0

0
·
11
500M32Kqwen2-0b5
Warm

masa-research/Qwen2-0.5B_20240829_175401

0
·
11
500M32Kqwen2-0b5
Warm

Goekdeniz-Guelmez/J.O.S.I.E.v4o-0.5b-stage1-beta1

1
·
11
3B8Kgemma2-2b
Warm

somosnlp/GemmaColRAC-AeroExpertV4

1
·
11
1B32Kllama32-1b
Warm

student-abdullah/Llama3.2_Medicine-Hinglish-Dataset_Fine-Tuned_29-09

0
·
11
1B32Kllama32-1b
Warm

keithdrexel/unsloth-llama-3.2-1b-tldr-unsloth-dpo_mid_checkpoint_3

0
·
11
33B32Kqwen25-32b
Warm

InferenceIllusionist/MilkDropLM-32b-v0.3

13
·
11
·
Dec 2024
800M32Kqwen3-0b6
Warm

affanshaikhsurab/qwen3-0.6b-gpqa-learning-regularized

0
·
11
·
Jan 2026
2B32Kqwen3-1b7
Warm

HillPhelmuth/Qwen3-4B-GRPO-MathsFT

0
·
11
·
May 2025
4B32Kqwen3-4b
Warm

TSjB/QM-4B

0
·
11
·
Jan 2026
2B32Kqwen2-1b5
Warm

aki-008/model-16bit-grpo

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

Ba2han/qwen_augment-inst

0
·
11
·
Feb 2026
3B32Kllama32-3b
Warm

JoPmt/Llama-3.2-3B-Instruct

0
·
11
·
Sep 2024
4B32Kqwen3-4b
Warm

koutch/qwen_2.json_train_grpo_v1_train_code

0
·
11
·
Feb 2026
3B8Kgemma2-2b
Warm

Signvrse/Glosser_Gemma2_2B

0
·
11
·
Aug 2025
3B32Kllama32-3b
Warm

hamdanbinhashim/NosirAI-Mini

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

KotaroT1/dpo-qwen-cot-merged

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

Hi-Satoh/adv_sft5_dpo3_merged

0
·
11
·
Feb 2026
8B32Kqwen2-7b
Warm

Featherlabs/Aura-7b

1
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

wan-wan/test10-dpo

0
·
11
·
Feb 2026