Models

2,591
4B32Kqwen3-4b
Warm

TeichAI/Qwen3-4B-RA-SFT-Polaris-Alpha-Distill

3
·
12
·
Feb 2026
4B32Kqwen3-4b
Warm

hnda/qwen3-4b-alf-traj-v5-2ep-merged

0
·
12
·
Mar 2026
4B32Kqwen3-4b
Warm

bam2app/dpo-qwen-cot-merged_v1

0
·
12
·
Mar 2026
4B32Kqwen3-4b
Warm

Sumiokashi/qwen3-4b-structured-3k-mix-sft_lora-dpo-qwen-cot-merged

0
·
12
·
Mar 2026
4B32Kqwen3-4b
Warm

myfi/parser_model_ner_4.02

0
·
12
·
Mar 2026
4B32Kqwen3-4b
Warm

sfutenma/dpo-qwen3_4b-cot-merged_v260302-112329

0
·
12
·
Mar 2026
8B8Kllama3-8b
Warm

tomasonjo/text2cypher-demo-16bit

26
·
11
·
May 2024
8B32Kllama31-8b
Warm

Vedant3907/Text-to-Sql-llama3.1-8B

3
·
11
500M32Kqwen2-0b5
Warm

masa-research/Qwen2-0.5B_20240829_175401

0
·
11
500M32Kqwen2-0b5
Warm

Goekdeniz-Guelmez/J.O.S.I.E.v4o-0.5b-stage1-beta1

1
·
11
1B32Kllama32-1b
Warm

student-abdullah/Llama3.2_Medicine-Hinglish-Dataset_Fine-Tuned_29-09

0
·
11
1B32Kllama32-1b
Warm

keithdrexel/unsloth-llama-3.2-1b-tldr-unsloth-dpo_mid_checkpoint_3

0
·
11
33B32Kqwen25-32b
Warm

InferenceIllusionist/MilkDropLM-32b-v0.3

13
·
11
·
Dec 2024
2B32Kqwen3-1b7
Warm

HillPhelmuth/Qwen3-4B-GRPO-MathsFT

0
·
11
·
May 2025
4B32Kqwen3-4b
Warm

koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_edit

0
·
11
·
Jan 2026
4B32Kqwen3-4b
Warm

TSjB/QM-4B

0
·
11
·
Jan 2026
2B32Kqwen2-1b5
Warm

aki-008/model-16bit-grpo

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

Ba2han/qwen_augment-inst

0
·
11
·
Feb 2026
3B32Kllama32-3b
Warm

JoPmt/Llama-3.2-3B-Instruct

0
·
11
·
Sep 2024
4B32Kqwen3-4b
Warm

koutch/qwen_2.json_train_grpo_v1_train_code

0
·
11
·
Feb 2026