Models

8,393
1B2Ktinyllama-1b1
Warm

yihanwang617/tinyllama-sft-vicuna-full-no-completion-mask

0
·
9
·
May 2024
3B8Kgemma2-2b
Warm

qingy2024/GRMR-2B-Instruct-old

12
·
9
·
Dec 2024
3B32Kllama32-3b
Warm

HuggingFaceTB/finemath-ablation-fwedu

0
·
9
·
Dec 2024
4B32Kqwen3-4b
Warm

and-emili/aera-4b

0
·
9
·
May 2025
1B2Ktinyllama-1b1
Warm

jburtoft/tinyllama-codewords

0
·
9
·
Dec 2025
4B32Kqwen3-4b
Warm

Roman0/Qwen3-4B-Instruct-2507-heretic

0
·
9
·
Dec 2025
4B32Kqwen3-4b
Warm

staeiou/bartleby-Qwen3-4B-2507

0
·
9
·
Jan 2026
3B32Kqwen25-3b
Warm

gjyotin305/Qwen2.5-3B-Instruct_new_alpaca_007

0
·
9
·
Jan 2026
3B32Kllama32-3b
Warm

sail/Llama-3.2-3B-Oat-Zero

1
·
9
·
Mar 2025
4B32Kqwen3-4b
Warm

akshayballal/Qwen3-4B-Pubmed-16bit-GRPO

0
·
9
·
Jan 2026
1B32Kgemma3t-1b
Warm

vinhnx90/gemma-3-1b-thinking-v2

1
·
9
·
Mar 2025
2B32Kqwen2-1b5
Warm

aki-008/model-16bit-grpo

0
·
9
·
Feb 2026
4B32Kqwen3-4b
Warm

koutch/qwen_qwen3-instruct-4b_train_sft_train_para

0
·
9
·
Feb 2026
4B32Kqwen3-4b
Warm

NamuTechnology/NamuLM

1
·
9
·
Feb 2026
3B8Kgemma2-2b
Warm

issoufzousko07/BABA-IA-2B

1
·
9
·
Feb 2026
4B32Kqwen3-4b
Warm

TSerizawa/llm-lecture-2025_dpo-qwen-cot-merged_base_model

0
·
9
·
Feb 2026
4B32Kqwen3-4b
Warm

koutch/qwen_2.json_train_grpo_v1_train_code

0
·
9
·
Feb 2026
4B32Kqwen3-4b
Warm

Battogtokh/Qwen3-4B-Instruct-unsloth-FinAdvisor-16bit

1
·
9
·
Jan 2026
4B32Kqwen3-4b
Warm

beachcities/qwen3-4b-sft-dpo-v2-structeval

0
·
9
·
Feb 2026
4B32Kqwen3-4b
Warm

hikahika/dpo-qwen-cot-merged

0
·
9
·
Feb 2026