Models

32,702
3B · 32K · qwen25-3b · Warm
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-grpo
1 · 566 · Mar 2025

1B · 2K · phi-1b4 · Warm
olusegunola/phi-1.5-distill-v2-Ablation_Linear_Arch-merged
0 · 566 · Apr 2026

8B · 32K · llama31-8b · Warm
sh2orc/Llama-3.1-Korean-8B-Instruct
23 · 564 · Jul 2024

8B · 32K · llama31-8b · Warm
tokyotech-llm/Llama-3.1-Swallow-8B-v0.5
9 · 564 · Apr 2025

2B · 32K · qwen2-1b5 · Warm
graf/qwen2.5-1.5b-instruct-sft-test-wmv0.5.4-lr1e-7
0 · 563 · Jan 2026

1B · 2K · phi-1b4 · Warm
olusegunola/phi-1.5-distill-v2-Proposed_MLP_L2_Beta2.0-merged
0 · 563 · Apr 2026

1B · 2K · tinyllama-1b1 · Warm
Josephgflowers/TinyLlama-3T-Cinder-v1.2
4 · 562 · Dec 2023

8B · 32K · llama31-8b · Warm
deepcogito/cogito-v1-preview-llama-8B
51 · 561 · Mar 2025

2B · 32K · qwen25-1b5 · Warm
jinaai/reader-lm-1.5b
608 · 561 · Sep 2024

4B · 32K · qwen3-4b · Warm
orbit-ai/infoseeker-repro-4b
0 · 561 · Mar 2026

2B · 32K · qwen2-1b5 · Warm
abhinavakarsh0033/model_sft_resta
0 · 561 · Mar 2026

2B · 32K · qwen2-1b5 · Warm
graf/qwen2.5-1.5b-instruct-sft-test-wmv0.5.4-lr1e-6
0 · 558 · Jan 2026

500M · 32K · qwen2-0b5 · Warm
vhphuoc1102/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-miniature_vicious_caribou
0 · 558 · Jul 2025

2B · 32K · qwen2-1b5 · Warm
graf/qwen2.5-1.5b-instruct-sft-test-gt-lr1e-6
0 · 556 · Jan 2026

500M · 32K · qwen2-0b5 · Warm
ORDAv1/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-thriving_enormous_jellyfish
0 · 554 · Oct 2025

1B · 2K · phi-1b4 · Warm
olusegunola/phi-1.5-distill-v2-Ablation_No_L2_Norm-merged
0 · 552 · Apr 2026

2B · 32K · qwen25-1b5 · Warm
Qwen/Qwen2-Math-1.5B-Instruct
21 · 549 · Aug 2024

8B · 32K · llama31-8b · Warm
eekay/Llama-3.1-8B-Instruct-lion-numbers-ft
0 · 549 · Feb 2026

3B · 32K · llama32-3b · Warm
rajeev24/llama_pdf_2
0 · 547 · Jan 2025

1B · 2K · tinyllama-1b1 · Warm
h4rz3rk4s3/TinyParlaMintLlama-1.1B
0 · 547 · Feb 2024