Models

448
8B32Kqwen2-7b
Warm

Guilherme34/MiniAGI

1
·
4
8B32Kqwen25-7b
Warm

MInference/qwen25-math-7b-instruct

1
·
3
500M32Kqwen2-0b5
Warm

southfreebird/Qwen2.5-Coder-0.5B-Instruct

0
·
3
500M32Kqwen2-0b5
Warm

goktugkoksal/qwen2-0.5b

0
·
3
500M32Kqwen2-0b5
Warm

anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_90

0
·
3
500M32Kqwen2-0b5
Warm

anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_30

0
·
3
500M32Kqwen2-0b5
Warm

anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_510

0
·
3
2B32Kqwen25-1b5
Warm

danieldk/Qwen2.5-1.5B-Instruct-w8a8-int-dynamic-weight

0
·
3
1B32Kllama32-1b
Warm

AIR-hl/Llama-3.2-1B-ultrachat200k

0
·
3
1B32Kllama32-1b
Warm

AIR-hl/Llama-3.2-1B-DPO

0
·
3
4B32Kqwen3-4b
Warm

sequelbox/Qwen3-4B-Thinking-2507-UML-Generator

4
·
3
500M32Kqwen2-0b5
Warm

anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_140

0
·
2
500M32Kqwen2-0b5
Warm

Qybera/Qybera2.6-0.5-instruct

1
·
2
500M32Kqwen2-0b5
Warm

anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_320

0
·
2
500M32Kqwen2-0b5
Warm

anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_378

0
·
1
500M32Kqwen2-0b5
Warm

anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_80

0
·
1
500M32Kqwen2-0b5
Warm

oieieio/Qwen2.5-0.5B-Instruct

0
·
1
500M32Kqwen2-0b5
Warm

anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_70

0
·
1
500M32Kqwen2-0b5
Warm

viethq5/Qwen2.5-0.5B-Instruct-f16

0
·
1
500M32Kqwen2-0b5
Warm

Silin1590/Qwen-0d5B-Int-AbstraL

0
·
1