Models (8,419)
500M · 32K · qwen2-0b5
Warm

anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_448

0 · 5
500M · 32K · qwen2-0b5
Warm

anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_160

0 · 5
500M · 32K · qwen2-0b5
Warm

anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_510

0 · 5
500M · 32K · qwen2-0b5
Warm

anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_370

0 · 5
500M · 32K · qwen2-0b5
Warm

ahmedheakl/ex21_qwen2.5_0.5b_20k_16kcw_3ep_cuda_amd

0 · 5
500M · 32K · qwen2-0b5
Warm

anmolagarwal999/Qwen2_5-0_5B-Instructsft_savedmath_dataset_based_on_deepseek_distilled_traces_epoch_320

0 · 5
2B · 32K · qwen25-1b5
Warm

ma921/qwen-2.5-sft-golden-hh

0 · 5
3B · 8K · gemma2-2b
Warm

somosnlp/kuntur-peru-legal-es-gemma-2b-it-merged

1 · 5
2B · 32K · qwen15-1b8
Warm

DuongTrongChi/Sailor-1.8b-chat-sft-v1

0 · 5
3B · 8K · gemma2-2b
Warm

Iker/Neurona-2b

1 · 5
1B · 32K · llama32-1b
Warm

ThinkAgents/ThinkAgent-1B

1 · 5
1B · 32K · llama32-1b
Warm

jessemeng/TwinLlama-3.2-1B

0 · 5
1B · 32K · llama32-1b
Warm

DoeyLLM/OneLLM-Doey-V1-Llama-3.2-1B-it

0 · 5
1B · 32K · llama32-1b
Warm

Montecarlo2024/Llama3.2_1b-Instruct_Function-v0.1

0 · 5
1B · 32K · llama32-1b
Warm

jessemeng/TwinLlama-3.1-8B

0 · 5
1B · 32K · llama32-1b
Warm

PathFinderKR/Llama-3-1B-Medical-Instruct

0 · 5
1B · 32K · llama32-1b
Warm

joelewing/Llama-3.2-1B-Instruct-Capybara

0 · 5
1B · 32K · llama32-1b
Warm

abcorrea/llama-3.2-1b-tinystories-ft-25k

0 · 5
1B · 32K · llama32-1b
Warm

Trelis/Llama-3.2-1B-Instruct-MATH-synthetic

0 · 5
1B · 32K · llama32-1b
Warm

petkopetkov/Llama3.2-1B-Instruct-bg

0 · 5