akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpeculativeReasoner

Warm
Public
1.5B
BF16
32768
1
Apr 16, 2025
Hugging Face

akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpeculativeReasoner is a 1.5 billion parameter language model fine-tuned by akhauriyash. It is based on the DeepSeek-R1-Distill-Qwen-1.5B architecture and specializes in speculative reasoning, particularly for mathematical tasks. The model leverages a 131072 token context length, making it suitable for complex problem-solving requiring extensive context.

No reviews yet. Be the first to review!