akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpeculativeReasoner
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Apr 16, 2025Architecture:Transformer Warm

akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpeculativeReasoner is a 1.5 billion parameter language model fine-tuned by akhauriyash. It is based on the DeepSeek-R1-Distill-Qwen-1.5B architecture and specializes in speculative reasoning, particularly for mathematical tasks. The model leverages a 131072 token context length, making it suitable for complex problem-solving requiring extensive context.

Loading preview...