cminst/DSR17B-templatefixes
TEXT GENERATION | Concurrency Cost: 1 | Model Size: 7.6B | Quant: FP8 | Ctx Length: 32k | Published: Mar 22, 2026 | License: mit | Architecture: Transformer | Open Weights | Cold

cminst/DSR17B-templatefixes is a 7.6-billion-parameter model from DeepSeek-AI, based on the DeepSeek-R1 architecture, with fixes to the chat template. It is designed for reasoning: training combines large-scale reinforcement learning with cold-start data to improve performance on math, code, and general reasoning tasks. It supports a 32,768-token context length and suits applications that need strong analytical and problem-solving ability.
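Because the prompt and the generated reply share the 32,768-token window, it can help to check the token budget before sending a request. The following is a minimal sketch, not an official usage recipe: `remaining_budget` and `count_prompt_tokens` are hypothetical helper names, and it assumes the model's tokenizer can be loaded through the Hugging Face `transformers` library under the ID `cminst/DSR17B-templatefixes`, which this card does not confirm.

```python
CTX_LENGTH = 32768  # context length stated on this card


def remaining_budget(prompt_tokens: int, max_new_tokens: int,
                     ctx_length: int = CTX_LENGTH) -> int:
    """Return the tokens left after reserving room for prompt and reply.

    A negative result means the request would not fit in the window.
    """
    if prompt_tokens < 0 or max_new_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return ctx_length - prompt_tokens - max_new_tokens


def count_prompt_tokens(messages, model_id="cminst/DSR17B-templatefixes"):
    """Count prompt tokens after the chat template is applied.

    Assumption: the tokenizer is available via `transformers` under
    `model_id`. The import is local so the budget helper above works
    without the library installed.
    """
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # Render the conversation with the model's chat template as plain text.
    text = tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )
    # The rendered template already carries its special tokens.
    return len(tokenizer(text, add_special_tokens=False)["input_ids"])
```

For example, a 1,000-token prompt with 2,000 tokens reserved for the reply leaves `remaining_budget(1000, 2000)` tokens of headroom. Reasoning models tend to produce long outputs, so reserving a generous `max_new_tokens` is usually the safer choice.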
