blackbook-lm/DeepSeek-R1-Distill-Qwen-7B-heretic

Text Generation · Concurrency Cost: 1 · Model Size: 7.6B · Quant: FP8 · Ctx Length: 32k · Published: Mar 1, 2026 · License: MIT · Architecture: Transformer · Open Weights

blackbook-lm/DeepSeek-R1-Distill-Qwen-7B-heretic is a 7.6-billion-parameter language model: a decensored version of deepseek-ai/DeepSeek-R1-Distill-Qwen-7B created with the Heretic v1.2.0 tool. The underlying model distills the reasoning of the larger DeepSeek-R1 model into the Qwen2.5-Math-7B base model and supports a 32768-token context length. The Heretic pass specifically targets refusal behavior, reducing refusals while retaining the distilled reasoning capabilities, which makes the model suitable for applications that require less restrictive content generation.

Model Overview

This model, blackbook-lm/DeepSeek-R1-Distill-Qwen-7B-heretic, is a 7.6 billion parameter language model derived from deepseek-ai/DeepSeek-R1-Distill-Qwen-7B. It has been decensored using the Heretic v1.2.0 tool, significantly reducing content refusals compared to its original counterpart (5/100 vs. 49/100 refusals). The base model is Qwen2.5-Math-7B, and it benefits from reasoning patterns distilled from the larger DeepSeek-R1 model, which was developed using large-scale reinforcement learning (RL) to foster advanced reasoning behaviors like self-verification and reflection.
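A minimal loading sketch using the standard Hugging Face transformers API (the repository id is taken from this card; the prompt, dtype, and generation settings are illustrative assumptions, not values from the card):

```python
# Sketch: load the model and run one reasoning prompt.
# Assumes the standard transformers AutoModel API; adjust dtype/device to your hardware.
MODEL_ID = "blackbook-lm/DeepSeek-R1-Distill-Qwen-7B-heretic"


def main():
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype=torch.bfloat16, device_map="auto"
    )

    # DeepSeek-R1 distills are chat models; use the bundled chat template.
    messages = [{"role": "user", "content": "Prove that sqrt(2) is irrational."}]
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)

    # Leave generous headroom: R1-style models emit long reasoning traces.
    output = model.generate(inputs, max_new_tokens=1024)
    print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))


if __name__ == "__main__":
    main()
```

Because the decensoring was applied with Heretic rather than by retraining, the model should load and serve exactly like the original distill checkpoint.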

Key Capabilities

  • Reduced Refusals: Engineered to provide less restrictive outputs, making it suitable for a wider range of applications.
  • Reasoning Enhancement: Incorporates distilled reasoning capabilities from the DeepSeek-R1 model, which excels in complex problem-solving.
  • Extended Context: Supports a context length of 32768 tokens, allowing for processing longer inputs and generating more coherent, extended responses.
  • Mathematical Proficiency: Built upon a math-focused base model, suggesting strong performance in quantitative tasks.
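The 32768-token window bounds prompt plus generation combined, so callers must reserve room for the response. A minimal budgeting sketch (the helper names and per-turn token counts are illustrative; real counts would come from the model's tokenizer):

```python
CTX_LEN = 32768  # context window reported on this card


def fits_in_context(prompt_tokens: int, max_new_tokens: int, ctx_len: int = CTX_LEN) -> bool:
    """Check whether the prompt plus the planned generation fits the window."""
    return prompt_tokens + max_new_tokens <= ctx_len


def truncate_history(turn_lengths, max_new_tokens, ctx_len=CTX_LEN):
    """Drop oldest turns until the remaining history leaves room to generate.

    turn_lengths: token counts per conversation turn, oldest first
    (a stand-in for real tokenizer counts, which this sketch does not compute).
    """
    kept = list(turn_lengths)
    while kept and sum(kept) + max_new_tokens > ctx_len:
        kept.pop(0)  # discard the oldest turn first
    return kept
```

For example, a history of turns costing 20000, 10000, and 5000 tokens cannot fit alongside a 4096-token generation budget, so the oldest turn is dropped: `truncate_history([20000, 10000, 5000], 4096)` returns `[10000, 5000]`.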

Good For

  • Applications requiring less censorship: Ideal for use cases where the original model's refusal rates are prohibitive.
  • Reasoning-intensive tasks: Benefits from the DeepSeek-R1 distillation, making it suitable for tasks demanding logical thought and problem-solving.
  • Developers seeking a Qwen-based model with enhanced reasoning: Offers a powerful alternative for those already familiar with the Qwen architecture.