Name: grisun0/Qwen2.5-0.5B-Instruct-heretic API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: grisun0

Model Overview

The grisun0/Qwen2.5-0.5B-Instruct-heretic is a 0.5 billion parameter instruction-tuned causal language model, derived from the Qwen2.5 architecture. Its primary distinction is being a "decensored" version of the original Qwen/Qwen2.5-0.5B-Instruct, achieved through the Heretic v1.1.0 tool. This modification significantly reduces the model's refusal rate, from 93/100 in the original to 16/100 in this version, as measured by KL divergence.

Key Capabilities

Reduced Refusals: Engineered to provide less restrictive content generation compared to its base model.
Enhanced Knowledge & Reasoning: Benefits from the Qwen2.5 improvements in coding and mathematics, leveraging specialized expert models.
Improved Instruction Following: Demonstrates better adherence to instructions and understanding of diverse system prompts.
Structured Data & Output: Excels at understanding structured data like tables and generating structured outputs, particularly JSON.
Long Context Support: Supports a full context length of 32,768 tokens and can generate up to 8,192 tokens.
Multilingual Support: Capable of processing and generating text in over 29 languages, including major global languages.

Use Cases

This model is particularly well-suited for applications where a less constrained response generation is desired, while still benefiting from the robust capabilities of the Qwen2.5 architecture in areas like coding, mathematics, and structured data processing. Its instruction-following and long-context abilities make it versatile for various conversational and text generation tasks.

Overview

Model Overview

Key Capabilities

Use Cases

Full Model Card (README)