p-e-w/phi-4-heretic
Model Overview
p-e-w/phi-4-heretic is a 14.7-billion-parameter decoder-only Transformer model derived from Microsoft's phi-4, with a 32,768-token context length. It was modified using the Heretic tool to produce a "decensored" variant that refuses substantially fewer prompts than the original phi-4.
Key Differentiators
- Decensored Version: Modified from the original microsoft/phi-4 to exhibit significantly fewer refusals, with a reported 41/100 refusals compared to the original's 100/100.
- Enhanced Reasoning: The base phi-4 model was trained on a blend of synthetic datasets, filtered public-domain websites, and academic books, focusing on high-quality data for advanced reasoning.
- Optimized for Efficiency: Designed for memory- and compute-constrained environments and latency-bound scenarios.
Performance Insights
While the base phi-4 model performs strongly across benchmarks, including MMLU (84.8), GPQA (56.1), and HumanEval (82.6), this "heretic" version specifically targets a reduction in content moderation and refusal behavior. Its KL divergence of 0.09 from the original model's output distribution indicates only a slight behavioral shift.
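The KL-divergence figure quantifies how far the modified model's output distribution drifts from the original's. As a rough illustration only (toy hand-picked distributions, not the actual Heretic measurement or methodology), KL divergence between two discrete probability distributions can be computed like this:

```python
import math

def kl_divergence(p, q):
    """KL(P || Q) for discrete distributions given as aligned probability lists."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical next-token distributions over three tokens:
p = [0.70, 0.20, 0.10]  # original model
q = [0.65, 0.22, 0.13]  # modified model
print(f"{kl_divergence(p, q):.4f}")
```

A value near zero means the two distributions are almost identical, which is why a measured divergence of 0.09 is read as a slight shift in behavior rather than a wholesale change.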
Intended Use Cases
- Accelerating research on language models.
- Building generative AI features, especially where reduced content filtering is desired.
- Applications requiring strong reasoning and logic in resource-limited settings.