Name: Greytechai/LFM2.5-1.2B-Thinking-Kimi-V2-Heretic-Uncensored-DISTILL API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Greytechai

Model Overview

Greytechai/LFM2.5-1.2B-Thinking-Kimi-V2-Heretic-Uncensored-DISTILL is a 1.2 billion parameter language model built on the LFM2.5 architecture. It was fine-tuned with Unsloth on distilled reasoning datasets, a process that overhauled its thinking and reasoning behavior. The model is designed to produce compact yet detailed reasoning that addresses the prompt directly, without excessive verbosity.

Key Capabilities & Features

Enhanced Reasoning: The model's core reasoning mechanism has been entirely replaced and optimized for deep, detailed thought processes.
Uncensored Output: As a 'Heretic' model, it was de-censored before tuning, ensuring it does not refuse requests and generates content as directed, including potentially sensitive or explicit material.
Stable Reasoning: Reasoning is reported to remain stable at temperatures from 0.1 to 2.5.
Extended Context: Supports a substantial context length of 32768 tokens.
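
The 32768-token context has to hold both the prompt and everything the model generates. The following is a minimal sketch of that budget arithmetic, not part of any official tooling: the helper name `max_new_tokens` and the `reserve` parameter are hypothetical, and it assumes the prompt's token count has already been measured elsewhere and that the thinking trace counts as generated output.

```python
CONTEXT_LENGTH = 32768  # context window stated in the listing


def max_new_tokens(prompt_tokens: int, reserve: int = 0) -> int:
    """Return how many tokens are left for generation.

    prompt_tokens: tokens already used by the prompt (measured elsewhere).
    reserve: hypothetical safety margin to keep free, e.g. for a stop sequence.
    """
    if prompt_tokens < 0 or reserve < 0:
        raise ValueError("token counts must be non-negative")
    # Never return a negative budget; 0 means the prompt alone fills the window.
    return max(CONTEXT_LENGTH - prompt_tokens - reserve, 0)


if __name__ == "__main__":
    print(max_new_tokens(30000))        # room left after a 30000-token prompt
    print(max_new_tokens(30000, 256))   # same prompt, keeping 256 tokens free
```

Under that assumption, a long prompt leaves less room for the reasoning portion of the reply as well as the final answer.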

Optimal Usage & Settings

For best performance, the model card recommends q5, q6, q8, or 16-bit precision, or the Imatrix IQ3_M quantization. A repetition penalty of 1.05 to 1.1 is suggested. If the model loops while thinking, lower the temperature to 0.3-0.7. For chat and roleplay, setting 'Smoothing_factor' (or 'Smoothing') to 1.5 in interfaces such as KoboldCpp, oobabooga, or SillyTavern is highly recommended for smoother operation.
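
The recommendations above can be collected into a small settings helper. This is an illustrative sketch only: `SamplerSettings` and `recommended_settings` are hypothetical names, the default temperature of 0.8 is an arbitrary starting point rather than a documented value, and the field names may not match the exact keys a given interface or API expects.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SamplerSettings:
    temperature: float
    repetition_penalty: float
    smoothing_factor: Optional[float] = None  # only set for chat / roleplay


def recommended_settings(
    temperature: float = 0.8,          # assumed default, not from the listing
    repetition_penalty: float = 1.05,  # listing suggests 1.05 to 1.1
    looping: bool = False,
    chat: bool = False,
) -> SamplerSettings:
    """Build sampler settings that follow the listing's suggestions."""
    if not 0.1 <= temperature <= 2.5:
        raise ValueError("temperature outside the reported stable range 0.1-2.5")
    if not 1.05 <= repetition_penalty <= 1.1:
        raise ValueError("repetition penalty outside the suggested 1.05-1.1")
    if looping:
        # Looping during thinking: pull the temperature into 0.3-0.7.
        temperature = min(max(temperature, 0.3), 0.7)
    # Smoothing of 1.5 is suggested for chat and roleplay only.
    smoothing = 1.5 if chat else None
    return SamplerSettings(temperature, repetition_penalty, smoothing)


if __name__ == "__main__":
    print(recommended_settings(looping=True, chat=True))
```

In KoboldCpp, oobabooga, or SillyTavern these values would be entered in the interface's sampler settings; the helper only shows how the ranges relate to each other.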
