hadadxyz/OpenSonnet-Lite

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 4, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

OpenSonnet-Lite by hadadxyz is a 4.0 billion parameter causal language model, fine-tuned from Qwen/Qwen3-4B-Thinking-2507, featuring a native context length of 262,144 tokens. It is specifically designed for strong Chain-of-Thought (CoT) reasoning, approaching the performance of commercial models like Claude Sonnet 4.6, and uniquely restores multi-turn reasoning capabilities lost in its base model through a corrected chat template. This compact model is optimized for complex reasoning tasks and efficient operation on limited hardware, making it suitable for everyday use cases requiring coherent, multi-turn dialogue.

Loading preview...

OpenSonnet-Lite: Compact Reasoning for Everyday Use

OpenSonnet-Lite, developed by hadadxyz, is a 4.0 billion parameter causal language model fine-tuned from Qwen/Qwen3-4B-Thinking-2507. It is engineered to deliver robust Chain-of-Thought (CoT) reasoning, aiming for performance comparable to commercial models like Claude Sonnet 4.6, while remaining fully open-weights and accessible. A key differentiator is its restoration of multi-turn reasoning, addressing limitations in the base model's chat template to ensure consistent and coherent dialogue over extended conversations.

Key Capabilities

  • Enhanced Chain-of-Thought Reasoning: Approaches the reasoning performance of Claude Sonnet 4.6, even on limited hardware.
  • Multi-Turn Coherence: Corrected chat template enables consistent reasoning across long, multi-turn dialogues.
  • Large Context Window: Features a native context length of 262,144 tokens, supporting extensive input and output.
  • Optimized for Efficiency: Designed for everyday use, even on hardware with limited resources.

Good For

  • Applications requiring strong, consistent reasoning in conversational AI.
  • Deployments on hardware with moderate resource constraints.
  • Complex tasks benefiting from Chain-of-Thought prompting and long context.
  • Developers seeking an open-weights model with advanced reasoning capabilities for research and general-purpose use.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p