lightonai/Qwen3-8B-EN

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Mar 30, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

Qwen3-8B-EN is an 8 billion parameter native English reasoning model developed by lightonai, fine-tuned from Qwen/Qwen3-8B-Base. It is specifically designed to produce its entire reasoning trace and final answer in English, making it optimized for complex English-based reasoning tasks. With a 32,768-token context length, this model excels in benchmarks requiring deep analytical thought and problem-solving.

Loading preview...

Qwen3-8B-EN: English Reasoning Model

Qwen3-8B-EN is an 8 billion parameter model developed by lightonai, fine-tuned from Qwen/Qwen3-8B-Base. Its core distinction is its design as a native reasoning model that generates its complete chain-of-thought and final answer exclusively in English. This model was released in conjunction with the paper "Rethinking the Multilingual Reasoning Gap with Layer Swap."

Key Capabilities & Features

  • Native English Reasoning: Produces detailed, step-by-step reasoning traces entirely in English before providing the final English answer.
  • Optimized for Reasoning Tasks: Specifically trained to enhance performance on complex analytical and problem-solving benchmarks.
  • Extensive Context Window: Supports a context length of 32,768 tokens, allowing for processing of lengthy inputs and complex problems.
  • Robust Training: Underwent full SFT (Supervised Fine-Tuning) on approximately 10 billion tokens over 2 epochs, utilizing the English split of the lightonai/Dolci-Think-SFT-32B-Multilingual dataset.

Performance Highlights

Evaluated on English versions of various benchmarks, Qwen3-8B-EN demonstrates strong performance:

  • MGSM-Rev2: 98.96%
  • Global-MMLU-Lite: 81.72%
  • GPQA-Diamond: 55.66%
  • AIME 24/25: 62.89%
  • HumanEvalPlus: 85.75%

These scores reflect mean accuracy, with an average of 77.00% across the listed benchmarks, showcasing its proficiency in mathematical reasoning, general knowledge, and code generation within an English context.

Ideal Use Cases

  • Applications requiring explicit, step-by-step English reasoning.
  • Tasks involving complex problem-solving, mathematical calculations, or code generation where a clear chain of thought is beneficial.
  • Scenarios demanding high accuracy on English-centric reasoning benchmarks.