enstazao/Qalb-1.0-8B-Instruct

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Jan 14, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Qalb-1.0-8B-Instruct is an 8 billion parameter Urdu language model developed by enstazao, built upon the Llama-3.1-8B architecture. It has undergone continued pre-training on a 1.97 billion token Urdu corpus and supervised fine-tuning for instruction following. This model excels in deep Urdu understanding, reasoning, and culturally accurate responses, outperforming previous state-of-the-art models in Urdu language tasks while retaining strong English capabilities.

Loading preview...

Qalb-1.0-8B-Instruct: State-of-the-Art Urdu LLM

Qalb-1.0-8B-Instruct is an 8 billion parameter Urdu language model, developed by enstazao, based on the Llama-3.1-8B architecture. It was specifically adapted for Urdu through a two-stage process: continued pre-training on a massive 1.97 billion token Urdu corpus, followed by supervised fine-tuning for instruction following. This model aims to address the gap in low-resource language processing for Urdu, providing fluent, culturally accurate, and context-aware responses that general multilingual models often struggle with.

Key Capabilities

  • Deep Urdu Understanding: Trained on diverse Urdu content including news, literature, government documents, and social media.
  • Superior Performance: Achieves an overall score of 90.34, outperforming previous state-of-the-art models like Alif-1.0 and LLaMA-3.1 Base in 6 out of 7 benchmark categories for Urdu tasks.
  • Reasoning Capable: Demonstrates excellent performance in logical reasoning, mathematical word problems, and commonsense tasks specifically in Urdu.
  • Bilingual Proficiency: Maintains strong English language capabilities, making it suitable for translation and code-switching applications.
  • Ethical & Safe: Fine-tuned to generate helpful, harmless, and honest content, refusing toxic or misleading outputs.

Ideal Use Cases

  • Urdu-centric Applications: Developing chatbots, virtual assistants, or content generation tools specifically for the Urdu language.
  • Cross-lingual Tasks: Scenarios requiring translation or code-switching between Urdu and English.
  • Research & Development: As a robust baseline or component for further research in low-resource language processing and Urdu NLP.