muhammadnoman76/Lughaat-1.0-8B-Instruct

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Mar 22, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

Lughaat-1.0-8B-Instruct is an 8 billion parameter Urdu language model developed by Muhammad Noman, built on the Llama 3.1 architecture. It is specifically trained on the extensive muhammadnoman76/lughaat-urdu-dataset-llm, making it highly optimized for Urdu language processing tasks. This model demonstrates superior performance in Urdu generation, translation, ethics, and reasoning compared to other similar-sized models like Qwen-2.5-7b, Mistral 7B, and Alif 8B.

Loading preview...

Overview

Lughaat-1.0-8B-Instruct is an Urdu language model developed by Muhammad Noman, based on the Llama 3.1 8B architecture. Its primary differentiator is its training on muhammadnoman76/lughaat-urdu-dataset-llm, the largest Urdu dataset compiled by Muhammad Noman, which enables it to significantly outperform competitors in Urdu language tasks.

Key Capabilities

  • Superior Urdu Performance: Achieves an average score of 91.4% across various Urdu language tasks, outperforming models like Qwen-2.5-7b, Mistral 7B, and Alif 8B.
  • High Accuracy in Translation: Scores 94.2% in translation tasks, demonstrating strong cross-lingual understanding.
  • Robust Generation and Reasoning: Excels in Urdu text generation (89.5%) and reasoning (88.3%), indicating a deep understanding of the language.
  • Ethical Handling: Shows strong performance in ethics-related tasks (89.7%).

Ideal Use Cases

  • Urdu Question Answering: Provides accurate and relevant answers to questions posed in Urdu.
  • Urdu Text Generation: Capable of creating coherent and contextually appropriate Urdu content.
  • Summarization and Translation: Efficiently summarizes Urdu texts and performs high-quality Urdu translations.
  • Conversational AI: Suitable for developing chatbots and virtual assistants that interact in Urdu.
  • Content Creation: Assists in generating various forms of content specifically for the Urdu language.