Model Overview
Qwen-2.5-14B-Hindi is a 14.8 billion parameter instruction-tuned large language model developed by Traversaal.ai and 1-800-LLMs. It is specifically optimized for bilingual performance in Hindi and English, showing notable improvements over the original Qwen-2.5-14B-Instruct model.
Key Capabilities & Performance
- Enhanced Hindi Performance: Achieves approximately 3.5% better average performance on Hindi tasks compared to the original model.
- Improved English Benchmarks: Demonstrates around 7.6% better performance on challenging English benchmarks like MMLU-Pro, MATH-Hard, and GPQA.
- Reduced Bias: Features less bias towards the ordering of choices in multiple-choice questions, a result of modified training techniques.
- Flexible Prompt Formats: Supports various prompt formats for tasks such as NLI, MCQ, summarization, coding, and translation.
Intended Use Cases
- Bilingual Applications: Ideal for developing AI-powered chat assistants, customer support services, and educational tools catering to Hindi and English speakers.
- Research & Development: Serves as a valuable tool for researchers and developers in NLP, particularly for multilingual advancements.
- Commercial Fine-tuning: Can be utilized as a foundational model for fine-tuning to meet specific industry needs in bilingual contexts.
Limitations
Users should be aware that the model is limited to Hindi and English, may generate incorrect or offensive information, and should not be used for high-risk decision-making without human oversight.