CYFRAGOVPL/Llama-PLLuM-8B-base-2412
CYFRAGOVPL/Llama-PLLuM-8B-base-2412 is an 8 billion parameter base model from the PLLuM family, developed by a consortium led by Politechnika Wrocławska. Based on Llama 3.1, this model is specialized in Polish and other Slavic/Baltic languages, with additional English data for generalization, and features a 32768 token context length. It is pretrained on extensive high-quality Polish text corpora and is designed to generate contextually coherent text and serve as a foundation for specialized applications, particularly excelling in Polish-language tasks and public administration. This model is intended for general language tasks and research and development in Polish-centric AI applications.
Loading preview...
PLLuM: Polish Large Language Models
CYFRAGOVPL/Llama-PLLuM-8B-base-2412 is an 8 billion parameter base model from the PLLuM family, developed by a consortium of Polish institutions led by Politechnika Wrocławska. This model is built upon Llama 3.1 and is specifically designed for Polish and other Slavic/Baltic languages, incorporating English data for broader generalization. It features a 32768 token context length.
Key Capabilities
- Specialized Language Focus: Optimized for Polish, Slavic, and Baltic languages, pretrained on up to 150 billion tokens of Polish text.
- High-Quality Training Data: Utilizes an extensive collection of Polish text data, including a unique dataset of ~40k manually created "organic instructions" and the first Polish-language preference corpus for alignment.
- Strong Performance: Achieves state-of-the-art results in broader Polish-language tasks and top scores in custom benchmarks relevant to Polish public administration.
- Foundation Model: Serves as a robust base for developing specialized applications, including domain-specific intelligent assistants.
Good for
- General Language Tasks: Text generation, summarization, and question answering in Polish.
- Domain-Specific Assistants: Particularly effective for applications in Polish public administration, legal, and bureaucratic contexts.
- Research & Development: Ideal for academic and industrial projects requiring strong command of the Polish language.