CYFRAGOVPL/Llama-PLLuM-8B-base-2512
CYFRAGOVPL/Llama-PLLuM-8B-base-2512 is an 8 billion parameter base model from the PLLuM family, developed by the PLLuM consortium and HIVE AI, specialized in Polish with additional English data. It is built upon the Llama-3.1-8B architecture and is designed to generate contextually coherent text and serve as a foundation for specialized applications, particularly excelling in tasks relevant to Polish public administration and broader Polish-language tasks.
Loading preview...
PLLuM: Polish Large Language Models
CYFRAGOVPL/Llama-PLLuM-8B-base-2512 is an 8 billion parameter base model within the PLLuM family, developed by the PLLuM consortium and HIVE AI. This model is built on the Llama-3.1-8B architecture and is specifically optimized for the Polish language, incorporating additional English data for enhanced generalization. The PLLuM models are distinguished by their extensive, high-quality Polish and English text data collection, rigorous cleaning, and deduplication processes.
Key Capabilities
- Specialized Polish Data: Trained on large-scale, high-quality Polish corpora, ensuring strong performance in Polish-language tasks.
- Organic Instruction Dataset: Utilizes a unique, manually curated Polish instruction set for supervised fine-tuning, designed to mitigate negative linguistic transfer.
- Polish Preference Corpus: Benefits from the first Polish-language preference corpus, manually assessed for correctness, balance, and safety, especially for sensitive topics.
- Public Administration Focus: Achieves top scores on custom benchmarks relevant to Polish public administration, making it highly suitable for government services and bureaucratic topics.
- RAG Optimization: Post-trained to perform effectively in Retrieval-Augmented Generation (RAG) settings, with a specific prompt format provided for document-based question answering.
Good for
- General Language Tasks: Text generation, summarization, extraction, and question answering in Polish.
- Domain-Specific Assistants: Particularly effective for applications within Polish public administration, legal, or bureaucratic sectors.
- Research & Development: Serving as a robust foundation for AI applications requiring strong Polish language capabilities.