CYFRAGOVPL/Llama-PLLuM-8B-chat-2512
The CYFRAGOVPL/Llama-PLLuM-8B-chat-2512 is an 8 billion parameter chat-tuned large language model developed by the PLLuM consortium and HIVE AI, specialized in Polish with additional English data. Built upon the Llama-3.1-8B architecture, it is refined through extensive instruction tuning and preference learning using high-quality, manually curated Polish datasets. This model excels at generating contextually coherent text and assisting in various tasks, particularly for Polish public administration and general Polish-language applications.
Loading preview...
PLLuM: A Family of Polish Large Language Models
CYFRAGOVPL/Llama-PLLuM-8B-chat-2512 is an 8 billion parameter model from the PLLuM family, developed by the PLLuM consortium and HIVE AI. It is built on the Llama-3.1-8B architecture and is specifically aligned for chat and general-purpose scenarios. The model's development emphasizes high-quality Polish data, including an extensive collection of manually created "organic instructions" and the first Polish-language preference corpus, which teaches the model correctness, balance, and safety.
Key Capabilities
- Polish Language Specialization: Optimized for generating contextually coherent text in Polish, with additional English data for broader generalization.
- Advanced Alignment: Refined through instruction tuning, preference learning, and advanced alignment techniques using unique Polish datasets.
- Robust Evaluation: Achieves top scores on custom benchmarks relevant to Polish public administration and state-of-the-art results in broader Polish-language tasks.
- RAG-Optimized: Trained to perform well in Retrieval-Augmented Generation (RAG) settings, capable of answering questions based on provided documents with citations.
Good For
- General Language Tasks: Text generation, summarization, extraction, and question answering in Polish.
- Domain-Specific Assistants: Particularly effective for applications within Polish public administration, legal, or bureaucratic contexts requiring domain-aware retrieval.
- Research & Development: Serving as a foundational model for AI applications where strong command of the Polish language is essential.