PLLuM-12B-nc-chat-2412: Polish-Centric LLM for Dialogue

The PLLuM-12B-nc-chat-2412 is a 12 billion parameter model from the PLLuM family, developed by a consortium of Polish scientific institutions led by Politechnika Wrocławska. It is built upon the Mistral-Nemo-Base-2407 architecture and is specifically designed for Polish and other Slavic/Baltic languages, incorporating English data for broader generalization. This model is instruction-tuned and aligned on human preferences, making it suitable for dialogue and general-purpose conversational AI.

Key Capabilities

Specialized Multilingualism: Optimized for Polish, Slavic, and Baltic languages, with strong performance in Polish-specific tasks.
Extensive Training Data: Pretrained on approximately 150 billion tokens of Polish text, along with additional Slavic, Baltic, and English data.
Organic Instruction Tuning: Refined using a unique, manually curated dataset of ~40k Polish "organic instructions" and ~3.5k multi-turn dialogues, designed to mitigate negative linguistic transfer.
Preference Learning: Features the first Polish-language preference corpus, teaching the model correctness, balance, and safety, particularly for controversial topics.
Strong Performance: Achieves top scores on custom benchmarks relevant to Polish public administration and state-of-the-art results in broader Polish-language tasks.
Dialogue Optimization: The "-chat" suffix indicates alignment on human preferences for safer and more efficient use in conversational scenarios.

Good for

General Language Tasks: Text generation, summarization, and question answering in Polish.
Domain-Specific Assistants: Particularly effective for applications in Polish public administration, legal, and bureaucratic contexts.
Research & Development: Serving as a foundational model for AI applications requiring strong Polish language capabilities.

This model is released under the CC-BY-NC-4.0 license, indicating non-commercial use.

Overview

PLLuM-12B-nc-chat-2412: Polish-Centric LLM for Dialogue

Key Capabilities

Good for

Full Model Card (README)