CYFRAGOVPL/PLLuM-4B-base-2512

VISIONConcurrency Cost:1Model Size:4.3BQuant:BF16Ctx Length:32kPublished:Jan 15, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

CYFRAGOVPL/PLLuM-4B-base-2512 is a 4 billion parameter base model from the PLLuM family, specialized in Polish language processing and built upon the Gemma-3-4b-pt architecture. Developed by the PLLuM consortium and continued by HIVE AI, it incorporates extensive high-quality Polish and English data. This model is designed to generate contextually coherent text and serve as a foundation for specialized applications, particularly excelling in tasks relevant to Polish public administration.

Loading preview...

PLLuM: A Family of Polish Large Language Models

PLLuM models are a series of large language models (LLMs) developed by the PLLuM consortium and later by HIVE AI, with a strong focus on the Polish language, augmented by English data for broader generalization. The PLLuM-4B-base-2512 is a 4 billion parameter model based on gemma-3-4b-pt.

Key Capabilities

  • Extensive Polish Data Collection: Trained on large-scale, high-quality Polish and English text corpora, rigorously cleaned and deduplicated.
  • Organic Instruction Dataset: Features the largest Polish collection of manually created "organic instructions" for supervised fine-tuning, designed to mitigate negative linguistic transfer.
  • Polish Preference Corpus: Includes the first Polish-language preference corpus, manually assessed for correctness, balance, and safety, especially for controversial topics.
  • Specialized Evaluation: Achieves top scores on custom benchmarks relevant to Polish public administration and state-of-the-art results in broader Polish-language tasks.
  • RAG Optimization: Post-trained to perform well in Retrieval-Augmented Generation (RAG) settings, with a specific prompt format for document-based question answering.

Good For

  • General Language Tasks: Ideal for text generation, summarization, extraction, and question answering in Polish.
  • Domain-Specific Assistants: Particularly effective for applications within Polish public administration, legal, or bureaucratic contexts requiring domain-aware retrieval.
  • Research & Development: Serves as a robust building block for AI applications where a strong command of the Polish language is essential.