CYFRAGOVPL/PLLuM-12B-chat

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Feb 7, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

CYFRAGOVPL/PLLuM-12B-chat is a 12 billion parameter chat-tuned large language model developed by CYFRAGOVPL, based on Mistral-Nemo-Base-2407. It is specialized in Polish and other Slavic/Baltic languages, with additional English data, and excels at generating contextually coherent text and assisting in various tasks. This model is particularly strong in Polish public administration tasks and general Polish-language applications, having been refined through extensive instruction tuning and preference learning on high-quality Polish datasets.

Loading preview...

PLLuM-12B-chat: Polish-Specialized LLM

CYFRAGOVPL/PLLuM-12B-chat is a 12 billion parameter large language model, part of the PLLuM family, developed by CYFRAGOVPL. It is built upon the Mistral-Nemo-Base-2407 architecture and is specifically optimized for Polish and other Slavic/Baltic languages, while also incorporating English data for broader generalization. The model has undergone extensive pretraining on up to 150 billion tokens of Polish text (for CC-BY-NC-4.0 licensed models, 30 billion for Apache 2.0 licensed models) and refined through instruction fine-tuning using a unique dataset of ~40k manually created "organic instructions" in Polish, alongside synthetic and premium Polish corpora.

Key Capabilities

  • Polish Language Mastery: Achieves state-of-the-art results in various Polish-language tasks, including text generation, summarization, and question answering.
  • Domain-Specific Excellence: Demonstrates top-tier performance in specialized tasks relevant to Polish public administration, making it suitable for bureaucratic and legal topics.
  • Robust Alignment: Features a manually curated Polish-language preference corpus, enhancing safety, balance, and contextual appropriateness, even for sensitive or adversarial prompts.
  • Chat-Optimized: The "-chat" variant is specifically aligned on human preferences for more efficient and safer use in dialogue and general-purpose conversational scenarios.
  • Retrieval Augmented Generation (RAG) Support: Designed to perform well in RAG settings, providing structured responses with document citations when relevant information is available.

Good for

  • Applications requiring high-quality, contextually aware text generation in Polish.
  • Developing intelligent assistants for Polish public administration or legal sectors.
  • Research and development of AI applications where strong command of the Polish language is crucial.
  • General language tasks such as summarization and question answering in Polish.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p