Name: CYFRAGOVPL/Llama-PLLuM-70B-chat-2508 API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: CYFRAGOVPL

PLLuM: A Family of Polish Large Language Models

CYFRAGOVPL/Llama-PLLuM-70B-chat-2508 is a 70 billion parameter model from the PLLuM family, developed by the HIVE AI Consortium (initially PLLuM consortium). This model is built upon the Llama 3.1 architecture and is specifically designed for Polish and other Slavic/Baltic languages, incorporating additional English data for broader generalization. It has been extensively refined through instruction tuning, preference learning, and advanced alignment techniques.

Key Capabilities

Polish Language Specialization: Trained on up to 150 billion tokens of high-quality Polish data, making it highly proficient in the language.
Organic Instruction Dataset: Utilizes a unique, manually curated dataset of ~55k Polish prompt-response pairs, including multi-turn dialogues, to enhance instruction following and mitigate negative linguistic transfer.
Polish Preference Corpus: Features the first Polish-language preference corpus, enabling the model to learn correctness, balance, and safety, especially for sensitive topics.
State-of-the-Art Performance: Achieves top scores in custom benchmarks relevant to Polish public administration and state-of-the-art results in broader Polish-language tasks.
Retrieval Augmented Generation (RAG): Specifically trained to perform well in RAG settings, providing context-aware answers with document citations.

Good For

General Language Tasks: Text generation, summarization, and question answering in Polish.
Domain-Specific Assistants: Particularly effective for applications in Polish public administration, legal, and bureaucratic contexts.
Research & Development: Serving as a robust foundation for AI applications requiring strong command of the Polish language.

Overview

PLLuM: A Family of Polish Large Language Models

Key Capabilities

Good For

Full Model Card (README)