Name: CYFRAGOVPL/PLLuM-12B-instruct-2512 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: CYFRAGOVPL

PLLuM-12B-instruct-2512: Polish Language Model

This model is part of the PLLuM family, a series of large language models (LLMs) developed by the PLLuM consortium and later by HIVE AI, with a strong focus on the Polish language. The PLLuM-12B-instruct-2512 is a 12 billion parameter instruction-tuned model built upon the Mistral-Nemo-Base-2407 architecture.

Key Highlights & Development

Extensive Data Collection: Trained on large-scale, high-quality Polish and English text data, rigorously cleaned and deduplicated.
Organic Instruction Dataset: Features the largest Polish collection of manually created "organic instructions" (approximately 70k), designed to cover subtle aspects of supervised fine-tuning and mitigate negative linguistic transfer.
Polish Preference Corpus: Utilizes the first Polish-language preference corpus (~60k manually annotated pairs) to enhance correctness, balance, and safety, especially for sensitive topics.
State-of-the-Art Performance: Achieves top scores on custom benchmarks relevant to Polish public administration and state-of-the-art results in broader Polish-language tasks.
Alignment: Aligned to human preferences for safer and more efficient use in dialogue and general-purpose scenarios.

Intended Use Cases

General Language Tasks: Ideal for text generation, summarization, extraction, and question answering in Polish.
Domain-Specific Assistants: Particularly effective for applications within Polish public administration, legal, or bureaucratic contexts requiring domain-aware retrieval.
Research & Development: Serves as a robust foundation for AI applications demanding strong command of the Polish language.

Overview

PLLuM-12B-instruct-2512: Polish Language Model

Key Highlights & Development

Intended Use Cases

Full Model Card (README)