Name: CYFRAGOVPL/PLLuM-12B-chat-2412 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: CYFRAGOVPL

PLLuM-12B-chat-2412: Polish-Optimized Chat Model

PLLuM-12B-chat-2412 is a 12 billion parameter chat-tuned large language model from the PLLuM family, developed by CYFRAGOVPL. It is built upon the Mistral-Nemo-Base-2407 architecture and features a 32768 token context length. This model is specifically designed for Polish and other Slavic/Baltic languages, augmented with English data to enhance generalization capabilities. Its development involved extensive data collection, including up to 150 billion tokens of Polish text for non-commercial variants, and a unique dataset of ~40k manually created "organic instructions" in Polish, including multi-turn dialogues.

Key Capabilities

Specialized Polish Language Processing: Optimized for generating contextually coherent text in Polish, with strong performance in Slavic and Baltic languages.
Instruction and Chat Tuning: Refined through instruction tuning and preference learning using a unique Polish preference corpus, ensuring balanced, safe, and contextually appropriate responses.
High-Quality Data Foundation: Trained on large-scale, high-quality text corpora, including a significant portion of Polish data.
Public Administration Expertise: Achieves top scores on custom benchmarks relevant to Polish public administration tasks.
Retrieval Augmented Generation (RAG) Support: Designed to perform well in RAG settings, particularly for question answering based on provided documents.

Good For

General Language Tasks: Text generation, summarization, and question answering in Polish.
Domain-Specific Assistants: Especially effective for applications in Polish public administration, legal, and bureaucratic contexts.
Research & Development: Serving as a foundational model for AI applications requiring strong Polish language command.
Dialog and Conversational AI: Its chat-tuned nature makes it suitable for general-purpose conversational scenarios.

Overview

PLLuM-12B-chat-2412: Polish-Optimized Chat Model

Key Capabilities

Good For

Full Model Card (README)