Darkester/ru_gemma

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:2.5BQuant:BF16Ctx Length:8kPublished:Mar 21, 2026License:mitArchitecture:Transformer Open Weights Warm

Darkester/ru_gemma is a 2.5 billion parameter language model, part of the Gemma family, developed by Darkester. This model is specifically designed and fine-tuned for Russian language tasks, offering an 8192-token context window. It focuses on providing robust performance for applications requiring strong Russian language understanding and generation capabilities.

Loading preview...

Darkester/ru_gemma: A Russian-Optimized Gemma Model

Darkester/ru_gemma is a specialized language model built upon the Gemma architecture, featuring 2.5 billion parameters and an 8192-token context window. Its primary distinction lies in its optimization for the Russian language, making it a suitable choice for applications where high-quality Russian text processing is crucial.

Key Capabilities

  • Russian Language Proficiency: Designed to understand and generate text effectively in Russian.
  • Gemma Architecture: Leverages the foundational strengths of the Gemma model family.
  • Moderate Parameter Count: At 2.5 billion parameters, it offers a balance between performance and computational efficiency for Russian-centric tasks.
  • Extended Context Window: Supports an 8192-token context, allowing for processing longer Russian texts and maintaining coherence over extended conversations or documents.

Good For

  • Russian Text Generation: Creating articles, summaries, or creative content in Russian.
  • Russian Language Understanding: Analyzing and extracting information from Russian documents.
  • Multilingual Applications (Russian focus): Integrating Russian language capabilities into broader systems.
  • Research and Development: Exploring the performance of Gemma-based models specifically tailored for non-English languages like Russian.