IlyaGusev/saiga_gemma3_12b
Hugging Face
VISIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Apr 20, 2025License:gemmaArchitecture:Transformer0.0K Warm

IlyaGusev/saiga_gemma3_12b is a 12 billion parameter language model developed by IlyaGusev, fine-tuned from mlabonne/gemma-3-12b-it-abliterated. This model is specifically optimized for Russian language interactions, serving as an automatic assistant. It features a 32768 token context length and is designed for conversational AI applications in Russian.

Loading preview...

Overview

IlyaGusev/saiga_gemma3_12b is a 12 billion parameter language model, fine-tuned by IlyaGusev from the mlabonne/gemma-3-12b-it-abliterated base model. It is specifically designed for Russian language processing and functions as an automatic assistant. The model utilizes a custom prompt format based on Gemma3, incorporating system messages for structured conversations.

Key Capabilities

  • Russian Language Proficiency: Optimized for generating and understanding Russian text.
  • Conversational AI: Fine-tuned to act as an interactive assistant, capable of engaging in dialogues.
  • Extended Context Window: Supports a context length of 32768 tokens, allowing for more extensive conversations and complex queries.
  • Performance: Evaluation results show competitive performance on Russian benchmarks like PingPong and RuArenaHard, with comparisons against models such as gpt-4o.

Good For

  • Developing Russian-speaking chatbots and virtual assistants.
  • Applications requiring detailed Russian text generation and comprehension.
  • Research and development in Russian natural language processing, particularly for conversational tasks.
Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p