name54/Ru-Gemma3-1B
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kPublished:Mar 22, 2026License:gemmaArchitecture:Transformer Warm

Ru-Gemma3-1B is an experimental 1 billion parameter Gemma 3 Instruct model, fine-tuned by name54 on the Russian Saiga-scored dataset. This model is adapted for Russian language conversational tasks in an "Assistant/User" format, aiming to improve interaction quality in Russian. With a 32768 token context length, it focuses on enhancing dialogue capabilities despite its small size and experimental training of only one epoch.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p