name54/Ru-Gemma3-1B
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kPublished:Mar 22, 2026License:gemmaArchitecture:Transformer Warm
Ru-Gemma3-1B is an experimental 1 billion parameter Gemma 3 Instruct model, fine-tuned by name54 on the Russian Saiga-scored dataset. This model is adapted for Russian language conversational tasks in an "Assistant/User" format, aiming to improve interaction quality in Russian. With a 32768 token context length, it focuses on enhancing dialogue capabilities despite its small size and experimental training of only one epoch.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–