sartifyllc/Pawa-Gemma-Swahili-2B
TEXT GENERATIONConcurrency Cost:1Model Size:2.6BQuant:BF16Ctx Length:8kPublished:Jan 13, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm
Pawa-Gemma-Swahili-2B by sartifyllc is a 2.6 billion parameter language model built on the Gemma-2 base architecture, specifically fine-tuned for Swahili and English. It features a custom tokenizer and leverages supervised fine-tuning (SFT) and Direct Preference Optimization (DPO) on Swahili datasets. This model excels in contextually rich Swahili-focused tasks, general assistance, and chat-based interactions, making it suitable for applications requiring nuanced understanding in both languages.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–