sartifyllc/Pawa-Gemma-Swahili-2B
TEXT GENERATIONConcurrency Cost:1Model Size:2.6BQuant:BF16Ctx Length:8kPublished:Jan 13, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

Pawa-Gemma-Swahili-2B by sartifyllc is a 2.6 billion parameter language model built on the Gemma-2 base architecture, specifically fine-tuned for Swahili and English. It features a custom tokenizer and leverages supervised fine-tuning (SFT) and Direct Preference Optimization (DPO) on Swahili datasets. This model excels in contextually rich Swahili-focused tasks, general assistance, and chat-based interactions, making it suitable for applications requiring nuanced understanding in both languages.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p