NetherQuartz/tatoeba-tok-multi-gemma-2-2b-merged
Text Generation · Concurrency Cost: 1 · Model Size: 2.6B · Quant: BF16 · Ctx Length: 8K · Published: Sep 20, 2025 · License: gemma · Architecture: Transformer · Warm

NetherQuartz/tatoeba-tok-multi-gemma-2-2b-merged is a 2.6-billion-parameter language model based on Google's Gemma-2-2B architecture. It is fine-tuned for multilingual use, with a focus on Toki Pona, Russian, English, and Vietnamese, and pairs a custom tokenizer with a purpose-built dataset to improve performance across this language set.


Model Overview

NetherQuartz/tatoeba-tok-multi-gemma-2-2b-merged is a specialized language model built upon the robust Google Gemma-2-2B architecture. With 2.6 billion parameters and an 8192-token context length, this model is designed for efficient multilingual processing.
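As a merged checkpoint, the model should load directly with the standard Transformers `AutoModelForCausalLM` API. The sketch below is a minimal, hedged example: it assumes the repo id resolves on the Hugging Face Hub and that the checkpoint behaves like a plain causal LM (the prompt style is an illustration, not a documented format).

```python
# Minimal usage sketch for the merged checkpoint. Assumptions: the repo id
# below is available on the Hugging Face Hub, and the custom tokenizer ships
# inside the repo so AutoTokenizer picks it up automatically.

REPO_ID = "NetherQuartz/tatoeba-tok-multi-gemma-2-2b-merged"

def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a continuation of `prompt` with the merged model."""
    # Imported lazily so the heavy dependencies load only when generating.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(REPO_ID)
    model = AutoModelForCausalLM.from_pretrained(REPO_ID, torch_dtype=torch.bfloat16)

    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)

if __name__ == "__main__":
    # Toki Pona prompt: "hello! how are you feeling?"
    print(generate("toki! sina pilin seme?"))
```

Running the script downloads roughly 5 GB of BF16 weights on first use; `device_map="auto"` can be added to `from_pretrained` to place the model on an available GPU.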

Key Capabilities

  • Multilingual Proficiency: Fine-tuned to excel in Toki Pona, Russian, English, and Vietnamese.
  • Custom Tokenization: Utilizes a custom tokenizer tailored for the specific linguistic characteristics of its target languages.
  • Specialized Dataset: Trained on the NetherQuartz/tatoeba-tokipona dataset, enhancing its understanding and generation capabilities for Toki Pona and other included languages.
  • Gemma-2-2B Base: Benefits from the foundational strengths and efficiency of the Gemma-2-2B model.
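One way to see the custom tokenizer at work is to compare how it segments a Toki Pona sentence against the base Gemma-2 tokenizer. This is a sketch under assumptions: the base repo id `google/gemma-2-2b` and the sample sentence are illustrative, and both downloads require Hub access (the base Gemma repo is gated and needs an accepted license).

```python
# Sketch: compare token counts for a Toki Pona sentence between the model's
# custom tokenizer and the base Gemma-2 tokenizer. Both repo ids are
# assumptions taken from the model card and the public Gemma release.

def count_tokens(repo_id: str, text: str) -> int:
    """Return how many tokens `repo_id`'s tokenizer produces for `text`."""
    from transformers import AutoTokenizer  # lazy import; needs Hub access
    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    return len(tokenizer.encode(text, add_special_tokens=False))

if __name__ == "__main__":
    sample = "mi moku e kili"  # Toki Pona: "I eat fruit"
    for repo in (
        "NetherQuartz/tatoeba-tok-multi-gemma-2-2b-merged",
        "google/gemma-2-2b",
    ):
        print(f"{repo}: {count_tokens(repo, sample)} tokens")
```

A tokenizer tailored to Toki Pona's small, regular vocabulary would typically yield fewer tokens per sentence than a general-purpose one, which translates to shorter sequences and cheaper inference.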

Good For

  • Toki Pona Applications: Ideal for projects involving the minimalist constructed language Toki Pona, including translation, text generation, or analysis.
  • Multilingual Text Processing: Suitable for tasks requiring simultaneous understanding or generation across Russian, English, and Vietnamese, particularly when Toki Pona is also a factor.
  • Research and Development: Useful for researchers exploring multilingual models with unique language combinations and custom tokenization strategies.