aisingapore/Gemma-SEA-LION-v3-9B-IT

Hugging Face: aisingapore/Gemma-SEA-LION-v3-9B-IT

Text generation · Concurrency cost: 1 · Model size: 9B · Quantization: FP8 · Context length: 16K · Published: Oct 30, 2024 · License: Gemma · Architecture: Transformer

Gemma-SEA-LION-v3-9B-IT is a 9 billion parameter instruction-tuned decoder-only large language model developed by AI Singapore, based on the Gemma2 architecture. It is specifically pretrained and instruction-tuned for the Southeast Asia (SEA) region, supporting 13 languages: Burmese, Chinese, English, Filipino, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tamil, Thai, and Vietnamese. With a context length of 8192 tokens, this model excels at instruction-following tasks in a multilingual Southeast Asian context.


Gemma-SEA-LION-v3-9B-IT: Southeast Asian Language Model

Gemma-SEA-LION-v3-9B-IT is a 9 billion parameter instruction-tuned model developed by AI Singapore, building upon the Gemma2 architecture. It is part of the SEA-LION (Southeast Asian Languages In One Network) collection, specifically designed and optimized for the Southeast Asian region.

Key Capabilities & Features

  • Multilingual Support: Supports 13 languages: Burmese, Chinese, English, Filipino, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tamil, Thai, and Vietnamese.
  • Instruction-Tuned: Fine-tuned for instruction-following in both English and various ASEAN languages (see the usage sketch after this list).
  • Gemma2 Architecture: Utilizes the Gemma2 decoder model for its base architecture.
  • Context Length: Features a context length of 8192 tokens.
  • Evaluated Performance: Benchmarked using the SEA-HELM evaluation framework for general language capabilities (QA, Sentiment, Translation, Summarization, etc.) and instruction-following capabilities with localized IFEval and MT-Bench datasets.
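
The instruction-tuned checkpoint is meant to be prompted through a chat template. The sketch below is a hedged illustration, not an official example: it assumes the transformers and torch packages, standard Gemma2 chat formatting, and bfloat16 hardware support, and shows one way to load the model from Hugging Face and ask a question in Indonesian.

```python
# Minimal sketch (not an official snippet) of loading the model and prompting
# it via the tokenizer's chat template. Dtype and device settings are
# assumptions; adjust for your hardware.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "aisingapore/Gemma-SEA-LION-v3-9B-IT"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumed dtype; use float16/float32 as needed
    device_map="auto",
)

# Gemma2-style chat models expect prompts built with the chat template.
messages = [
    {"role": "user", "content": "Apa ibu kota Indonesia?"}  # "What is the capital of Indonesia?"
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```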

Intended Use Cases

This model is suitable for applications requiring strong instruction-following and language generation across a diverse set of Southeast Asian languages. It is particularly useful for tasks like question answering, sentiment analysis, translation, and conversational AI within the SEA context. Developers should note that the model has not been aligned for safety and requires custom safety fine-tuning for production deployment.
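
As a model hosted on Featherless, it can also be queried through an OpenAI-compatible chat completions endpoint. The snippet below is a hedged sketch only: the base URL https://api.featherless.ai/v1 and the FEATHERLESS_API_KEY environment variable are assumptions, so check the Featherless documentation for the exact endpoint and authentication details.

```python
# Hedged sketch of calling the hosted model through an OpenAI-compatible API.
# The base URL and API-key environment variable are assumptions; consult the
# Featherless docs for the actual values.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",     # assumed endpoint
    api_key=os.environ["FEATHERLESS_API_KEY"],    # assumed env var name
)

response = client.chat.completions.create(
    model="aisingapore/Gemma-SEA-LION-v3-9B-IT",
    messages=[
        {"role": "user", "content": "Terjemahkan ke bahasa Inggris: 'Selamat pagi, apa kabar?'"}
    ],
    temperature=0.7,
)
print(response.choices[0].message.content)
```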

Popular Sampler Settings

Featherless tracks the top three parameter combinations used with this model across the following sampler settings: temperature, top_p, top_k, frequency_penalty, presence_penalty, repetition_penalty, and min_p.
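
For local inference with transformers, most of these sampler settings map directly onto generation parameters. The sketch below uses illustrative placeholder values, not the configurations Featherless users actually report; frequency and presence penalties are OpenAI-style API parameters and are usually applied at the serving layer rather than in transformers.

```python
# Illustrative sketch: mapping the sampler settings above onto transformers
# generation arguments. The numeric values are placeholders, not the actual
# top configurations reported by Featherless users.
from transformers import GenerationConfig

gen_config = GenerationConfig(
    do_sample=True,
    temperature=0.7,         # placeholder value
    top_p=0.9,               # placeholder value
    top_k=50,                # placeholder value
    repetition_penalty=1.1,  # placeholder value
    min_p=0.05,              # placeholder value (requires a recent transformers release)
)

# frequency_penalty and presence_penalty are set on the serving request
# (OpenAI-compatible APIs), not in GenerationConfig.

# Reusing `model`, `tokenizer`, and `inputs` from the loading sketch above:
# outputs = model.generate(inputs, generation_config=gen_config, max_new_tokens=256)
```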