aisingapore/Llama-SEA-LION-v2-8B

8B parameters · FP8 · 8192-token context · Jul 30, 2024 · License: llama3

Overview

Llama-SEA-LION-v2-8B is an 8-billion-parameter multilingual language model developed by AI Singapore for the Southeast Asia (SEA) region. It is built on Meta-Llama-3-8B-Instruct and has undergone continued pre-training to improve its understanding and generation in several languages used in the region. The name SEA-LION stands for "Southeast Asian Languages In One Network."

Key Capabilities

  • Multilingual Proficiency: Continued pre-training on approximately 48 billion tokens across five languages used in Southeast Asia: English, Indonesian, Tamil, Thai, and Vietnamese.
  • Llama 3 Architecture: Uses the Llama 3 decoder-only transformer architecture and inherits its general language capabilities.
  • Evaluated on BHASA Benchmark: Performance assessed using the BHASA evaluation benchmark for SEA languages, covering tasks like Question Answering, Sentiment Analysis, Toxicity Detection, Translation, Summarization, Causal Reasoning, and Natural Language Inference.

Good For

  • Applications requiring strong language understanding and generation in English, Indonesian, Tamil, Thai, and Vietnamese.
  • Developing AI solutions tailored for the Southeast Asian market.
  • Research and development on multilingual LLMs with a regional focus.
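
For readers who want to try the model on text in one of the languages listed above, the following is a minimal sketch, not an official usage recipe. It assumes the repository name at the top of this page resolves on the Hugging Face Hub, that the `transformers`, `torch`, and `accelerate` packages are installed, and that 8192 in the page metadata is the context length. The names `is_supported` and `generate` are hypothetical helpers introduced for illustration, and the prompt is treated as plain text to continue, with no chat template assumed.

```python
# Values taken from this page; their interpretation is an assumption (see lead-in).
MODEL_ID = "aisingapore/Llama-SEA-LION-v2-8B"
MAX_CONTEXT_TOKENS = 8192  # assumed to be the context length
SUPPORTED_LANGUAGES = ("English", "Indonesian", "Tamil", "Thai", "Vietnamese")


def is_supported(language: str) -> bool:
    """Case-insensitive check against the five languages named on this page."""
    return language.strip().lower() in {name.lower() for name in SUPPORTED_LANGUAGES}


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Load the model and continue `prompt` (downloads the weights on first use)."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype="auto",   # use the dtype stored in the checkpoint
        device_map="auto",    # requires the accelerate package
    )
    # Truncate the prompt so prompt plus new tokens fit in the assumed context window.
    inputs = tokenizer(
        prompt,
        return_tensors="pt",
        truncation=True,
        max_length=MAX_CONTEXT_TOKENS - max_new_tokens,
    ).to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate("Ibu kota Indonesia adalah")` would load the weights and return a continuation of the Indonesian prompt. Loading is kept inside the function so that importing the module stays cheap and the language check can be used on its own.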