GoToCompany/Llama-Sahabat-AI-v2-70B-IT
GoToCompany/Llama-Sahabat-AI-v2-70B-IT is a 70 billion parameter decoder-only Large Language Model, co-initiated by PT GoTo Gojek Tokopedia Tbk and Indosat Ooredoo Hutchison, and developed by GoToCompany and AI Singapore. Based on the Llama 3.1 architecture, it features a 128k context length and is specifically instruct-tuned for Indonesian languages, including Javanese, Sundanese, Batak Toba, and Balinese, alongside English. This model excels in local Indonesian context understanding and instruction-following, making it suitable for applications requiring deep linguistic and cultural relevance in Southeast Asia.
Loading preview...
Llama-Sahabat-AI-v2-70B-IT: An Indonesian-Focused LLM
Llama-Sahabat-AI-v2-70B-IT is a 70 billion parameter instruction-tuned decoder model, developed by PT GoTo Gojek Tokopedia Tbk and AI Singapore, with co-initiation from Indosat Ooredoo Hutchison. It is built upon the Llama 3.1 architecture and utilizes its default tokenizer, featuring a substantial 128k context length.
Key Capabilities
- Multilingual Support: Supports English, Indonesian, Javanese, Sundanese, Batak Toba, and Balinese.
- Indonesian Context Understanding: Evaluated on the IndoMMLU benchmark for local Indonesian humanities, language, culture, social science, and STEM tasks.
- General Language Proficiency: Assessed using the SEA-HELM benchmark for tasks like QA, Sentiment Analysis, Translation, and Summarization.
- Instruction Following: Performance on instruction adherence and multi-turn conversations is evaluated with localized SEA-IFEval and SEA-MTBench datasets, using
gpt-4-1106-previewas a judge.
Usage Considerations
Running this model requires significant resources, with a minimum of 140 GB of VRAM for FP16 or BF16 precision. Recommended setups include 4x NVIDIA L40s or 2x NVIDIA H100 GPUs. The model, like many LLMs, may hallucinate or generate irrelevant content, and users should validate its responses. Further details on performance are available on the Sahabat-AI leaderboard.
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.