Name: tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.3 API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: tokyotech-llm

Llama 3.1 Swallow 70B Instruct v0.3: Enhanced Japanese Conversational AI

Llama-3.1-Swallow-70B-Instruct-v0.3 is a 70 billion parameter instruction-tuned model from tokyotech-llm, based on Meta's Llama 3.1 architecture. This model significantly enhances Japanese language capabilities through extensive continual pre-training on a diverse corpus of approximately 200 billion Japanese and English tokens, including the Swallow Corpus Version 2, Wikipedia articles, and mathematical/coding content. It maintains the strong English language performance of its Llama 3.1 foundation.

Key Capabilities & Differentiators

Bilingual Proficiency: Optimized for both Japanese and English, with a particular focus on improving Japanese understanding and generation.
Instruction-Tuned for Dialogue: Fine-tuned using synthetic Japanese datasets to generate helpful and detailed responses in multi-turn conversations.
Improved Conversational Performance: Outperforms its predecessor, Llama-3.1-Swallow-70B-Instruct-v0.1, by 5.68 points on the Japanese MT-Bench benchmark, indicating enhanced dialogue capabilities.
Comprehensive Evaluation: Evaluated across a wide range of Japanese and English benchmarks, including MT-Bench JA, JCommonsenseQA, JHumanEval, MMLU, and HumanEval.

When to Use This Model

This model is particularly well-suited for applications requiring robust bilingual (Japanese and English) conversational AI. Its instruction-tuned nature makes it effective for generating detailed responses to user queries and engaging in multi-turn dialogues, especially in Japanese-centric use cases.