tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.3
Text Generation
Concurrency Cost: 4
Model Size: 70B
Quantization: FP8
Context Length: 32k
Published: Dec 25, 2024
License: llama3.1
Architecture: Transformer
Llama-3.1-Swallow-70B-Instruct-v0.3 is a 70-billion-parameter instruction-tuned large language model developed by tokyotech-llm, built on Meta Llama 3.1. It enhances Japanese language capabilities through continual pre-training on approximately 200 billion Japanese and English tokens while retaining strong English performance. The model is tuned for multi-turn dialogue, generates helpful and detailed responses, and is particularly strong at Japanese conversational tasks.
Popular Sampler Settings
The top 3 parameter combinations used by Featherless users for this model.
temperature: –
top_p: –
top_k: –
frequency_penalty: –
presence_penalty: –
repetition_penalty: –
min_p: –