rinna/llama-3-youko-8b
Text Generation
Concurrency Cost: 1
Model Size: 8B
Quant: FP8
Ctx Length: 8k
Published: May 1, 2024
License: llama3
Architecture: Transformer
0.1K Warm
rinna/llama-3-youko-8b is an 8-billion-parameter language model developed by rinna, continually pre-trained from Meta-Llama-3-8B. It is optimized for Japanese language tasks through training on an additional 22 billion tokens drawn from a mixture of Japanese and English datasets. This continual pre-training significantly improves performance on Japanese benchmarks relative to the base model, making it suitable for applications that require strong Japanese language understanding and generation.
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model.
Parameters: temperature, top_p, top_k, frequency_penalty, presence_penalty, repetition_penalty, min_p
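As a rough illustration of how the sampler parameters named above might be passed to the model, here is a minimal sketch that builds a plain text-completion request. It rests on several assumptions that are not stated on this page: that the model is served through an OpenAI-compatible completions endpoint at the URL in `API_URL`, that the endpoint accepts all seven sampler fields in the request body, and that a bearer token is used for authentication. The numeric values in `EXAMPLE_SAMPLER` are placeholders chosen for illustration, not the popular settings for this model. Because the model is a continually pre-trained base model, the sketch sends a raw prompt rather than a chat conversation.

```python
import json
import urllib.request

# Assumption: an OpenAI-compatible completions endpoint; change to match your provider.
API_URL = "https://api.featherless.ai/v1/completions"
MODEL_ID = "rinna/llama-3-youko-8b"

# Placeholder values for illustration only; they are NOT taken from this page.
EXAMPLE_SAMPLER = {
    "temperature": 0.7,
    "top_p": 0.9,
    "top_k": 40,
    "frequency_penalty": 0.0,
    "presence_penalty": 0.0,
    "repetition_penalty": 1.05,
    "min_p": 0.05,
}

# Only the sampler names listed on this page are accepted by the helper.
ALLOWED_SAMPLER_KEYS = frozenset(EXAMPLE_SAMPLER)


def build_payload(prompt, max_tokens=128, **sampler):
    """Return the JSON-serializable request body for one completion call."""
    unknown = set(sampler) - ALLOWED_SAMPLER_KEYS
    if unknown:
        raise ValueError(f"unknown sampler parameters: {sorted(unknown)}")
    payload = {"model": MODEL_ID, "prompt": prompt, "max_tokens": max_tokens}
    payload.update(sampler)  # only the parameters the caller actually set
    return payload


def complete(prompt, api_key, max_tokens=128, **sampler):
    """Send the request and return the parsed JSON response (shape depends on the server)."""
    body = json.dumps(build_payload(prompt, max_tokens, **sampler)).encode("utf-8")
    request = urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)
```

A call such as `complete("日本で一番高い山は", api_key, **EXAMPLE_SAMPLER)` would then send a Japanese prompt for the model to continue. Keeping `build_payload` separate from the network call makes it easy to inspect or log the exact parameters before anything is sent.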