LLaMAX/LLaMAX3-8B-Alpaca

Hosted on Hugging Face

Text Generation · Concurrency Cost: 1 · Model Size: 8B · Quant: FP8 · Context Length: 8K · Published: Jun 25, 2024 · License: MIT · Architecture: Transformer · Open Weights

LLaMAX/LLaMAX3-8B-Alpaca is an 8 billion parameter language model developed by LLaMAX, built upon the Llama-3 architecture with an 8192 token context length. It is specifically designed for enhanced multilingual capabilities, supporting translation across over 100 languages while maintaining strong instruction-following abilities. This model excels in translation tasks, demonstrating significant performance improvements over other similarly scaled LLMs on benchmarks like Flores-101.


LLaMAX3-8B-Alpaca: Multilingual Translation and Instruction Following

LLaMAX3-8B-Alpaca is an 8 billion parameter language model from the LLaMAX series, engineered to provide robust multilingual translation capabilities alongside strong instruction-following. It was developed by continuing pre-training of Llama-3 on extensive training data covering 102 languages, and then fine-tuning on the English Alpaca instruction dataset.
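
The sketch below shows one way the model might be loaded and prompted for translation with Hugging Face transformers. The Alpaca-style instruction/input/response template and the `translation_prompt` helper are illustrative assumptions based on the Alpaca fine-tuning described above, not code quoted from the model card.

```python
# Minimal sketch: load LLaMAX3-8B-Alpaca with transformers and prompt it for
# translation using an Alpaca-style template (assumed format, not official).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "LLaMAX/LLaMAX3-8B-Alpaca"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

def translation_prompt(text: str, src_lang: str, tgt_lang: str) -> str:
    """Wrap a translation request in the Alpaca instruction/input/response layout."""
    instruction = f"Translate the following sentences from {src_lang} to {tgt_lang}."
    return (
        "Below is an instruction that describes a task, paired with an input that "
        "provides further context. Write a response that appropriately completes the request.\n"
        f"### Instruction:\n{instruction}\n"
        f"### Input:\n{text}\n"
        "### Response:"
    )

prompt = translation_prompt("The weather is lovely today.", "English", "German")
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=False)
# Print only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```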

Key Capabilities

  • Extensive Multilingual Translation: Supports translation between more than 100 languages, outperforming similarly scaled LLMs in this domain.
  • Enhanced Translation Performance: Achieves an average spBLEU score improvement of more than 5 points over LLaMA3-8B-Alpaca on the Flores-101 benchmark, with gains across translation directions (e.g., en-X, X-en, zh-X, X-zh); a scoring sketch follows this list.
  • Instruction Following: Retains strong instruction-following capabilities, making it suitable for tasks beyond just translation.
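
As a rough illustration of how a Flores-101 comparison like the one above can be scored, the sketch below computes spBLEU with sacrebleu. It assumes sacrebleu >= 2.0, whose `flores101` tokenizer implements the SentencePiece-based spBLEU metric; the hypothesis and reference sentences are placeholders, not benchmark data.

```python
# Minimal sketch: score model translations against references with spBLEU.
import sacrebleu

hypotheses = [
    "Das Wetter ist heute schön.",          # model output for sentence 1 (placeholder)
    "Ich habe das Buch gestern gelesen.",   # model output for sentence 2 (placeholder)
]
references = [
    "Das Wetter ist heute herrlich.",       # reference translation 1 (placeholder)
    "Ich habe das Buch gestern gelesen.",   # reference translation 2 (placeholder)
]

# corpus_bleu expects a list of reference streams, hence the extra nesting.
score = sacrebleu.corpus_bleu(hypotheses, [references], tokenize="flores101")
print(f"spBLEU: {score.score:.2f}")
```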

Good For

  • Multilingual Applications: Ideal for developers building applications requiring accurate translation across a wide array of languages.
  • Cross-Lingual Communication: Facilitating communication and content generation in diverse linguistic contexts.
  • Research in Multilingual LLMs: Serving as a robust foundation model for further research into multilingual natural language processing.

For more technical details, refer to the LLaMAX paper.

Popular Sampler Settings

The three most popular parameter combinations used by Featherless users for this model adjust the following sampler settings (the specific values belong to each configuration and are not reproduced here):

  • temperature
  • top_p
  • top_k
  • frequency_penalty
  • presence_penalty
  • repetition_penalty
  • min_p
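
As a hedged illustration of how such sampler settings could be applied, the sketch below sends them to an OpenAI-compatible chat completions endpoint. The base URL, parameter values, and the `extra_body` pass-through for non-standard knobs are assumptions, not configurations recorded from this page; check your provider's documentation for the parameters it actually honors.

```python
# Minimal sketch: passing sampler settings to an OpenAI-compatible endpoint.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="LLaMAX/LLaMAX3-8B-Alpaca",
    messages=[{"role": "user", "content": "Translate to French: Good morning, everyone."}],
    temperature=0.7,            # illustrative values, not the page's configs
    top_p=0.9,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # Non-standard sampler knobs are typically forwarded via extra_body.
    extra_body={"top_k": 40, "min_p": 0.05, "repetition_penalty": 1.05},
)
print(response.choices[0].message.content)
```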