SemanticAlignment/Llama-3.1-8B-Italian-LAPT-instruct

Text generation · Concurrency cost: 1 · Model size: 8B · Quant: FP8 · Context length: 32k · Published: Feb 9, 2026 · Architecture: Transformer

Llama-3.1-8B-Italian-LAPT-instruct is an 8 billion parameter instruction-tuned Llama 3.1 model developed by SapienzaNLP, ISTI-CNR, and ILC-CNR. It was continually pre-trained and instruction-tuned specifically for the Italian language, and it shows improved performance on Italian benchmarks such as ITA-Bench, making it well suited to Italian-centric natural language processing tasks.


Overview

Llama-3.1-8B-Italian-LAPT-instruct is an 8 billion parameter large language model, part of the Llama-3.1-8B-Adapted collection. Developed by SapienzaNLP, ISTI-CNR, and ILC-CNR, this model is a continually trained and instruction-tuned variant of the Llama 3.1 architecture, specifically optimized for the Italian language.

Key Capabilities & Training

  • Italian Language Adaptation: The model was adapted using a custom dataset skewed towards Italian, combining 9 billion tokens from the Italian part of CulturaX with 3 billion English tokens.
  • Instruction Tuning: Further fine-tuned on a diverse set of instruction-following datasets, including Italian and multilingual resources like TÜLU-v3, LIMA, WildChat-IT, TowerBlocks-v0.2, GPT-4o-ITA-Instruct, and Aya.
  • Performance: Achieves competitive results on the ITA-Bench evaluation suite, scoring 58.5 on MMLU (5-shot), 47.9 on ARC-C (5-shot), 62.4 on Hellaswag (0-shot), and 67.3 on IFEval (inst_level), outperforming the original Llama-3.1 and Mistral-0.1 models on these Italian-focused metrics.

Good For

  • Applications requiring strong performance in Italian natural language understanding and generation.
  • Instruction-following tasks in Italian.
  • Research and development focusing on multilingual LLMs with a specific emphasis on Italian.
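For Italian instruction-following use cases like those above, the model can be queried through the Hugging Face `transformers` chat-template API. The sketch below is illustrative, not official usage: the repo id is assumed from this card's title, `build_chat` is a hypothetical helper, and the Italian system prompt is an example choice.

```python
def build_chat(user_prompt: str,
               system_prompt: str = "Sei un assistente utile.") -> list[dict]:
    """Build a message list in the format expected by
    tokenizer.apply_chat_template for Llama-3.1 instruct models."""
    return [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_prompt},
    ]


def generate(prompt: str,
             model_id: str = "SemanticAlignment/Llama-3.1-8B-Italian-LAPT-instruct",
             max_new_tokens: int = 256) -> str:
    """Load the model and generate a reply to an Italian prompt.
    Heavy imports are kept inside the function so the helper above
    can be used without downloading the 8B checkpoint."""
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # FP8 serving shown above is host-side; bf16 is a safe local default
        device_map="auto",
    )
    input_ids = tokenizer.apply_chat_template(
        build_chat(prompt),
        add_generation_prompt=True,
        return_tensors="pt",
    ).to(model.device)
    output = model.generate(input_ids, max_new_tokens=max_new_tokens, do_sample=False)
    # Decode only the newly generated tokens, not the prompt.
    return tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True)


if __name__ == "__main__":
    print(generate("Riassumi in una frase la storia di Roma antica."))
```

The chat-template call handles Llama 3.1's special tokens automatically, so prompts do not need manual `<|begin_of_text|>`-style formatting.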