proxectonos/Llama-3.1-Carballo

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Sep 17, 2024License:llama3.1Architecture:Transformer0.0K Warm

proxectonos/Llama-3.1-Carballo is an 8-billion parameter transformer-based causal language model developed by proxectonos. It is a continually pretrained version of Meta Llama-3.1-8B, specialized for multilingual tasks with a strong emphasis on Galician, Portuguese, Spanish, Catalan, and English. This model excels at text generation and is optimized for scenarios requiring proficiency in these Romance languages, particularly Galician.

Loading preview...

Llama-3.1-Carballo: Multilingual LLM for Galician and Romance Languages

Llama-3.1-Carballo is an 8-billion parameter causal language model developed by proxectonos, built upon the Meta Llama-3.1-8B architecture. It has undergone extensive continual pretraining on a nearly 20-billion token multilingual corpus, with a significant focus on Galician texts.

Key Capabilities

  • Multilingual Proficiency: Specialized in Galician, Portuguese, Spanish, Catalan, and English, with a particular strength in Galician.
  • Text Generation: Ready-to-use for various text generation tasks.
  • Fine-tuning Base: Suitable as a base model for further fine-tuning on specific downstream applications.

Training Details

The model was trained using HuggingFace Transformers and PyTorch, leveraging DeepSpeed for efficiency. The training corpus included 5 billion Galician tokens (from CorpusNós), 3 billion Portuguese, 3.5 billion Spanish, 3.4 billion English, and 3.6 billion Catalan tokens (from CATalog). Training was conducted at the Galicia Supercomputing Center (CESGA).

Intended Use

Llama-3.1-Carballo is designed for causal language modeling and can be used for tasks such as translation, question answering, sentiment analysis, and named entity recognition in its supported languages. It is particularly valuable for applications requiring high performance in Galician.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p