mesolitica/llama-13b-hf-2048-fpf

Text generation · Concurrency cost: 1 · Model size: 13B · Quant: FP8 · Context length: 4k · Architecture: Transformer

mesolitica/llama-13b-hf-2048-fpf is a 13 billion parameter Llama2-based language model developed by mesolitica. It has undergone full parameter finetuning on Malaysian text, optimizing it for understanding and generating Malaysian-language content. It supports a context length of 4096 tokens, letting it process longer sequences of text relevant to its specialized domain.


Overview

mesolitica/llama-13b-hf-2048-fpf is a 13 billion parameter Llama2 model that has been subjected to full parameter finetuning. The finetuning focused on Malaysian text, specializing the model in processing and generating content relevant to the Malaysian language and context. Its 4096-token context window accommodates longer input and output sequences.

Key Characteristics

  • Architecture: Based on the Llama2 model family.
  • Parameter Count: 13 billion parameters.
  • Context Length: Supports a context window of 4096 tokens.
  • Training Data: Finetuned using a dataset composed of Malaysian text.
  • Training Method: Full parameter finetuning, meaning all model weights were updated during adaptation to the target domain (as opposed to parameter-efficient methods such as LoRA).
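Because the prompt and the generated completion share the 4096-token context window, callers need to budget tokens between them. A minimal sketch of that arithmetic (the function names are illustrative, and exact token counts require the model's own tokenizer):

```python
def fits_context(prompt_tokens: int, max_new_tokens: int, ctx_len: int = 4096) -> bool:
    # The prompt and the requested completion must fit in one context window.
    return prompt_tokens + max_new_tokens <= ctx_len

def max_completion(prompt_tokens: int, ctx_len: int = 4096) -> int:
    # Tokens left for generation after the prompt is accounted for.
    return max(ctx_len - prompt_tokens, 0)
```

For example, a 4000-token prompt leaves only 96 tokens of headroom for generation, so long Malaysian documents may need to be truncated or chunked before inference.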

Use Cases

This model is particularly well-suited for applications requiring strong performance in the Malaysian language. Potential use cases include:

  • Malaysian Language Processing: Tasks such as text generation, summarization, and translation involving Malaysian content.
  • Region-Specific Applications: Developing AI solutions tailored for the Malaysian market or user base.
  • Research: Exploring the impact of full parameter finetuning on Llama2 for low-resource or specific language domains.
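For the use cases above, the model can be loaded like any other Llama2 checkpoint. A minimal inference sketch, assuming the standard Hugging Face `transformers` API and that the model id on this card resolves on the Hub (the Malay prompt is only an example):

```python
MODEL_ID = "mesolitica/llama-13b-hf-2048-fpf"

def generate(prompt: str, max_new_tokens: int = 128) -> str:
    # Imports are kept inside the function so the module can be inspected
    # without torch/transformers installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.float16,  # a 13B model needs ~26 GB in fp16
        device_map="auto",
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=True,
        temperature=0.7,
    )
    return tokenizer.decode(outputs[0], skip_special_tokens=True)

if __name__ == "__main__":
    print(generate("Kuala Lumpur ialah ibu negara Malaysia."))
```

Since this is a base model finetuned on raw text rather than an instruction-tuned chat model, prompts should be phrased as text to continue, not as questions or commands.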

Further details on the finetuning process and performance can be found in the associated GitHub repository and WandB logs.