mesolitica/llama-7b-hf-16384-fpf
mesolitica/llama-7b-hf-16384-fpf is a 7-billion-parameter Llama2-based language model developed by Mesolitica, with its context length extended to 16,384 tokens through full parameter finetuning on Malaysian text. It is suited to applications that need both long-context processing and fluent handling of Malaysian-language content.
Overview
The mesolitica/llama-7b-hf-16384-fpf is a 7-billion-parameter Llama2-based language model developed by Mesolitica. Its primary distinguishing feature is an extended context length of 16,384 tokens, four times the 4,096-token window of standard Llama2 models. The extension was achieved through full parameter finetuning (FPF) on a dataset of Malaysian text.
Key Capabilities
- Extended Context Window: Processes sequences of up to 16,384 tokens, which is beneficial for long documents, extended conversations, or code.
- Malaysian Language Specialization: Finetuned on Malaysian text, enhancing its performance and fluency in this specific language.
- Llama2 Architecture: Benefits from the robust and well-established Llama2 foundational architecture.
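Because the model follows the standard Llama2 format, it can be loaded with the Hugging Face transformers API. The sketch below is illustrative, not from the model card: the Malay prompt, generation settings, and the `fits_in_context` helper (which checks that prompt plus generation stays within the 16,384-token window) are assumptions; `device_map="auto"` additionally requires the accelerate package.

```python
def fits_in_context(n_prompt_tokens, n_new_tokens, max_context=16384):
    # The model's window is 16384 tokens; the prompt plus the tokens
    # to be generated must fit inside it.
    return n_prompt_tokens + n_new_tokens <= max_context


if __name__ == "__main__":
    # Requires `transformers` and `torch`; `accelerate` for device_map="auto".
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "mesolitica/llama-7b-hf-16384-fpf"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype="auto", device_map="auto"
    )

    # Illustrative Malay prompt.
    prompt = "Kuala Lumpur ialah ibu negara Malaysia."
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    assert fits_in_context(inputs["input_ids"].shape[1], 256)

    outputs = model.generate(**inputs, max_new_tokens=256)
    print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

A 7B model at 16k context is memory-hungry at inference time, so quantized loading (for example via bitsandbytes) may be worth considering on smaller GPUs.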
Good For
- Applications requiring deep linguistic understanding and generation in the Malaysian language.
- Tasks that involve processing long documents or conversations where a larger context window is critical.
- Research and development focused on low-resource languages or regional language models.
Further details on the finetuning process and performance metrics can be found in the original README and the associated WandB project.