mlabonne/ChimeraLlama-3-8B-v3

Text generation · Concurrency cost: 1 · Model size: 8B · Quant: FP8 · Context length: 8K · Published: May 1, 2024 · License: other · Architecture: Transformer

mlabonne/ChimeraLlama-3-8B-v3 is an 8 billion parameter language model based on the Llama 3 architecture, created by mlabonne through a merge of several Llama 3-based models using LazyMergekit. This model integrates various instruction-tuned and DPO-optimized Llama 3 variants to enhance general performance. It is designed for broad applicability in conversational AI and instruction-following tasks, leveraging the strengths of its constituent models.


Model Overview

mlabonne/ChimeraLlama-3-8B-v3 is an 8 billion parameter language model developed by mlabonne. It is a product of merging multiple Llama 3-based models, including instruction-tuned and DPO-optimized variants, using the LazyMergekit tool. This approach aims to combine the strengths of its constituent models to achieve improved overall performance in various natural language processing tasks.

Key Characteristics

  • Architecture: Based on the Llama 3 family, leveraging its foundational capabilities.
  • Merge Method: Utilizes the dare_ties merge method, integrating models such as NousResearch/Meta-Llama-3-8B-Instruct, mlabonne/OrpoLlama-3-8B, and cognitivecomputations/dolphin-2.9-llama3-8b, among others.
  • Context Length: Supports an 8192-token context window.
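
The exact merge configuration is not reproduced here, but a dare_ties merge in mergekit (the tool LazyMergekit wraps) is expressed as a YAML config. The sketch below is a hedged illustration for the models named above; the `density` and `weight` values are hypothetical placeholders, not the values mlabonne actually used:

```yaml
# Illustrative dare_ties config for mergekit; density/weight values are hypothetical.
models:
  - model: NousResearch/Meta-Llama-3-8B
    # Base model: contributes no task vector of its own.
  - model: NousResearch/Meta-Llama-3-8B-Instruct
    parameters:
      density: 0.6
      weight: 0.5
  - model: mlabonne/OrpoLlama-3-8B
    parameters:
      density: 0.55
      weight: 0.2
  - model: cognitivecomputations/dolphin-2.9-llama3-8b
    parameters:
      density: 0.55
      weight: 0.2
merge_method: dare_ties
base_model: NousResearch/Meta-Llama-3-8B
parameters:
  int8_mask: true
dtype: float16
```

In a dare_ties merge, each fine-tuned model's delta from the base is randomly sparsified (controlled by `density`), sign-consensus-filtered, and then combined with the given `weight`, which is why the merged model can inherit behavior from several instruction-tuned parents at once.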

Performance Insights

Evaluations on the Open LLM Leaderboard indicate balanced performance across several benchmarks:

  • Average Score: 20.53
  • IFEval (0-Shot): 44.08
  • BBH (3-Shot): 27.65
  • MMLU-PRO (5-Shot): 29.65

These scores suggest a model capable of handling instruction-following, common-sense reasoning, and general knowledge tasks, making it suitable for a range of applications requiring robust language understanding and generation.

Popular Sampler Settings

The sampler parameters most commonly adjusted by Featherless users for this model:

  • temperature
  • top_p
  • top_k
  • frequency_penalty
  • presence_penalty
  • repetition_penalty
  • min_p
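
These parameters shape how the next token is drawn from the model's output distribution at decode time. As a minimal, stdlib-only sketch of how temperature, top-k, and top-p interact (not Featherless's actual backend, and the filter order shown, top-k before top-p, is an assumption matching common inference pipelines):

```python
import math
import random

def sample_next_token(logits, temperature=0.7, top_k=40, top_p=0.9, rng=None):
    """Pick a token index from raw logits using common sampler settings."""
    rng = rng or random.Random()
    # Temperature: lower values sharpen the distribution, higher values flatten it.
    scaled = [l / max(temperature, 1e-8) for l in logits]
    # Top-k: keep only the k highest-scoring candidates.
    ranked = sorted(range(len(scaled)), key=lambda i: scaled[i], reverse=True)
    kept = ranked[:top_k] if top_k > 0 else ranked
    # Softmax over the surviving candidates (shifted by the max for stability).
    peak = max(scaled[i] for i in kept)
    exps = [(i, math.exp(scaled[i] - peak)) for i in kept]
    total = sum(e for _, e in exps)
    probs = [(i, e / total) for i, e in exps]  # sorted high-to-low
    # Top-p (nucleus): keep the smallest prefix whose cumulative mass reaches top_p.
    nucleus, cum = [], 0.0
    for i, p in probs:
        nucleus.append((i, p))
        cum += p
        if cum >= top_p:
            break
    # Renormalize the nucleus and draw one token.
    mass = sum(p for _, p in nucleus)
    draw, acc = rng.random() * mass, 0.0
    for i, p in nucleus:
        acc += p
        if acc >= draw:
            return i
    return nucleus[-1][0]
```

Frequency, presence, and repetition penalties act one step earlier, adjusting the logits of already-seen tokens before any of the filtering above is applied.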