mlabonne/Darewin-7B

Text Generation | Concurrency Cost: 1 | Model Size: 7B | Quant: FP8 | Ctx Length: 4k | Published: Jan 23, 2024 | License: apache-2.0 | Architecture: Transformer

Darewin-7B is a 7-billion-parameter language model created by mlabonne by merging six Mistral-7B-based models with LazyMergekit using the dare_ties merge method. The merge aims to combine the strengths of its constituent models, including Intel/neural-chat-7b-v3-3 and openchat/openchat-3.5-0106, for balanced performance across reasoning and language understanding tasks. It averages 71.87 on the Open LLM Leaderboard, making it suitable for general-purpose applications that require robust language capabilities.


Darewin-7B: A Merged 7B Language Model

Darewin-7B is a 7-billion-parameter model developed by mlabonne, constructed by merging six different Mistral-7B-based models. It uses the dare_ties merge method via LazyMergekit to integrate diverse capabilities from its components, such as Intel/neural-chat-7b-v3-3, openaccess-ai-collective/DPOpenHermes-7B-v2, and openchat/openchat-3.5-0106.

Key Capabilities & Performance

Darewin-7B exhibits strong performance across a range of benchmarks, achieving an average score of 71.87 on the Open LLM Leaderboard. Notable scores include the following; a rough reproduction sketch appears after the list:

  • AI2 Reasoning Challenge (25-Shot): 68.60
  • HellaSwag (10-Shot): 86.22
  • MMLU (5-Shot): 65.21
  • GSM8k (5-Shot): 71.04
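
These few-shot settings match the Open LLM Leaderboard's evaluation harness. As a rough reproduction sketch using EleutherAI's lm-evaluation-harness (the task name, batch size, and result keys are assumptions about the harness setup, not details from this card):

```python
# Sketch: re-running one leaderboard task locally with lm-evaluation-harness
# (pip install lm-eval). Harness version and settings may differ from the
# leaderboard's exact setup, so scores may not match the card precisely.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=mlabonne/Darewin-7B,dtype=bfloat16",
    tasks=["arc_challenge"],   # AI2 Reasoning Challenge
    num_fewshot=25,            # 25-shot, matching the leaderboard setting
    batch_size=8,              # illustrative; tune to available VRAM
)
print(results["results"]["arc_challenge"])
```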

The merge configuration uses a bfloat16 dtype and enables int8_mask, which stores the intermediate merge masks in int8 to reduce memory use during merging.
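
For illustration, a dare_ties merge of this kind is typically driven by a mergekit YAML config. The sketch below guesses at the shape of such a config: the base model and all density/weight values are placeholders, and only three of the six source models are named in this card; the method, dtype, and int8_mask come from the text above.

```python
# Sketch: writing a mergekit config and invoking the mergekit-yaml CLI
# (pip install mergekit). Placeholder values are marked as such.
import subprocess
from pathlib import Path

CONFIG = """\
models:
  - model: mistralai/Mistral-7B-v0.1    # assumed base model (placeholder)
  - model: Intel/neural-chat-7b-v3-3
    parameters:
      density: 0.6   # fraction of delta weights kept (placeholder)
      weight: 0.2    # contribution to the merge (placeholder)
  - model: openaccess-ai-collective/DPOpenHermes-7B-v2
    parameters:
      density: 0.6
      weight: 0.2
  - model: openchat/openchat-3.5-0106
    parameters:
      density: 0.6
      weight: 0.2
  # ...the actual merge includes three more Mistral-7B-based models...
merge_method: dare_ties
base_model: mistralai/Mistral-7B-v0.1   # placeholder
parameters:
  int8_mask: true                       # stated in this card
dtype: bfloat16                         # stated in this card
"""

Path("darewin.yaml").write_text(CONFIG)
# mergekit's CLI entry point; writes the merged model to the output directory.
subprocess.run(
    ["mergekit-yaml", "darewin.yaml", "./darewin-7b-merge", "--copy-tokenizer"],
    check=True,
)
```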

Ideal Use Cases

  • General-purpose language generation: Its balanced performance makes it suitable for a wide array of text-based tasks.
  • Reasoning and question answering: Strong ARC, MMLU, and GSM8k scores suggest proficiency in multi-step reasoning and math word problems.
  • Applications requiring robust language understanding: Effective for tasks such as summarization, content creation, and conversational AI; a minimal loading sketch follows.
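
A minimal loading sketch with Hugging Face transformers follows; the prompt, sampling parameters, and the assumption that the tokenizer ships a chat template are illustrative rather than taken from this card.

```python
# Sketch: basic text generation with transformers. Sampling parameters are
# illustrative defaults, not settings recommended by the card.
import torch
from transformers import AutoTokenizer, pipeline

model_id = "mlabonne/Darewin-7B"
tokenizer = AutoTokenizer.from_pretrained(model_id)

messages = [{"role": "user", "content": "Explain what a model merge is."}]
# Assumes the tokenizer defines a chat template; otherwise format the
# prompt manually in the format the model expects.
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

generator = pipeline(
    "text-generation",
    model=model_id,
    tokenizer=tokenizer,
    torch_dtype=torch.bfloat16,  # matches the merge dtype noted above
    device_map="auto",
)
output = generator(
    prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_p=0.95
)
print(output[0]["generated_text"])
```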