Name: Mihaiii/dolphin-2.6-mistral-7b-dpo-5.93B API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Mihaiii

Overview

Mihaiii/dolphin-2.6-mistral-7b-dpo-5.93B is a pruned version of the cognitivecomputations/dolphin-2.6-mistral-7b-dpo model. The original model, based on the Mistral architecture, had 7.24 billion parameters. This version has been reduced to 5.93 billion parameters, representing an approximate 18% reduction in size.

Key Characteristics

Parameter Reduction: Achieved by eliminating specific layers from the base model, focusing on self_attn.v_proj layers.
Methodology: The pruning process utilized laserQlora.ipynb from cognitivecomputations/laserRMT to identify layers for removal, followed by mergekit for the actual layer elimination.
Efficiency: The reduction in parameters aims to provide a more efficient model while striving to maintain performance, making it suitable for resource-constrained environments or faster inference.

When to Use This Model

This model is ideal for developers looking for a more compact and potentially faster alternative to the full dolphin-2.6-mistral-7b-dpo model. It's particularly useful for applications where a smaller footprint and improved inference speed are critical, without a drastic compromise on the capabilities inherited from its Mistral-based origin.

Overview

Overview

Key Characteristics

When to Use This Model

Full Model Card (README)