Model Overview
The uukuguy/speechless-mistral-7b-dare-0.85 is a 7 billion parameter language model based on the Mistral architecture, developed by uukuguy. This model is an experimental application of the DARE (Drop And REscale) method, in which 85% of the delta parameters (the difference between the fine-tuned weights and the base Mistral-7B weights) are randomly set to zero and the surviving deltas are rescaled by 1/(1 - 0.85) to preserve the expected update. This technique aims to maintain model capabilities while drastically reducing the number of active delta parameters.
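For intuition, here is a minimal sketch of the drop-and-rescale step on a single weight tensor. The function name `dare`, its signature, and the use of PyTorch are illustrative assumptions; this is not the author's actual merging code.

```python
import torch

def dare(base_weight: torch.Tensor,
         finetuned_weight: torch.Tensor,
         drop_rate: float = 0.85) -> torch.Tensor:
    """Drop And REscale on one weight tensor: randomly zero a fraction
    of the delta parameters, then rescale the survivors so the expected
    update matches the original delta."""
    delta = finetuned_weight - base_weight            # incremental (delta) parameters
    keep_prob = 1.0 - drop_rate                       # 0.15 when drop_rate = 0.85
    keep_mask = torch.bernoulli(torch.full_like(delta, keep_prob))
    return base_weight + delta * keep_mask / keep_prob  # rescale by 1 / (1 - p)
```

The 1/(1 - p) rescaling is what lets such an aggressive drop rate work: on average, the sparse delta contributes the same total update as the dense one.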
Key Characteristics & Performance
- Parameter Efficiency: Retains over 97.5% of the original model's performance after discarding 85% of the delta (incremental) parameters, indicating high efficiency.
- Benchmark Performance: Evaluated on ARC, HellaSwag, MMLU, TruthfulQA, and Winogrande:
  - MMLU: significant increase in performance.
  - TruthfulQA: notable decrease in performance.
  - ARC, HellaSwag, Winogrande: performance largely maintained or slightly improved.
- Merged Model: This specific model merges six DARE-processed models into a single checkpoint, which shapes its overall performance profile (see the merging sketch after this list).
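The exact merge recipe is not specified in this card, but a plausible reading is that each source model's delta is independently drop-and-rescaled and the sparse deltas are then combined on the shared Mistral-7B base. The sketch below assumes equal weighting across source models; that averaging choice is an assumption, not a documented detail.

```python
import torch

def merge_dare(base_weight: torch.Tensor,
               finetuned_weights: list[torch.Tensor],
               drop_rate: float = 0.85) -> torch.Tensor:
    """Merge several fine-tuned variants of the same base tensor by
    averaging their DARE-sparsified deltas (equal weighting assumed)."""
    keep_prob = 1.0 - drop_rate
    merged = base_weight.clone()
    for finetuned in finetuned_weights:
        delta = finetuned - base_weight
        keep_mask = torch.bernoulli(torch.full_like(delta, keep_prob))
        # average the rescaled sparse deltas across the source models
        merged += delta * keep_mask / keep_prob / len(finetuned_weights)
    return merged
```

Because each delta is ~85% zeros, the sparsified deltas overlap far less than dense ones would, which is the usual argument for why DARE-based merges interfere less than naive weight averaging.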
Use Cases
This model is particularly suitable for applications where:
- Resource Constraints: Efficient parameter usage is critical, allowing for deployment in environments with limited computational resources.
- General Language Understanding: Tasks requiring strong performance on general-knowledge and commonsense benchmarks such as MMLU, HellaSwag, and Winogrande.
- Experimental Research: Exploring the effects and benefits of parameter pruning techniques like DARE in large language models.
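For such experiments, the merged checkpoint can be loaded like any other Mistral-style causal LM via the Hugging Face transformers library; the prompt and generation settings below are illustrative defaults, not recommendations from the model authors.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "uukuguy/speechless-mistral-7b-dare-0.85"
tokenizer = AutoTokenizer.from_pretrained(model_id)
# device_map="auto" requires the accelerate package
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)

prompt = "Explain the DARE pruning method in one sentence."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```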