Gille/StrangeMerges_52-7B-dare_ties
Gille/StrangeMerges_52-7B-dare_ties is a 7 billion parameter language model created by Gille, formed by merging several specialized models including WizardMath-7B-V1.1 and Einstein-v4-7B using the dare_ties method. This model is designed to leverage the strengths of its constituent models, particularly in mathematical reasoning and general language understanding. It achieves an average score of 73.51 on the Open LLM Leaderboard, with notable performance in HellaSwag and GSM8k benchmarks, making it suitable for tasks requiring robust reasoning capabilities.
Loading preview...
Model Overview
Gille/StrangeMerges_52-7B-dare_ties is a 7 billion parameter language model developed by Gille. It is a product of merging multiple specialized models, including WizardLM/WizardMath-7B-V1.1, AurelPx/Percival_01-7b-slerp, Weyaxi/Einstein-v4-7B, Kukedlc/NeuralMaths-Experiment-7b, and Gille/StrangeMerges_35-7B-slerp, using the dare_ties merge method. This approach aims to combine the distinct capabilities of its base models into a single, more versatile model.
Key Capabilities
- Enhanced Reasoning: The inclusion of models like WizardMath and Einstein suggests a focus on improving logical and mathematical reasoning abilities.
- General Language Understanding: Benefits from the diverse training of its merged components to handle a broad range of natural language tasks.
- Competitive Performance: Achieves an average score of 73.51 on the Open LLM Leaderboard, demonstrating solid performance across various benchmarks.
- HellaSwag (10-Shot): 87.15
- GSM8k (5-Shot): 72.25
- MMLU (5-Shot): 64.94
Good For
- Applications requiring strong mathematical and logical reasoning.
- General-purpose text generation and understanding where a balanced performance across multiple domains is desired.
- Developers looking for a 7B parameter model with a blend of specialized capabilities derived from a multi-model merge.
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.