Gille/StrangeMerges_21-7B-slerp

Text Generation · Concurrency Cost: 1 · Model Size: 7B · Quant: FP8 · Ctx Length: 4k · Published: Feb 12, 2024 · License: apache-2.0 · Architecture: Transformer · Open Weights

Gille/StrangeMerges_21-7B-slerp is a 7-billion-parameter language model created by Gille through a slerp merge of StrangeMerges_20-7B-slerp and NeuTrixOmniBe-7B-model-remix. The model targets general language tasks, and its merged weights achieve an average score of 76.29 on the Open LLM Leaderboard. It performs strongly across several benchmarks, including 88.95 on HellaSwag and 84.61 on Winogrande, making it suitable for applications that require robust language understanding and generation.


Model Overview

Gille/StrangeMerges_21-7B-slerp is a 7-billion-parameter language model developed by Gille. It was created with a spherical linear interpolation (slerp) merge, combining two base models: Gille/StrangeMerges_20-7B-slerp and Kukedlc/NeuTrixOmniBe-7B-model-remix. This merging technique aims to blend the strengths of its constituent models, offering balanced performance across a range of tasks.
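A slerp merge interpolates each pair of corresponding weight tensors along the great-circle arc between them rather than mixing them linearly. The sketch below is a minimal NumPy illustration of the idea, not the exact implementation used to build this model:

```python
import numpy as np

def slerp(t, a, b, eps=1e-8):
    """Spherical linear interpolation between two weight tensors.

    Interpolates along the arc between a and b; at t=0 it returns a,
    at t=1 it returns b. Falls back to plain linear interpolation
    when the vectors are nearly colinear, where slerp is unstable.
    """
    a_flat, b_flat = a.ravel(), b.ravel()
    a_unit = a_flat / (np.linalg.norm(a_flat) + eps)
    b_unit = b_flat / (np.linalg.norm(b_flat) + eps)
    dot = np.clip(np.dot(a_unit, b_unit), -1.0, 1.0)
    if abs(dot) > 0.9995:  # nearly colinear: lerp is numerically safer
        return (1 - t) * a + t * b
    theta = np.arccos(dot)           # angle between the weight vectors
    sin_theta = np.sin(theta)
    w_a = np.sin((1 - t) * theta) / sin_theta
    w_b = np.sin(t * theta) / sin_theta
    return (w_a * a_flat + w_b * b_flat).reshape(a.shape)

# Toy tensors standing in for one layer's weights from each parent model
a = np.array([[1.0, 0.0], [0.5, 0.5]])
b = np.array([[0.0, 1.0], [0.5, -0.5]])
merged = slerp(0.5, a, b)
```

In a real merge this function would be applied tensor-by-tensor across both parent checkpoints, with `t` controlling how far the result leans toward the second parent.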

Key Capabilities & Performance

This model demonstrates solid performance on the Open LLM Leaderboard, achieving an average score of 76.29. Specific benchmark results highlight its capabilities:

  • AI2 Reasoning Challenge (25-shot): 74.23
  • HellaSwag (10-shot): 88.95
  • MMLU (5-shot): 65.05
  • TruthfulQA (0-shot): 73.81
  • Winogrande (5-shot): 84.61
  • GSM8k (5-shot): 71.11

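The leaderboard average is simply the unweighted mean of the six benchmark scores listed above, which is easy to verify:

```python
# Open LLM Leaderboard scores reported above; the leaderboard
# average is the unweighted mean of the six benchmarks.
scores = {
    "ARC (25-shot)": 74.23,
    "HellaSwag (10-shot)": 88.95,
    "MMLU (5-shot)": 65.05,
    "TruthfulQA (0-shot)": 73.81,
    "Winogrande (5-shot)": 84.61,
    "GSM8k (5-shot)": 71.11,
}
average = round(sum(scores.values()) / len(scores), 2)
print(average)  # 76.29
```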
These scores indicate proficiency in reasoning, common sense, language understanding, and mathematical problem solving. The merge configuration applied separate interpolation weights (t values) to parameters matched by self-attention and MLP filters, letting each component of the network lean more heavily toward one parent model.
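Merge tools such as mergekit typically express these per-component t values as filters matched against parameter names. The sketch below illustrates that filtering logic; the function name and the numeric t values are placeholders, since the model card excerpt does not restate the actual configuration:

```python
def t_for_parameter(name, t_self_attn=0.5, t_mlp=0.5, t_default=0.5):
    """Pick an interpolation weight t by parameter name.

    Mimics filter-based merge configs: parameters whose names match
    'self_attn' or 'mlp' get their own t; everything else uses the
    default. All numeric values here are illustrative placeholders.
    """
    if "self_attn" in name:
        return t_self_attn
    if "mlp" in name:
        return t_mlp
    return t_default

# Example: attention weights lean toward one parent, MLP weights toward the other
t_attn = t_for_parameter("model.layers.0.self_attn.q_proj.weight", 0.3, 0.7)
t_mlp = t_for_parameter("model.layers.0.mlp.gate_proj.weight", 0.3, 0.7)
```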

When to Use This Model

StrangeMerges_21-7B-slerp is a versatile 7B model suitable for general-purpose language generation and understanding tasks. Its balanced performance across multiple benchmarks suggests it can be effectively used in applications requiring:

  • Text generation: For creative writing, content creation, or conversational AI.
  • Reasoning tasks: Given its scores on ARC and GSM8k.
  • Question answering: Supported by its performance on TruthfulQA and MMLU.

Developers looking for a robust 7B model with a strong foundation from merged architectures may find this model particularly useful.