Gille/StrangeMerges_7-7B-slerp

Text Generation · Model size: 7B · Quantization: FP8 · Context length: 4k · Published: Jan 28, 2024 · License: apache-2.0 · Architecture: Transformer · Open weights

StrangeMerges_7-7B-slerp is a 7 billion parameter language model created by Gille by slerp-merging Gille/StrangeMerges_6-7B-dare_ties and berkeley-nest/Starling-LM-7B-alpha. The merge blends the weights of the two constituent models rather than training a new one, so the result inherits characteristics from both predecessors. The model targets general text generation tasks and supports a context length of 4096 tokens.


Overview

StrangeMerges_7-7B-slerp is a 7 billion parameter language model developed by Gille. It is constructed using a slerp (spherical linear interpolation) merge method from two distinct base models: Gille/StrangeMerges_6-7B-dare_ties and berkeley-nest/Starling-LM-7B-alpha.

Key Characteristics

  • Merge Technique: Uses slerp (spherical linear interpolation) to combine model weights, with separate interpolation factors (t values) for the self-attention (self_attn) and MLP (mlp) layers and a general t value for the remaining parameters; see the sketch after this list.
  • Base Models: Built upon Gille/StrangeMerges_6-7B-dare_ties as the primary base model, integrating features from berkeley-nest/Starling-LM-7B-alpha.
  • Parameter Count: A 7 billion parameter model, offering a balance between performance and computational efficiency.
  • Context Length: Supports a context window of 4096 tokens.
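
For intuition, the sketch below shows how spherical linear interpolation blends two weight tensors for a given interpolation factor t. It is a minimal NumPy illustration of the underlying math, not the mergekit implementation; the function name, the epsilon threshold, and the lerp fallback are assumptions made for the example.

```python
import numpy as np

def slerp(t: float, w0: np.ndarray, w1: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    """Spherical linear interpolation between two weight tensors.

    t=0 returns w0, t=1 returns w1; intermediate t moves along the arc
    between the two weight directions rather than the straight line.
    """
    # Normalize to find the angle between the two weight directions.
    v0 = w0 / (np.linalg.norm(w0) + eps)
    v1 = w1 / (np.linalg.norm(w1) + eps)
    dot = float(np.clip(np.dot(v0.ravel(), v1.ravel()), -1.0, 1.0))
    omega = np.arccos(dot)  # angle between the two directions

    # Nearly parallel weights: fall back to ordinary linear interpolation.
    if np.sin(omega) < eps:
        return (1.0 - t) * w0 + t * w1

    so = np.sin(omega)
    return (np.sin((1.0 - t) * omega) / so) * w0 + (np.sin(t * omega) / so) * w1
```

With t=0 the result matches one parent and with t=1 the other, so per-tensor t values let the merge favor one parent for the attention weights and the other for the MLP weights.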

Usage

This model is suitable for general text generation tasks, leveraging the combined strengths of its merged components. Developers can load it with the standard Hugging Face transformers pipeline, using the bfloat16 data type for efficient inference, as in the sketch below.
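
A minimal loading sketch based on the description above; the prompt and generation settings are illustrative assumptions, not recommendations from the model author.

```python
import torch
from transformers import pipeline

# Load the merged model in bfloat16 via the standard text-generation pipeline.
generator = pipeline(
    "text-generation",
    model="Gille/StrangeMerges_7-7B-slerp",
    torch_dtype=torch.bfloat16,
    device_map="auto",  # place weights on the available accelerator(s) automatically
)

# Illustrative prompt; max_new_tokens is an assumed value, not a recommended setting.
result = generator("Write a short note on model merging.", max_new_tokens=64)
print(result[0]["generated_text"])
```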