Overview
TunyTrinh/test_mistral_03 is a 7-billion-parameter language model developed by TunyTrinh. It was created with the SLERP merge method from mergekit, combining two pre-trained models: minhtt/vistral-7b-chat and EmbeddedLLM/Mistral-7B-Merge-14-v0.3.
Merge Details
The merge covers layers 0 through 32 of both source models, i.e. the full Mistral-7B layer stack. The configuration sets embed_slerp: true and applies different t interpolation schedules to the self-attention and MLP layers, so each component is blended with its own weighting. The merged weights were produced in bfloat16.
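Based on those details, the mergekit configuration likely resembled the sketch below. The t schedules and the choice of base model are illustrative assumptions; only the layer range, embed_slerp, merge method, and dtype are stated on the card:

```yaml
slices:
  - sources:
      - model: minhtt/vistral-7b-chat
        layer_range: [0, 32]
      - model: EmbeddedLLM/Mistral-7B-Merge-14-v0.3
        layer_range: [0, 32]
merge_method: slerp
base_model: minhtt/vistral-7b-chat   # assumed; the card does not name the base
parameters:
  embed_slerp: true
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]   # illustrative per-layer schedule
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]   # illustrative per-layer schedule
    - value: 0.5                     # default for remaining tensors
dtype: bfloat16
```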
Key Characteristics
- Architecture: A merged model based on Mistral-7B variants.
- Parameter Count: 7 billion parameters.
- Context Length: Supports a context window of 4096 tokens.
- Merge Method: SLERP (Spherical Linear Interpolation), which blends weights along an arc on the hypersphere rather than a straight line, preserving weight magnitudes better than plain linear averaging.
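To make the merge method concrete, here is a minimal NumPy sketch of SLERP applied to a pair of weight vectors. This is an illustration of the underlying math, not mergekit's actual implementation:

```python
import numpy as np

def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two flattened weight vectors."""
    v0 = np.asarray(v0, dtype=np.float64)
    v1 = np.asarray(v1, dtype=np.float64)
    # Normalize copies to measure the angle between the two weight directions.
    u0 = v0 / (np.linalg.norm(v0) + eps)
    u1 = v1 / (np.linalg.norm(v1) + eps)
    dot = np.clip(np.dot(u0, u1), -1.0, 1.0)
    # Nearly parallel vectors: fall back to plain linear interpolation.
    if abs(dot) > 0.9995:
        return (1 - t) * v0 + t * v1
    theta = np.arccos(dot)        # angle between the two vectors
    sin_theta = np.sin(theta)
    # Interpolate along the great-circle arc instead of the straight chord.
    return (np.sin((1 - t) * theta) / sin_theta) * v0 + \
           (np.sin(t * theta) / sin_theta) * v1

a = np.array([1.0, 0.0])
b = np.array([0.0, 1.0])
mid = slerp(0.5, a, b)
# The SLERP midpoint of two unit vectors stays on the unit sphere (norm 1),
# whereas the linear-interpolation midpoint would have norm ~0.707.
print(np.linalg.norm(mid))
```

The varying t values mentioned above simply mean that different layers (self-attention vs MLP) are placed at different points along this arc between the two source models.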
Potential Use Cases
Given its merged nature, this model is likely suitable for a range of general-purpose language tasks, potentially inheriting and combining the strengths of its constituent models. Developers can explore its capabilities for:
- Text generation
- Chatbot applications
- Instruction following (depending on the instruction-tuning of its base models)
- Further fine-tuning for specific downstream applications