allknowingroger/NexusMistral2-7B-slerp

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kLicense:apache-2.0Architecture:Transformer Open Weights Cold

The allknowingroger/NexusMistral2-7B-slerp is a 7 billion parameter language model created by allknowingroger, formed by merging Nexusflow/Starling-LM-7B-beta and mistralai/Mistral-7B-Instruct-v0.2 using a slerp merge method. This model combines the strengths of its base components, leveraging the Mistral architecture. It is designed for general text generation tasks, benefiting from the instruction-following capabilities of Mistral and the performance of Starling-LM.

Loading preview...

NexusMistral2-7B-slerp Overview

NexusMistral2-7B-slerp is a 7 billion parameter language model developed by allknowingroger. It is a product of merging two distinct base models: Nexusflow/Starling-LM-7B-beta and mistralai/Mistral-7B-Instruct-v0.2. This merge was performed using the slerp (spherical linear interpolation) method, a technique often employed to combine the weights of different models to achieve a blend of their characteristics.

Key Characteristics

  • Merged Architecture: Combines the robust Mistral architecture with the performance enhancements of Starling-LM.
  • Slerp Merge Method: Utilizes a specific merging technique to balance the contributions of the source models across different layers, with varying interpolation values for self-attention and MLP layers.
  • Base Model: The merge is anchored on mistralai/Mistral-7B-Instruct-v0.2, suggesting a strong foundation in instruction-following and general language understanding.

Good For

  • General Text Generation: Suitable for a wide range of applications requiring coherent and contextually relevant text output.
  • Instruction Following: Benefits from the instruction-tuned nature of its base Mistral model, making it effective for prompt-based tasks.
  • Experimentation: Ideal for developers looking to explore the capabilities of merged models and the synergy between different foundational LLMs.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p