automerger/T3qm7xNeuralsirkrishna-7B

Text Generation | Concurrency Cost: 1 | Model Size: 7B | Quant: FP8 | Ctx Length: 8k | Published: Mar 26, 2024 | License: apache-2.0 | Architecture: Transformer | Open Weights | Cold

T3qm7xNeuralsirkrishna-7B is a 7-billion-parameter language model created by Maxime Labonne through an automated merge of nlpguy/T3QM7X and Kukedlc/NeuralSirKrishna-7b. The merge applies slerp across all 32 layers of the two source models. With an 8192-token context length, the model targets general text generation tasks.


Model Overview

T3qm7xNeuralsirkrishna-7B is a 7-billion-parameter language model developed by Maxime Labonne. It is an automated merge of two models: nlpguy/T3QM7X and Kukedlc/NeuralSirKrishna-7b. The merge uses slerp (spherical linear interpolation), which blends the corresponding weights of the two source models across their 32 layers by interpolating along the arc between them rather than along a straight line, producing a single unified model.
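
To make the slerp step concrete, here is a minimal sketch of spherical linear interpolation between two weight vectors, written in plain Python. It illustrates the general formula only; the function name `slerp`, the `eps` threshold, and the fallback to ordinary linear interpolation for near-parallel vectors are assumptions for this example and are not taken from the tooling used to build this model.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherically interpolate between vectors v0 and v1 with factor t in [0, 1].

    t = 0 returns v0, t = 1 returns v1. Falls back to plain linear
    interpolation when the vectors are (nearly) parallel or one is zero,
    since the spherical formula divides by sin(theta).
    """
    def lerp():
        return [(1.0 - t) * a + t * b for a, b in zip(v0, v1)]

    norm0 = math.sqrt(sum(a * a for a in v0))
    norm1 = math.sqrt(sum(b * b for b in v1))
    if norm0 < eps or norm1 < eps:
        return lerp()

    # Cosine of the angle between the two vectors, clamped for acos safety.
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    dot = max(-1.0, min(1.0, dot))
    theta = math.acos(dot)
    sin_theta = math.sin(theta)
    if sin_theta < eps:
        return lerp()

    # Weights that trace the arc between v0 and v1.
    w0 = math.sin((1.0 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]
```

In a real merge the same operation would be applied tensor by tensor to the flattened weights of the two source models; this sketch uses short Python lists only to keep the arithmetic visible.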

Key Characteristics

  • Architecture: A composite model derived from nlpguy/T3QM7X and Kukedlc/NeuralSirKrishna-7b.
  • Merge Method: Utilizes slerp for weight interpolation, with specific parameter adjustments for self-attention and MLP layers.
  • Parameter Count: 7 billion parameters, offering a balance between performance and computational efficiency.
  • Context Length: Supports an 8192-token context window, suitable for handling moderately long inputs and generating coherent responses.
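
The per-component adjustments mentioned under Merge Method can be pictured as a schedule that assigns each of the 32 layers its own interpolation factor, with separate schedules for self-attention and MLP weights. The sketch below shows one way such a schedule might be computed. The anchor values, the `SCHEDULES` table, and the `layer_t` helper are hypothetical and chosen for illustration; the actual values used for this merge are not given here.

```python
NUM_LAYERS = 32  # layer count stated for this model

# Hypothetical anchor values: each list is stretched across all layers.
# These numbers are illustrative only, not the real merge configuration.
SCHEDULES = {
    "self_attn": [0.0, 0.5, 0.3, 0.7, 1.0],
    "mlp": [1.0, 0.5, 0.7, 0.3, 0.0],
}


def layer_t(layer_idx, anchors, num_layers=NUM_LAYERS):
    """Return the interpolation factor for one layer.

    The anchors are spread evenly from the first to the last layer and
    intermediate layers are filled in by piecewise linear interpolation.
    """
    if len(anchors) == 1 or num_layers == 1:
        return anchors[0]
    # Position of this layer on the anchor axis, from 0 to len(anchors) - 1.
    pos = layer_idx / (num_layers - 1) * (len(anchors) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return anchors[lo] * (1.0 - frac) + anchors[hi] * frac


if __name__ == "__main__":
    for name, anchors in SCHEDULES.items():
        factors = [round(layer_t(i, anchors), 3) for i in range(NUM_LAYERS)]
        print(name, factors)
```

With a schedule like this, a factor of 0 keeps a layer's weights from the first source model, 1 takes them from the second, and values in between blend the two.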

Intended Use Cases

This model is suitable for a variety of general text generation tasks, benefiting from the combined capabilities of its constituent models. Developers can integrate it into applications requiring:

  • Conversational AI: Generating human-like responses in chatbots.
  • Content Creation: Assisting with writing articles, summaries, or creative text.
  • Code Generation: Not explicitly optimized for code, though the base models may contribute some ability on code-related tasks.

Its automated merge process aims to leverage the strengths of both nlpguy/T3QM7X and Kukedlc/NeuralSirKrishna-7b to provide a versatile language model.
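
For local experimentation, a minimal sketch of loading the model with the Hugging Face transformers library might look like the following. It assumes the weights are available under the repository name shown at the top of this page, that `transformers`, `torch`, and `accelerate` are installed, and that enough memory is available for a 7B model. The sampling values and the `generation_kwargs` helper are illustrative assumptions, not recommended settings.

```python
MODEL_ID = "automerger/T3qm7xNeuralsirkrishna-7B"
MAX_CONTEXT = 8192  # context length stated for this model


def generation_kwargs(prompt_tokens, max_new_tokens=256):
    """Build generate() arguments so prompt plus output fit in the context window."""
    budget = MAX_CONTEXT - prompt_tokens
    if budget <= 0:
        raise ValueError("prompt already fills the context window")
    return {
        "max_new_tokens": min(max_new_tokens, budget),
        # Illustrative sampling values, not tuned for this model.
        "do_sample": True,
        "temperature": 0.7,
        "top_p": 0.95,
    }


def generate(prompt):
    """Load the model and produce one completion for the given prompt."""
    # Imported here so the helper above can be used without these packages.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # device_map="auto" relies on the accelerate package being installed.
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype="auto", device_map="auto"
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    kwargs = generation_kwargs(inputs["input_ids"].shape[1])
    output = model.generate(**inputs, **kwargs)
    return tokenizer.decode(output[0], skip_special_tokens=True)


if __name__ == "__main__":
    print(generate("Write a short summary of what model merging is."))
```

Clamping `max_new_tokens` against the remaining budget keeps a long prompt from pushing generation past the 8192-token window.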
