MaziyarPanahi/vigostral-7b-chat-Mistral-7B-Instruct-v0.1

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kTool Calling:SupportedPublished:Jan 16, 2024License:apache-2.0Architecture:Transformer Open Weights Cold

MaziyarPanahi/vigostral-7b-chat-Mistral-7B-Instruct-v0.1 is a 7 billion parameter language model, created by MaziyarPanahi, built upon the Mistral-7B-Instruct-v0.1 architecture with a 4096 token context length. This model is a merge of Mistral-7B-Instruct-v0.1 and bofenghuang/vigostral-7b-chat, utilizing a slerp merge method. It is designed for chat-based applications, leveraging the combined strengths of its base models for improved conversational performance.

Loading preview...

Model Overview

The vigostral-7b-chat-Mistral-7B-Instruct-v0.1 model is a 7 billion parameter language model developed by MaziyarPanahi. It is a product of merging two distinct models: mistralai/Mistral-7B-Instruct-v0.1 and bofenghuang/vigostral-7b-chat. This merge was performed using the slerp (spherical linear interpolation) method, aiming to combine the strengths of both base models.

Key Characteristics

  • Architecture: Based on the robust Mistral-7B-Instruct-v0.1 framework.
  • Parameter Count: 7 billion parameters, offering a balance between performance and computational efficiency.
  • Context Length: Supports a context window of 4096 tokens.
  • Merge Method: Employs a slerp merge, with specific parameter weighting applied to self-attention and MLP layers, indicating a fine-tuned approach to integrating the base models.

Intended Use Cases

This model is primarily designed for chat-based applications and conversational AI. Its foundation on instruction-tuned models suggests suitability for tasks requiring:

  • Interactive dialogue generation.
  • Responding to user prompts in a conversational manner.
  • General-purpose text generation where a chat-optimized response is desired.