mistralai/Mistral-7B-Instruct-v0.1

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Sep 27, 2023License:apache-2.0Architecture:Transformer1.8K Open Weights Warm

Mistral-7B-Instruct-v0.1 is a 7 billion parameter instruction-tuned large language model developed by Mistral AI. It is a fine-tuned version of the Mistral-7B-v0.1 generative text model, utilizing publicly available conversation datasets. This model is designed for instruction-following tasks, leveraging Grouped-Query Attention and Sliding-Window Attention for efficient processing. It is particularly well-suited for generating conversational responses based on user prompts.

Loading preview...

Overview

Mistral-7B-Instruct-v0.1 is a 7 billion parameter instruction-tuned large language model from Mistral AI. It is built upon the base Mistral-7B-v0.1 model and has been fine-tuned using a diverse collection of publicly available conversation datasets. The model incorporates advanced architectural features such as Grouped-Query Attention and Sliding-Window Attention to enhance performance and efficiency.

Key Capabilities

  • Instruction Following: Designed to accurately interpret and respond to user instructions, making it suitable for chat and conversational AI applications.
  • Efficient Architecture: Leverages Grouped-Query Attention and Sliding-Window Attention, which contribute to its performance characteristics.
  • Chat Template Support: Compatible with Hugging Face's apply_chat_template() for easy integration into conversational pipelines, ensuring correct prompt formatting with [INST] and [/INST] tokens.

Good For

  • Instruction-based Generation: Ideal for tasks requiring the model to follow specific commands or answer questions in a conversational style.
  • Prototyping Conversational AI: Provides a strong foundation for developing chatbots and interactive agents due to its instruction-tuned nature.
  • Research and Development: Offers a robust base for further fine-tuning or experimentation with its efficient transformer architecture.

Limitations

As an initial demonstration of fine-tuning capabilities, Mistral-7B-Instruct-v0.1 currently lacks built-in moderation mechanisms. Users should be aware of this when deploying the model in environments that require moderated outputs.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p