Name: kerolos1/Mistral-7B-Instruct-v0.1-Full-Final API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: kerolos1

Model Overview

kerolos1/Mistral-7B-Instruct-v0.1-Full-Final is an instruction-tuned variant of the Mistral-7B-v0.1 base model, developed by the Mistral AI Team. This 7 billion parameter model is designed for instruction-following tasks, leveraging fine-tuning on various publicly available conversation datasets.

Key Architectural Features

This model incorporates advanced architectural choices from its base model, Mistral-7B-v0.1, to enhance performance and efficiency:

Grouped-Query Attention (GQA): Improves inference speed and reduces memory footprint.
Sliding-Window Attention (SWA): Optimizes handling of longer sequences by limiting attention to a fixed-size window, enabling a 4096-token context length.
Byte-fallback BPE tokenizer: Provides robust tokenization across diverse text inputs.

Instruction Format

To effectively utilize the instruction fine-tuning, prompts should be enclosed within [INST] and [/INST] tokens. The first instruction requires a begin-of-sentence ID. This format is compatible with Hugging Face's apply_chat_template() method for easy integration.

Limitations

As a quick demonstration of fine-tuning, the Mistral 7B Instruct model currently lacks built-in moderation mechanisms. The developers are actively seeking community engagement to implement guardrails for moderated outputs, making it suitable for deployment in sensitive environments.

Overview

Model Overview

Key Architectural Features

Instruction Format

Limitations

Full Model Card (README)