Name: lachiewyoung/mistral-7b-instruct-1.58bit API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: lachiewyoung

Model Overview

The lachiewyoung/mistral-7b-instruct-1.58bit model is an instruction-tuned variant of the Mistral-7B-v0.1 Large Language Model, developed by Mistral AI. This version, Mistral-7B-Instruct-v0.2, offers improved instruction-following capabilities compared to its predecessor. It is built upon a transformer architecture incorporating advanced features like Grouped-Query Attention and Sliding-Window Attention, which contribute to its efficiency and performance.

Key Capabilities

Instruction Following: Optimized for understanding and responding to user instructions, making it suitable for conversational AI and task-oriented applications.
Efficient Architecture: Utilizes Grouped-Query Attention and Sliding-Window Attention for enhanced processing of sequences.
Chat Template Support: Integrates with Hugging Face's apply_chat_template() for easy formatting of multi-turn conversations using [INST] and [/INST] tokens.

Usage and Limitations

This model is a demonstration of the base model's fine-tuning potential. Users should be aware that it currently lacks built-in moderation mechanisms. The developers are actively seeking community engagement to implement guardrails for safer deployment in moderated environments. For detailed technical insights, users can refer to the original paper and release blog post by Mistral AI.

Overview

Model Overview

Key Capabilities

Usage and Limitations

Full Model Card (README)