lachiewyoung/mistral-7b-instruct-1.58bit

  • Task: Text Generation
  • Concurrency Cost: 1
  • Model Size: 7B
  • Quant: FP8
  • Ctx Length: 4k
  • Tool Calling: Supported
  • Published: Mar 4, 2024
  • License: apache-2.0
  • Architecture: Transformer
  • Open Weights

The lachiewyoung/mistral-7b-instruct-1.58bit model is an instruction-tuned, 7-billion-parameter large language model based on Mistral-7B-v0.1, the base model developed by Mistral AI. It uses Grouped-Query Attention, Sliding-Window Attention, and a byte-fallback BPE tokenizer. It is designed for instruction-following tasks and is an improved instruct fine-tune of its predecessor.

Model Overview

The lachiewyoung/mistral-7b-instruct-1.58bit model is an instruction-tuned variant of Mistral-7B-v0.1, the base Large Language Model developed by Mistral AI. This version, Mistral-7B-Instruct-v0.2, follows instructions better than its predecessor, Mistral-7B-Instruct-v0.1. It is built on a transformer architecture that uses Grouped-Query Attention, where several query heads share one key/value head, and Sliding-Window Attention, where each token attends only to a fixed number of preceding tokens. Both reduce the memory and compute cost of attention.
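The two attention features named above can be illustrated with a small, dependency-free sketch. It is not the model's implementation: the function names are hypothetical, the boundary convention for the window (whether a token sees `window` or `window + 1` positions) differs between implementations, and the head counts in the usage lines (32 query heads, 8 key/value heads) are the values commonly reported for Mistral 7B, treated here as an assumption.

```python
def sliding_window_mask(seq_len, window):
    """Causal mask where token i may attend to token j only if
    j <= i (no looking ahead) and j > i - window (at most `window`
    positions, counting itself). Hypothetical helper for illustration."""
    return [
        [(j <= i) and (j > i - window) for j in range(seq_len)]
        for i in range(seq_len)
    ]


def kv_head_for_query_head(q_head, n_q_heads, n_kv_heads):
    """Grouped-Query Attention: consecutive query heads share one
    key/value head. Returns the index of the shared key/value head."""
    if n_q_heads % n_kv_heads != 0:
        raise ValueError("n_q_heads must be a multiple of n_kv_heads")
    group_size = n_q_heads // n_kv_heads
    return q_head // group_size


# Usage: a 5-token sequence with a window of 3 positions.
for row in sliding_window_mask(5, 3):
    print("".join("x" if allowed else "." for allowed in row))

# Usage: which key/value head serves query head 5 (assumed 32 query / 8 KV heads)?
print(kv_head_for_query_head(5, n_q_heads=32, n_kv_heads=8))
```

With grouped heads, the key/value cache holds entries for only the key/value heads rather than for every query head, which is where the memory saving comes from.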

Key Capabilities

  • Instruction Following: Optimized for understanding and responding to user instructions, making it suitable for conversational AI and task-oriented applications.
  • Efficient Architecture: Uses Grouped-Query Attention to shrink the key/value cache and speed up inference, and Sliding-Window Attention to process long sequences at lower cost.
  • Chat Template Support: Integrates with Hugging Face's apply_chat_template() for easy formatting of multi-turn conversations using [INST] and [/INST] tokens.
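As a rough illustration of the multi-turn format mentioned in the last bullet, the sketch below builds the prompt string by hand. The function name is hypothetical, and the exact spacing and the `<s>` / `</s>` special tokens are assumptions modelled on the Mistral instruct template; in practice the tokenizer's own `apply_chat_template()` is the authoritative source of the format.

```python
BOS, EOS = "<s>", "</s>"  # assumed begin/end-of-sequence tokens


def format_mistral_instruct(messages):
    """Approximate the [INST] ... [/INST] chat layout for a list of
    {"role": ..., "content": ...} dicts. Roles must alternate
    user/assistant, starting with user. Illustrative only."""
    parts = [BOS]
    for index, message in enumerate(messages):
        expected_role = "user" if index % 2 == 0 else "assistant"
        if message["role"] != expected_role:
            raise ValueError("roles must alternate user/assistant, starting with user")
        if message["role"] == "user":
            # User turns are wrapped in instruction markers.
            parts.append(f"[INST] {message['content']} [/INST]")
        else:
            # Assistant turns are followed by the end-of-sequence token.
            parts.append(f"{message['content']}{EOS}")
    return "".join(parts)


conversation = [
    {"role": "user", "content": "What is your favourite condiment?"},
    {"role": "assistant", "content": "Mayonnaise."},
    {"role": "user", "content": "Do you have a recipe?"},
]
print(format_mistral_instruct(conversation))
```

The prompt ends on an open `[/INST]` so that the model generates the next assistant turn.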

Usage and Limitations

This model demonstrates that the base model can be fine-tuned for instruction following. It has no built-in moderation mechanisms, so outputs should be filtered separately before use in moderated environments. The developers are seeking community engagement on ways to add guardrails for safer deployment. For technical details, refer to the original paper and release blog post by Mistral AI.