itstechuse/akeno-mergedv8
itstechuse/akeno-mergedv8 is a 7-billion-parameter instruct fine-tuned large language model based on the Mistral-7B-v0.2 architecture developed by Mistral AI. It features an expanded 32k-token context window and a modified Rope-theta for improved long-context performance. The model is optimized for following instructions and generating coherent text, making it suitable for a wide range of conversational AI and text-generation tasks.
Model Overview
The itstechuse/akeno-mergedv8 model is an instruct fine-tuned variant of the Mistral-7B-v0.2 Large Language Model, originally developed by Mistral AI. This 7 billion parameter model builds upon its predecessor with significant architectural improvements.
Key Enhancements
- Expanded Context Window: Features a 32k token context window, a substantial increase from the 8k context in Mistral-7B-v0.1, allowing for processing longer inputs and generating more extensive outputs.
- Rope-theta Adjustment: Incorporates a Rope-theta = 1e6 modification.
- No Sliding-Window Attention: Unlike previous versions, this model does not utilize Sliding-Window Attention (both settings can be confirmed from the model configuration, as shown below).
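As a minimal sketch, assuming the repository publishes a standard Mistral-style config.json, these architectural parameters can be checked directly with Hugging Face Transformers:

```python
from transformers import AutoConfig

# Inspect the published configuration (assumes a standard Mistral-style
# config.json exists in the itstechuse/akeno-mergedv8 repository).
config = AutoConfig.from_pretrained("itstechuse/akeno-mergedv8")

print(config.max_position_embeddings)  # expected: 32768 (32k context window)
print(config.rope_theta)               # expected: 1000000.0 (Rope-theta = 1e6)
print(config.sliding_window)           # expected: None (no Sliding-Window Attention)
```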
Instruction Format
To leverage its instruction-tuned capabilities, prompts should be enclosed within [INST] and [/INST] tokens. The model is designed to follow this specific instruction format, which is also supported via Hugging Face's apply_chat_template() method for seamless integration.
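As a minimal sketch, assuming the repository ships a tokenizer with a Mistral-style chat template, the prompt can be built either manually with the [INST] tokens or via apply_chat_template():

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "itstechuse/akeno-mergedv8"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Option 1: build the prompt manually with [INST] ... [/INST] tokens.
prompt = "[INST] Summarize the benefits of a 32k context window. [/INST]"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(model.device)

# Option 2 (equivalent): let the tokenizer's chat template wrap the user turn.
messages = [{"role": "user", "content": "Summarize the benefits of a 32k context window."}]
input_ids = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)

# Generate a response and strip the prompt tokens from the decoded output.
outputs = model.generate(input_ids, max_new_tokens=256, do_sample=True)
print(tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

Both options produce the same token sequence; the chat template is the safer choice in application code because it keeps the formatting in sync with the tokenizer shipped in the repository.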
Limitations
As an instruct fine-tuned model, akeno-mergedv8 demonstrates the performance potential of its base model. It has no built-in moderation mechanisms, so users should add their own safeguards before deploying it in environments that require moderated outputs. For more detailed information, refer to the original paper and release blog post from Mistral AI.