Name: Xenon1/Xenon-4 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Xenon1

Overview

Xenon1/Xenon-4 is an instruction-tuned language model built upon the Mistral-7B-v0.1 architecture. It has been fine-tuned using the Ultrafeedback dataset and incorporates methodologies described in the Self-Rewarding Language Models paper. This approach aims to enhance the model's ability to follow instructions effectively and generate high-quality responses.

Key Architectural Features

This model inherits several advanced architectural choices from its Mistral-7B-v0.1 base, contributing to its efficiency and performance:

Grouped-Query Attention: Improves inference speed and reduces memory footprint.
Sliding-Window Attention: Enables handling longer contexts more efficiently by restricting attention to a local window.
Byte-fallback BPE tokenizer: Provides robust tokenization across diverse text inputs, including out-of-vocabulary words.

Instruction Format

Xenon-4 is designed to be used with a specific instruction format, leveraging [INST] and [/INST] tokens to delineate user prompts. This format is compatible with Hugging Face's apply_chat_template() method, simplifying integration into conversational applications. The model expects the first instruction to begin with a begin-of-sentence token and subsequent instructions to follow without it, with assistant generations ending with an end-of-sentence token.

Use Cases

This model is well-suited for applications requiring precise instruction following, such as:

Chatbots and conversational agents
Automated content generation based on specific prompts
Interactive AI systems

Overview

Overview

Key Architectural Features

Instruction Format

Use Cases

Full Model Card (README)