Xenon1/Xenon-2
Task: Text Generation
Model Size: 7B
Quantization: FP8
Context Length: 8k
Concurrency Cost: 1
Published: Feb 4, 2024
License: apache-2.0
Architecture: Transformer (open weights)

Xenon1/Xenon-2 is a 7-billion-parameter instruction-tuned causal language model based on the Mistral-7B-v0.1 architecture, which features Grouped-Query Attention and Sliding-Window Attention. It was fine-tuned on the UltraFeedback dataset using self-rewarding language model techniques, in which the model scores its own candidate responses to build preference pairs for further training. The model is designed for instruction-following tasks and leverages its 8192-token context length for conversational applications.
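Below is a minimal usage sketch with the Hugging Face transformers library, assuming the weights are published on the Hub under the repo id "Xenon1/Xenon-2" and that the repository ships a chat template, as is typical for Mistral-7B fine-tunes; the prompt and generation parameters are illustrative only.

```python
# Minimal inference sketch. Assumes the model is hosted on the Hugging Face
# Hub as "Xenon1/Xenon-2" (repo id taken from the page title, not verified)
# and that the tokenizer includes a chat template.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Xenon1/Xenon-2"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Format an instruction using the tokenizer's chat template.
messages = [
    {"role": "user", "content": "Summarize the benefits of sliding-window attention."}
]
input_ids = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)

# Generate a response; sampling parameters here are placeholders.
output_ids = model.generate(input_ids, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```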
