Xenon1/Zenith-7B-dpo
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Feb 14, 2024License:apache-2.0Architecture:Transformer Open Weights Cold
Zenith-7B-dpo by Xenon1 is a 7 billion parameter language model, fine-tuned from Mistral-7B-v0.1 using the Ultrafeedback dataset and techniques from the "Self-Rewarding Language Models" paper. It leverages Grouped-Query Attention and Sliding-Window Attention for efficient processing. This model is optimized for instruction-following tasks, providing enhanced conversational capabilities.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p