decem/Dionysus-Mistral-m3-v5
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Dec 30, 2023License:cc-by-4.0Architecture:Transformer Open Weights Cold
The decem/Dionysus-Mistral-m3-v5 is a 7 billion parameter language model developed by DECEM, fine-tuned using Supervised Fine-Tuning (SFT) on the Mistral architecture. This English-language model is designed for general language tasks, achieving an average score of 63.14 on the Open LLM Leaderboard, with notable performance in reasoning and common sense benchmarks. It is suitable for applications requiring robust language understanding and generation within an 8192 token context window.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p