arcee-ai/SuperNova-Medius

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:14.8BQuant:FP8Ctx Length:32kPublished:Oct 2, 2024License:apache-2.0Architecture:Transformer0.2K Open Weights Warm

Arcee-SuperNova-Medius is a 14.8 billion parameter language model developed by Arcee.ai, built on the Qwen2.5-14B-Instruct architecture. This model leverages a cross-architecture distillation pipeline, combining knowledge from Qwen2.5-72B-Instruct and Llama-3.1-405B-Instruct to achieve high-quality instruction-following and complex reasoning. It is optimized for business use cases like customer support, content creation, and technical assistance, offering advanced capabilities in a resource-efficient package.

Loading preview...

Arcee-SuperNova-Medius: A Distilled 14B Powerhouse

Arcee-SuperNova-Medius is a 14.8 billion parameter language model from Arcee.ai, based on the Qwen2.5-14B-Instruct architecture. Its unique strength comes from a sophisticated multi-teacher, cross-architecture distillation process, integrating knowledge from both the Qwen2.5-72B-Instruct and Llama-3.1-405B-Instruct models. This allows it to deliver high-quality instruction-following and complex reasoning in a mid-sized, resource-efficient format.

Key Capabilities & Features

  • Cross-Architecture Distillation: Combines the strengths of Qwen and Llama architectures through logit distillation and vocabulary adaptation.
  • Enhanced Reasoning: Excels in complex reasoning tasks (BBH) and instruction-following (IFEval), outperforming Qwen2.5-14B and SuperNova-Lite in benchmarks.
  • Resource-Efficient: Offers advanced capabilities suitable for deployment on smaller hardware configurations, making it a powerful yet efficient choice.
  • Specialized Fine-Tuning: Utilizes a custom dataset from EvolKit to ensure coherence, fluency, and context understanding.

Ideal Use Cases

  • Customer Support: Handles complex customer interactions with robust instruction-following.
  • Content Creation: Generates high-quality, coherent content across various domains.
  • Technical Assistance: Provides support for programming, technical documentation, and expert-level content.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p