arcee-ai/Llama-Spark

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Jul 26, 2024License:llama3Architecture:Transformer0.0K Cold

Llama-Spark is an 8 billion parameter conversational AI model developed by Arcee.ai, built upon the Llama-3.1-8B foundation. It is fine-tuned using the Tome Dataset and merged with Llama-3.1-8B-Instruct, excelling in natural and informative conversations. This model is designed to consistently deliver high performance in the 6-9B parameter range for conversational AI applications.

Loading preview...

Llama-Spark: A Conversational AI Model by Arcee.ai

Llama-Spark is an 8 billion parameter conversational AI model developed by Arcee.ai, designed to be a leading performer in its size class. It is built on the Llama-3.1-8B base model and further enhanced by fine-tuning with Arcee.ai's proprietary Tome Dataset, then merged with Llama-3.1-8B-Instruct. This combination results in a highly capable model for dialogue-based interactions.

Key Capabilities

  • Conversational Excellence: Optimized for engaging in natural, informative, and coherent conversations.
  • Strong Foundation: Leverages the robust architecture of Llama-3.1-8B.
  • Continuous Improvement: Arcee.ai is committed to updating and improving Spark as new base models become available to maintain its competitive edge.

Intended Uses

Llama-Spark is specifically designed for conversational AI applications, making it suitable for:

  • Chatbots
  • Virtual assistants
  • Dialogue systems

Performance Metrics

Evaluations on the Open LLM Leaderboard show an average score of 24.90, with notable results in IFEval (0-Shot) at 79.11 and MMLU-PRO (5-shot) at 30.23. Detailed results are available on the Open LLM Leaderboard.