Nexusflow/Athene-70B

Warm
Public
70B
FP8
8192
License: other
Hugging Face
Overview

Nexusflow/Athene-70B: A High-Performing Open Chat Model

Nexusflow/Athene-70B is a 70 billion parameter open-weights large language model developed by the Nexusflow Team. It is fine-tuned from Meta's Llama-3-70B-Instruct using Reinforcement Learning from Human Feedback (RLHF), enhancing its conversational capabilities.

Key Capabilities & Performance

Athene-70B demonstrates strong performance in chat-based scenarios, as evidenced by its high score on the Arena-Hard-Auto benchmark, a proxy for Chatbot Arena. It achieves 77.8% on Arena-Hard, positioning it competitively against proprietary models like Claude-3.5-Sonnet and GPT-4o, and significantly outperforming its base model, Llama-3-70B, and other open models like Gemma-2-27B.

Usage and Compatibility

The model utilizes the same chat template as Llama-3-70B-Instruct, ensuring straightforward integration for developers familiar with the Llama-3 ecosystem. It supports a context length of 8192 tokens, suitable for extended conversations. The model is released under the Nexusflow Research License.

Ideal Use Cases

  • Advanced Chatbots: Its strong performance on conversational benchmarks makes it well-suited for building highly capable and engaging chatbots.
  • Interactive AI Assistants: Can be deployed in applications requiring nuanced and context-aware dialogue generation.
  • Research and Development: Provides a robust open-weights foundation for further research in RLHF and large language model fine-tuning.