KaraKaraWitch/SteyrCannon-0.2-Qwen2.5-72b
Text generation · Concurrency cost: 4 · Model size: 72.7B · Quant: FP8 · Context length: 32K · Published: Nov 23, 2024 · Architecture: Transformer
KaraKaraWitch/SteyrCannon-0.2-Qwen2.5-72b is a 72.7-billion-parameter language model merge built on the Qwen2.5 architecture, with a 131,072-token context length. It was created with the TIES merge method, combining anthracite-org/magnum-v4-72b and EVA-UNIT-01/EVA-Qwen2.5-72B-v0.2, with the latter serving as the base model. The model is intended for general language tasks, leveraging the strengths of its merged components.
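For readers unfamiliar with TIES merging, the sketch below shows what such a merge recipe could look like, written from Python as a mergekit-style YAML config. The tool choice (mergekit), the density/weight values, and the dtype are assumptions for illustration; the card above does not publish the actual recipe.

```python
# Hypothetical sketch of a TIES merge recipe in the mergekit config style.
# Density/weight/dtype values are placeholders, not the model's actual settings.
import yaml  # pip install pyyaml

merge_config = {
    "merge_method": "ties",
    # EVA-Qwen2.5-72B-v0.2 serves as the base model, per the description above.
    "base_model": "EVA-UNIT-01/EVA-Qwen2.5-72B-v0.2",
    "models": [
        {
            "model": "anthracite-org/magnum-v4-72b",
            # density controls how much of this model's task vector survives TIES
            # trimming; weight controls how strongly it is mixed in (illustrative).
            "parameters": {"density": 0.5, "weight": 0.5},
        },
    ],
    "parameters": {"normalize": True},
    "dtype": "bfloat16",
}

with open("steyrcannon-ties.yaml", "w") as f:
    yaml.safe_dump(merge_config, f, sort_keys=False)

# The resulting file could then be passed to mergekit, e.g.:
#   mergekit-yaml steyrcannon-ties.yaml ./SteyrCannon-0.2-Qwen2.5-72b
```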
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. The exposed sampler parameters are listed here; a request example using them follows the list.
temperature
top_p
top_k: –
frequency_penalty: –
presence_penalty: –
repetition_penalty: –
min_p
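As a usage illustration, here is a minimal sketch of sending a request with these sampler settings through an OpenAI-compatible client. The base URL, API key variable, and all sampler values are assumptions rather than documented defaults; samplers outside the OpenAI spec (top_k, min_p, repetition_penalty) are forwarded via extra_body on the assumption that the backend accepts them.

```python
# Minimal sketch: querying the model through an OpenAI-compatible endpoint.
# The base_url, env var name, and every sampler value below are illustrative
# assumptions, not documented settings for this model.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",   # assumed OpenAI-compatible endpoint
    api_key=os.environ["FEATHERLESS_API_KEY"],  # hypothetical env var name
)

response = client.chat.completions.create(
    model="KaraKaraWitch/SteyrCannon-0.2-Qwen2.5-72b",
    messages=[{"role": "user", "content": "Write a short scene set on a night train."}],
    # Standard sampler parameters accepted directly by the client:
    temperature=0.9,
    top_p=0.95,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # Non-standard samplers are passed through in the request body:
    extra_body={"top_k": 40, "min_p": 0.05, "repetition_penalty": 1.05},
)

print(response.choices[0].message.content)
```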