ajtaltarabukin2022/merged_champion_v5_m3
ajtaltarabukin2022/merged_champion_v5_m3 is a 32-billion-parameter language model created by ajtaltarabukin2022 by merging multiple pre-trained models with the TIES method. It combines one base model with three additional affine models, drawing on their respective strengths, and is intended for general language tasks that benefit from the collective knowledge of its constituent models.
Model Overview
The merged_champion_v5_m3 is a 32-billion-parameter language model developed by ajtaltarabukin2022. It was created using the TIES (TrIm, Elect Sign & Merge) merge method, a technique designed to combine the strengths of several pre-trained language models into a single, more capable model. This approach allows the diverse knowledge and capabilities of the constituent models to be integrated into one set of weights.
Merge Details
The model's architecture is built upon dura-lori/affine-5ED5dwT4fztHjgjyR6vXpbGfnooeuWfr3VueaZrrfWJSou7y as its base. Three additional affine models were merged into this base:
- voidai001/affine-rl0-5HeJuQB4ZcVaU8yfgwYCm3AvdiA7dPA34nvB5HwSubVoFREm
- chouchouM/Affine-5DhGPvYiBChDerVjSgyt1vuuwQyZWJJgsEdQHAkXRuSYji4d
- dura-lori/affine-5CtqFaxMkR1rZfP3cWiW6ywTszxd6dKqFoPtKdLQzMkT1kCf
The merge assigned a specific weight to each contributing model: the base model received the highest weight (0.35), and the three merged models contributed with weights of 0.3, 0.2, and 0.15 respectively. The configuration also specified bfloat16 as the data type and a density of 0.7, meaning roughly the top 70% of each model's parameter differences (by magnitude) are retained before merging. A simplified sketch of this procedure is shown below.
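For illustration, the following is a minimal TIES-style merge applied to a single tensor in PyTorch, using the stated density of 0.7, bfloat16 output, and the 0.3/0.2/0.15 weights for the three merged models. It is a simplified approximation of the method, not the exact tooling or configuration used to build this model; the function name and toy tensors are hypothetical, and exactly how the base model's 0.35 weight enters the real pipeline depends on the merge tool's conventions.

```python
# TIES-style merge sketch (illustrative only): trim each task vector by
# magnitude, elect a per-parameter sign, then merge the agreeing deltas.
import torch

def ties_merge_tensor(base: torch.Tensor,
                      finetuned: list[torch.Tensor],
                      weights: list[float],
                      density: float = 0.7) -> torch.Tensor:
    trimmed = []
    for ft, w in zip(finetuned, weights):
        delta = (ft - base).float()          # task vector for this model
        # Trim: keep only the top `density` fraction of entries by magnitude.
        k = max(1, int(density * delta.numel()))
        threshold = delta.abs().flatten().kthvalue(delta.numel() - k + 1).values
        delta = torch.where(delta.abs() >= threshold, delta, torch.zeros_like(delta))
        trimmed.append(w * delta)            # apply the per-model weight

    stacked = torch.stack(trimmed)
    # Elect sign: per parameter, keep the sign favoured by the weighted deltas.
    elected = torch.sign(stacked.sum(dim=0))
    agree = (torch.sign(stacked) == elected) & (stacked != 0)
    # Merge: combine only deltas that agree with the elected sign,
    # normalised by the total weight of the agreeing models.
    w_tensor = torch.tensor(weights).view(-1, *([1] * base.dim()))
    merged = torch.where(agree, stacked, torch.zeros_like(stacked)).sum(dim=0)
    denom = (agree.float() * w_tensor).sum(dim=0).clamp(min=1e-8)
    return (base.float() + merged / denom).to(torch.bfloat16)

# Toy example: the base contributes through the `base` tensor itself, and the
# three merged models use the stated weights of 0.3, 0.2, and 0.15.
base = torch.randn(4, 4)
models = [base + 0.1 * torch.randn(4, 4) for _ in range(3)]
merged = ties_merge_tensor(base, models, weights=[0.3, 0.2, 0.15], density=0.7)
```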
Key Characteristics
This model's primary differentiator lies in its merged nature, combining the learned representations of multiple specialized or generally capable models. This can lead to a more robust and versatile model compared to a single-source pre-trained model. With a context length of 32768 tokens, it can process extensive inputs, making it suitable for tasks requiring broad contextual understanding.
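If the repository hosts standard transformers-compatible weights and a tokenizer (an assumption, since the card does not state the inference stack), the model could be loaded and queried roughly as follows; the prompt and generation settings are illustrative only.

```python
# Hypothetical usage sketch: assumes transformers-format weights and a
# tokenizer are available in the repository, which this card does not confirm.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ajtaltarabukin2022/merged_champion_v5_m3"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,   # matches the merge's stated data type
    device_map="auto",
)

prompt = "Summarize the TIES merge method in two sentences."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```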