Name: saishf/Merge-Mayhem-L3-V2.1 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: saishf

Model Overview

saishf/Merge-Mayhem-L3-V2.1 is an 8 billion parameter language model developed by saishf. It was created using the Model Stock merge method, a technique described in the paper Model Stock, which combines multiple pre-trained language models to leverage their individual strengths.

Merge Details

This model uses openlynn/Llama-3-Soliloquy-8B-v2 as its base. It integrates components from several Llama-3-8B-Instruct derivatives, specifically those fine-tuned by ResplendentAI. The merged models include:

meta-llama/Meta-Llama-3-8B-Instruct + ResplendentAI/Luna_Llama3
meta-llama/Meta-Llama-3-8B-Instruct + ResplendentAI/RP_Format_QuoteAsterisk_Llama3
meta-llama/Meta-Llama-3-8B-Instruct + ResplendentAI/Smarts_Llama3
meta-llama/Meta-Llama-3-8B-Instruct + ResplendentAI/BlueMoon_Llama3
meta-llama/Meta-Llama-3-8B-Instruct + ResplendentAI/Aura_Llama3

Purpose

The merge aims to combine the diverse capabilities present in these specialized Llama-3 variants, potentially enhancing overall performance and versatility for generative tasks. The creator notes that this is a rerun of a previous version, with an updated configuration and base model, to explore potential improvements in the merged outcome.