andrijdavid/Macaroni-v2-7b

Text Generation · Concurrency Cost: 1 · Model Size: 7B · Quant: FP8 · Ctx Length: 4k · Published: Feb 5, 2024 · License: apache-2.0 · Architecture: Transformer · Open Weights · Cold

Macaroni-v2-7b by andrijdavid is a 7 billion parameter language model created by merging flemmingmiguel/MBX-7B-v3, mlabonne/OmniBeagle-7B, and vanillaOVO/supermario_v4 using the DARE TIES method, with mistralai/Mistral-7B-v0.1 as its base. This model leverages the strengths of its constituent models to offer a versatile foundation for various natural language processing tasks. Its 4096-token context length supports moderate-length interactions and text generation.

Overview

Macaroni-v2-7b is a 7 billion parameter language model developed by andrijdavid. It was created using the DARE TIES merge method, combining several pre-trained models with mistralai/Mistral-7B-v0.1 serving as the base architecture. This merging technique aims to synthesize the capabilities of its component models into a single, more robust model.
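Assuming the weights are published on the Hugging Face Hub under andrijdavid/Macaroni-v2-7b (an assumption based on the naming convention used here), the model should load with the standard transformers text-generation workflow. The snippet below is an illustrative sketch, not an official usage example for this model.

```python
# Hypothetical usage sketch: loading Macaroni-v2-7b with Hugging Face transformers.
# Assumes the repository id "andrijdavid/Macaroni-v2-7b" and fp16 weights; adjust as needed.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "andrijdavid/Macaroni-v2-7b"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,   # the merge configuration used a float16 dtype
    device_map="auto",
)

prompt = "Explain what a model merge is in one paragraph."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Keep the prompt plus generated tokens within the 4096-token context window.
outputs = model.generate(inputs.input_ids, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```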

Merge Details

This model is a product of merging three distinct models:

  • flemmingmiguel/MBX-7B-v3
  • mlabonne/OmniBeagle-7B
  • vanillaOVO/supermario_v4
The merge was performed with the DARE TIES method, which combines DARE (Drop And REscale) pruning of task vectors with TIES (Trim, Elect Sign, and Merge) conflict resolution, and is designed to combine models effectively while preserving their individual strengths. The configuration specified per-model density and weight parameters, along with int8_mask and normalize settings, and a float16 dtype.
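For intuition, the sketch below outlines how a DARE TIES merge operates on a single weight tensor: each fine-tuned model's delta from the base is randomly dropped and rescaled (DARE), a sign is elected per parameter, and only deltas agreeing with the elected sign are averaged back into the base (TIES). The function name, density/weight values, and toy tensors are illustrative assumptions, not the actual configuration used for this model.

```python
# Illustrative sketch of a DARE TIES merge on one tensor; the values here are
# made up and do not reflect the real Macaroni-v2-7b merge configuration.
import torch

def dare_ties_merge(base, finetuned_list, densities, weights):
    """Merge fine-tuned tensors into `base` using DARE (drop-and-rescale)
    followed by TIES-style sign election and averaging."""
    deltas = []
    for ft, density, w in zip(finetuned_list, densities, weights):
        delta = ft - base
        # DARE: randomly keep a `density` fraction of delta entries, rescale survivors.
        mask = torch.bernoulli(torch.full_like(delta, density))
        delta = delta * mask / density
        deltas.append(w * delta)

    stacked = torch.stack(deltas)                   # shape: [num_models, ...]
    # TIES: elect a sign per parameter from the sign of the summed deltas.
    elected_sign = torch.sign(stacked.sum(dim=0))
    # Keep only delta entries whose sign matches the elected sign.
    agree = torch.sign(stacked) == elected_sign
    kept = stacked * agree
    # Average the surviving contributions (avoid division by zero).
    counts = agree.sum(dim=0).clamp(min=1)
    merged_delta = kept.sum(dim=0) / counts
    return base + merged_delta

# Toy example with random tensors standing in for real model weights.
base = torch.randn(4, 4)
finetuned = [base + 0.1 * torch.randn(4, 4) for _ in range(3)]
merged = dare_ties_merge(base, finetuned, densities=[0.5, 0.5, 0.5], weights=[1.0, 1.0, 1.0])
```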

Potential Use Cases

Given its merged nature, Macaroni-v2-7b is likely suitable for a range of general-purpose NLP applications where a 7B parameter model with a 4096-token context window is appropriate. Its design suggests balanced performance across tasks, drawing on the diverse training of its constituent models.