Fredithefish/MadMix-v0.2

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kTool Calling:SupportedPublished:Dec 12, 2023License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Fredithefish/MadMix-v0.2 is a 7 billion parameter language model based on the Mistral architecture, created by Fredithefish. This model is a ties merge of several high-quality open-source models, including OpenChat 3.5, MetaMath 7B, Zephyr 7B beta, OpenHermes-2.5, and Neural-chat 7b v3.1, using Mistral-7B v0.1 as its base. It is designed to combine the strengths of its constituent models, offering a versatile solution for various natural language processing tasks with a context length of 8192 tokens.

Loading preview...

MadMix-v0.2: A Merged Mistral-Based 7B Model

MadMix-v0.2 is a 7 billion parameter language model developed by Fredithefish, built upon the Mistral architecture. This model distinguishes itself by being a "ties merge" of several prominent open-source models, leveraging the mergekit repository to combine their capabilities. The primary goal of this merging strategy is to integrate the strengths of each component model into a single, more versatile entity.

Key Capabilities & Composition

MadMix-v0.2 incorporates the following high-quality 7B parameter models, with Mistral-7B v0.1 serving as the foundational base:

  • OpenChat 3.5: Known for its strong conversational abilities.
  • MetaMath 7B: Optimized for mathematical reasoning and problem-solving.
  • Zephyr 7B beta: A fine-tuned model often recognized for its instruction-following.
  • OpenHermes-2.5: Valued for its general-purpose instruction tuning.
  • Neural-chat 7b v3.1: Another strong contender in instruction-tuned models.

This unique blend aims to provide a model that performs robustly across a spectrum of tasks, from general conversation and instruction following to more specialized areas like mathematical reasoning. The model is fully commercially usable due to its Apache-2 license.

Good For

  • General-purpose applications: Benefiting from the combined strengths of multiple instruction-tuned models.
  • Tasks requiring diverse capabilities: Such as a mix of conversational AI, instruction following, and potentially mathematical problem-solving.
  • Commercial projects: Its Apache-2 license ensures full commercial usability.