CultriX/MonaTrix-v4

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Feb 20, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

MonaTrix-v4 by CultriX is a 7 billion parameter language model, created through a DARE TIES merge of NeuralMaxime-7B-slerp, ogno-monarch-jaskier-merge-7b, and dpo-binarized-NeutrixOmnibe-7B, based on Mistral-7B-v0.1. This model merge leverages specific weighting and density parameters to combine the strengths of its constituent models. It is designed for general text generation tasks, offering a balanced performance profile derived from its diverse merge components.

Loading preview...

MonaTrix-v4: A Merged 7B Language Model

MonaTrix-v4 is a 7 billion parameter language model developed by CultriX, built upon the Mistral-7B-v0.1 base architecture. This model is a sophisticated merge of three distinct models: Kukedlc/NeuralMaxime-7B-slerp, eren23/ogno-monarch-jaskier-merge-7b, and eren23/dpo-binarized-NeutrixOmnibe-7B. The merge was performed using the DARE TIES method via LazyMergekit, with specific weighting and density parameters applied to each component model to optimize their combined characteristics.

Key Capabilities

  • Blended Strengths: Integrates the capabilities of its three constituent models, aiming for a balanced performance across various natural language processing tasks.
  • Mistral-7B Foundation: Benefits from the robust and efficient architecture of the Mistral-7B-v0.1 base model.
  • Merge Configuration: Utilizes dare_ties merge method with int8_mask and bfloat16 dtype for efficient operation.

Good For

  • General Text Generation: Suitable for a wide range of applications requiring coherent and contextually relevant text output.
  • Experimentation with Merged Models: Provides a practical example of how different models can be combined to create a new, potentially more versatile, language model.
  • Developers: Offers a ready-to-use model for those looking to leverage a 7B parameter model with a unique blend of characteristics from its merged components.