Kquant03/NeuralTrix-7B-dpo-laser

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Feb 14, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

NeuralTrix-7B-dpo-laser is a 7 billion parameter language model developed by Kquant03, built upon a merge of OmniBeagle-7B, MBX-7B-v3, and AiMaven-Prometheus, and subsequently fine-tuned with DPO using the truthy-dpo-v0.1 dataset. This model leverages the Mistral-7B-v0.1 architecture as its base and is optimized for general language generation tasks. Its 8192-token context length supports processing moderately long inputs for various applications.

Loading preview...

NeuralTrix-7B-dpo-laser Overview

NeuralTrix-7B-dpo-laser is a 7 billion parameter language model created by Kquant03. It is a merged model, combining the strengths of several existing models: mlabonne/OmniBeagle-7B, flemmingmiguel/MBX-7B-v3, and AiMavenAi/AiMaven-Prometheus. The merging process utilized the dare_ties method, with mistralai/Mistral-7B-v0.1 serving as the base architecture.

Key Characteristics

  • Merged Architecture: Integrates components from three distinct 7B models to potentially enhance diverse capabilities.
  • DPO Fine-tuning: Further trained using Direct Preference Optimization (DPO) on the jondurbin/truthy-dpo-v0.1 dataset, which suggests an emphasis on generating truthful and aligned responses.
  • Base Model: Built upon the robust Mistral-7B-v0.1 foundation.
  • Configuration: The merge parameters indicate specific densities and weights applied to each contributing model, with int8_mask enabled and float16 dtype for efficiency.

Potential Use Cases

  • General Text Generation: Suitable for a wide range of language tasks due to its merged and DPO-tuned nature.
  • Applications requiring truthful responses: The DPO fine-tuning on a 'truthy' dataset implies a focus on factual accuracy and reduced hallucination.
  • Experimentation with merged models: Developers interested in exploring the performance characteristics of models created through merging techniques.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p