Kquant03/NeuralTrix-7B-dpo-laser
NeuralTrix-7B-dpo-laser is a 7 billion parameter language model developed by Kquant03, built from a merge of OmniBeagle-7B, MBX-7B-v3, and AiMaven-Prometheus, then fine-tuned with DPO on the truthy-dpo-v0.1 dataset. The model uses the Mistral-7B-v0.1 architecture as its base and is optimized for general language generation. Its 8192-token context window supports moderately long inputs.
NeuralTrix-7B-dpo-laser Overview
NeuralTrix-7B-dpo-laser is a 7 billion parameter language model created by Kquant03. It is a merged model, combining the strengths of several existing models: mlabonne/OmniBeagle-7B, flemmingmiguel/MBX-7B-v3, and AiMavenAi/AiMaven-Prometheus. The merging process utilized the dare_ties method, with mistralai/Mistral-7B-v0.1 serving as the base architecture.
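The merge recipe maps naturally onto mergekit's YAML schema. Below is a hedged reconstruction of what such a dare_ties configuration could look like; the density and weight values are illustrative placeholders, as the card excerpt does not state the actual numbers.

```python
# Hedged reconstruction of a dare_ties merge config in mergekit's YAML schema.
# Density/weight values are illustrative placeholders, not the model's actual settings.
import yaml  # pip install pyyaml

merge_config = {
    "models": [
        {"model": "mlabonne/OmniBeagle-7B",
         "parameters": {"density": 0.6, "weight": 0.4}},   # placeholder values
        {"model": "flemmingmiguel/MBX-7B-v3",
         "parameters": {"density": 0.6, "weight": 0.3}},   # placeholder values
        {"model": "AiMavenAi/AiMaven-Prometheus",
         "parameters": {"density": 0.6, "weight": 0.3}},   # placeholder values
    ],
    "merge_method": "dare_ties",
    "base_model": "mistralai/Mistral-7B-v0.1",
    "parameters": {"int8_mask": True},
    "dtype": "float16",
}

with open("neuraltrix_merge.yml", "w") as f:
    yaml.safe_dump(merge_config, f, sort_keys=False)
# Run the merge with mergekit's CLI: mergekit-yaml neuraltrix_merge.yml ./merged-model
```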
Key Characteristics
- Merged Architecture: Integrates components from three distinct 7B models, aiming to combine their complementary capabilities.
- DPO Fine-tuning: Further trained with Direct Preference Optimization (DPO) on the `jondurbin/truthy-dpo-v0.1` dataset, which suggests an emphasis on generating truthful, aligned responses (a training sketch follows this list).
- Base Model: Built on the robust `Mistral-7B-v0.1` foundation.
- Configuration: The merge applies specific densities and weights to each contributing model, with `int8_mask` enabled and `float16` dtype for efficiency.
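For the DPO stage, a minimal sketch using the TRL library is shown below. It assumes TRL's `DPOTrainer`/`DPOConfig` API and that `truthy-dpo-v0.1` ships the prompt/chosen/rejected columns DPO training expects; the checkpoint path and hyperparameters are placeholders, not the values used for this model.

```python
# Minimal DPO fine-tuning sketch with TRL (assumed API; hyperparameters are placeholders).
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

merged_id = "./merged-model"  # placeholder path to the pre-DPO merged checkpoint
model = AutoModelForCausalLM.from_pretrained(merged_id)
tokenizer = AutoTokenizer.from_pretrained(merged_id)

# truthy-dpo-v0.1 provides prompt/chosen/rejected columns, the format DPOTrainer expects.
dataset = load_dataset("jondurbin/truthy-dpo-v0.1", split="train")

args = DPOConfig(
    output_dir="neuraltrix-dpo",    # placeholder output directory
    beta=0.1,                       # placeholder preference-loss strength
    per_device_train_batch_size=1,
    num_train_epochs=1,
)
trainer = DPOTrainer(
    model=model,
    args=args,
    train_dataset=dataset,
    processing_class=tokenizer,     # called `tokenizer=` in older TRL releases
)
trainer.train()
```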
Potential Use Cases
- General Text Generation: Suitable for a wide range of language tasks due to its merged and DPO-tuned nature (a basic usage sketch follows this list).
- Applications requiring truthful responses: The DPO fine-tuning on a 'truthy' dataset implies a focus on factual accuracy and reduced hallucination.
- Experimentation with merged models: A reference point for developers exploring the performance characteristics of models produced through merging techniques.
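As a hedged illustration of basic usage, the snippet below loads the model with Hugging Face transformers as a standard Mistral-style causal LM; the generation settings are illustrative assumptions, not published defaults.

```python
# Basic inference sketch using Hugging Face transformers (standard Mistral-style usage).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Kquant03/NeuralTrix-7B-dpo-laser"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # matches the merge's float16 dtype
    device_map="auto",
)

prompt = "Explain what model merging is in one short paragraph."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=200, do_sample=True, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```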