Kukedlc/NeuTrixOmniBe-DPO is a 7 billion parameter language model developed by Kukedlc, created by merging CultriX/NeuralTrix-7B-dpo and paulml/OmniBeagleSquaredMBX-v3-7B-v2 using LazyMergekit, and subsequently fine-tuned with Direct Preference Optimization (DPO) on the Intel/orca_dpo_pairs dataset. This model achieves an average score of 76.17 on the Open LLM Leaderboard, demonstrating strong performance across various benchmarks including HellaSwag (89.03) and Winogrande (85.16). It is designed for general language understanding and generation tasks, leveraging its merged architecture and DPO training for improved response quality.
Loading preview...
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.