mlabonne/UltraMerge-7B

Text generation · Concurrency cost: 1 · Model size: 7B · Quant: FP8 · Context length: 8k · Published: Mar 21, 2024 · License: cc-by-nc-4.0 · Architecture: Transformer · Open weights

UltraMerge-7B is an experimental 7 billion parameter DPO fine-tune of automerger/YamShadow-7B, developed by mlabonne. This model is trained on a diverse set of DPO datasets including mlabonne/truthy-dpo-v0.1 and mlabonne/ultrafeedback-binarized-preferences-cleaned, making it suitable for general-purpose conversational AI tasks. It features an 8192 token context length, offering robust performance for applications requiring extended conversational memory.


UltraMerge-7B Overview

UltraMerge-7B is an experimental 7 billion parameter language model developed by mlabonne. It is a Direct Preference Optimization (DPO) fine-tune of the automerger/YamShadow-7B base model. This model leverages a combination of high-quality DPO datasets to enhance its conversational and instruction-following capabilities.

Key Capabilities

  • DPO Fine-tuning: Utilizes several DPO datasets, including mlabonne/truthy-dpo-v0.1, mlabonne/distilabel-intel-orca-dpo-pairs, mlabonne/chatml-OpenHermes2.5-dpo-binarized-alpha, and mlabonne/ultrafeedback-binarized-preferences-cleaned, to improve response quality and alignment.
  • Base Model: Built upon automerger/YamShadow-7B, providing a strong foundation for general language understanding and generation.
  • Context Length: Supports an 8192 token context window, allowing for more extensive and coherent interactions.
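Since one of the training datasets (mlabonne/chatml-OpenHermes2.5-dpo-binarized-alpha) is ChatML-formatted, prompts in ChatML style are a reasonable default. A minimal sketch of rendering a conversation into that format (the template choice is an assumption, not confirmed by the model card):

```python
def format_chatml(messages):
    """Render a list of {role, content} dicts into a ChatML prompt string."""
    parts = []
    for m in messages:
        parts.append(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>")
    # Leave the assistant turn open so the model continues from here.
    parts.append("<|im_start|>assistant\n")
    return "\n".join(parts)

prompt = format_chatml([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize DPO in one sentence."},
])
print(prompt)
```

In practice, `tokenizer.apply_chat_template` from the model's own tokenizer config is the safer option, since it applies whatever template the author shipped.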

Good For

  • General-purpose conversational AI: Its diverse DPO training makes it suitable for a wide range of chat and instruction-following applications.
  • Experimentation with DPO models: Ideal for researchers and developers interested in exploring the effects of DPO fine-tuning on a 7B parameter model.
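For readers experimenting with DPO, the core objective is compact enough to sketch directly: given summed log-probabilities of the chosen and rejected responses under the policy and a frozen reference model, the loss pushes the policy to widen the preference margin. The values below are illustrative, not taken from this model's training run:

```python
import math

def dpo_loss(logp_chosen, logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """Direct Preference Optimization loss for a single preference pair.

    Log-probabilities are summed over response tokens; beta controls how
    far the policy may drift from the reference model.
    """
    margin = beta * ((logp_chosen - ref_logp_chosen)
                     - (logp_rejected - ref_logp_rejected))
    # -log(sigmoid(margin)): small when the policy prefers the chosen
    # response more strongly than the reference does.
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# Policy favors the chosen response relative to the reference,
# so the loss drops below -log(0.5) ≈ 0.693.
loss = dpo_loss(-10.0, -14.0, -12.0, -12.0, beta=0.1)
```

Libraries such as TRL's `DPOTrainer` wrap this objective with batching and reference-model handling, which is likely closer to how this fine-tune was produced.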

Popular Sampler Settings

Featherless tracks the top three sampler configurations its users apply to this model. Each configuration sets the following parameters:

  • temperature
  • top_p
  • top_k
  • frequency_penalty
  • presence_penalty
  • repetition_penalty
  • min_p
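These parameters map directly onto the request body of an OpenAI-compatible completions endpoint (which Featherless exposes; `repetition_penalty` and `min_p` are server-side extensions accepted by many inference backends rather than official OpenAI fields). A sketch of assembling such a payload, with illustrative default values rather than the tracked top configs:

```python
def build_sampler_payload(prompt, **sampler):
    """Assemble a chat-completion request body carrying sampler settings.

    Keyword arguments override the illustrative defaults below.
    """
    defaults = {
        "temperature": 0.7,
        "top_p": 0.9,
        "top_k": 40,
        "frequency_penalty": 0.0,
        "presence_penalty": 0.0,
        "repetition_penalty": 1.1,
        "min_p": 0.05,
    }
    defaults.update(sampler)
    return {
        "model": "mlabonne/UltraMerge-7B",
        "messages": [{"role": "user", "content": prompt}],
        **defaults,
    }

payload = build_sampler_payload("Hello!", temperature=1.0)
```

Sending the payload is then a single POST to the provider's `/chat/completions` route with an API key.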