dddsaty/SOLAR_Merge_Adapter_DPO_Orca
Hosted on Hugging Face · Text Generation
Model Size: 10.7B · Quant: FP8 · Context Length: 4k · Concurrency Cost: 1
Published: Feb 5, 2024 · License: cc-by-nc-4.0 · Architecture: Transformer (Open Weights)

The dddsaty/SOLAR_Merge_Adapter_DPO_Orca is a 10.7 billion parameter language model created by dddsaty, built by merging two base SOLAR models and applying DPO fine-tuning. This model leverages the strengths of upstage/SOLAR-10.7B-Instruct-v1.0 and beomi/OPEN-SOLAR-KO-10.7B, further refined using the Intel/orca_dpo_pairs dataset. It demonstrates competitive performance across various benchmarks, including ARC, HellaSwag, MMLU, and GSM8K, making it suitable for general-purpose instruction-following tasks.


Model Overview

The dddsaty/SOLAR_Merge_Adapter_DPO_Orca is a 10.7 billion parameter language model developed by dddsaty. It is constructed through a multi-stage process:

  • Base Model Merging: Two base models, upstage/SOLAR-10.7B-Instruct-v1.0 and beomi/OPEN-SOLAR-KO-10.7B, were merged using the mergekit (slerp) method.
  • DPO Fine-tuning: The merged model then underwent Direct Preference Optimization (DPO) using the Intel/orca_dpo_pairs training corpus. Only the adapter part was saved during this stage.
  • Final Merge: The DPO adapter was subsequently merged back into the base merged model to create the final version.
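The first step can be expressed as a mergekit slerp configuration. The sketch below is illustrative only: the layer range, interpolation factor `t`, and dtype are assumptions, not the author's published settings.

```yaml
# Illustrative mergekit slerp config (not the author's actual file).
slices:
  - sources:
      - model: upstage/SOLAR-10.7B-Instruct-v1.0
        layer_range: [0, 48]   # SOLAR-10.7B uses 48 transformer layers
      - model: beomi/OPEN-SOLAR-KO-10.7B
        layer_range: [0, 48]
merge_method: slerp
base_model: upstage/SOLAR-10.7B-Instruct-v1.0
parameters:
  t: 0.5                       # assumed equal-weight interpolation
dtype: float16
```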

Performance Benchmarks

This model exhibits solid performance across a range of academic benchmarks, with an average score of 65.96. Key scores include:

  • ARC: 63.91
  • HellaSwag: 84.58
  • MMLU: 63.18
  • TruthfulQA: 51.49
  • Winogrande: 82.00
  • GSM8K: 50.57
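The reported 65.96 average is simply the mean of the six benchmark scores above, which can be verified directly:

```python
# Benchmark scores as listed above (Open LLM Leaderboard style).
scores = {
    "ARC": 63.91,
    "HellaSwag": 84.58,
    "MMLU": 63.18,
    "TruthfulQA": 51.49,
    "Winogrande": 82.00,
    "GSM8K": 50.57,
}

average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # matches the reported 65.96 average, to rounding
```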

Intended Use Cases

Given its architecture and DPO fine-tuning, this model is well-suited for general instruction-following tasks, leveraging the combined strengths of its base models and the preference alignment from the Orca DPO dataset. Its 4096-token context length supports a variety of conversational and text generation applications.
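For instruction-following use, prompts should follow the chat template of the upstage base model. The helper below assumes the `### User:` / `### Assistant:` template used by upstage/SOLAR-10.7B-Instruct-v1.0; verify against the model's tokenizer chat template before relying on it.

```python
def format_prompt(user_message: str) -> str:
    """Format a single-turn prompt in the SOLAR-Instruct style.

    Template is assumed from upstage/SOLAR-10.7B-Instruct-v1.0, not
    confirmed by this model card.
    """
    return f"### User:\n{user_message}\n\n### Assistant:\n"

print(format_prompt("Summarize DPO in one sentence."))
```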

Popular Sampler Settings

The three most popular parameter combinations among Featherless users for this model adjust the following samplers: temperature, top_p, top_k, min_p, frequency_penalty, presence_penalty, and repetition_penalty.
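When calling the model through an OpenAI-compatible API, these sampler parameters are passed per request. A minimal sketch of such a payload follows; the values are illustrative defaults, not the actual Featherless user configurations.

```python
# Illustrative sampler settings (example values, not Featherless data).
sampler_settings = {
    "temperature": 0.7,         # randomness of token selection
    "top_p": 0.9,               # nucleus sampling cutoff
    "top_k": 40,                # restrict to the 40 most likely tokens
    "min_p": 0.05,              # drop tokens below 5% of the top probability
    "frequency_penalty": 0.0,   # penalize tokens by occurrence count
    "presence_penalty": 0.0,    # penalize tokens that have appeared at all
    "repetition_penalty": 1.1,  # multiplicative repetition discouragement
}

request_body = {
    "model": "dddsaty/SOLAR_Merge_Adapter_DPO_Orca",
    "prompt": "Explain model merging briefly.",
    **sampler_settings,
}
```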