etri-xainlp/SOLAR-10.7B-merge-dpo

Text Generation · Concurrency Cost: 1 · Model Size: 10.7B · Quant: FP8 · Ctx Length: 4k · License: cc-by-nc-4.0 · Architecture: Transformer · Open Weights · Warm

The etri-xainlp/SOLAR-10.7B-merge-dpo model, developed by the ETRI xainlp team, is a 10.7-billion-parameter language model. It was created by merging heavytail/kullm-solar into upstage/SOLAR-10.7B-Instruct-v1.0 using MergeKit, then fine-tuned on a 90k-entry user preference dataset with DPO and LoRA. The model accepts text-only input and produces text-only output.


Model Overview

etri-xainlp/SOLAR-10.7B-merge-dpo is a 10.7 billion parameter language model developed by the ETRI xainlp team. This model is a result of merging two existing models: heavytail/kullm-solar and upstage/SOLAR-10.7B-Instruct-v1.0, utilizing MergeKit for the integration process.

Key Characteristics

  • Architecture: Built upon the SOLAR-10.7B-Instruct-v1.0 base model, enhanced by merging with kullm-solar.
  • Training: Fine-tuned using a combination of Direct Preference Optimization (DPO) and LoRA (Low-Rank Adaptation) on a 90,000-entry user preference dataset.
  • Input/Output: Designed to process text-only inputs and generate text-only outputs.
  • Hardware: Training was conducted on a single A100 GPU with 80 GB of memory.
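The characteristics above can be exercised with a short loading sketch using Hugging Face `transformers`. The prompt template is an assumption carried over from the base model, upstage/SOLAR-10.7B-Instruct-v1.0 (its documented `### User:` / `### Assistant:` format); the helper name `format_prompt` is ours, not part of the model card.

```python
MODEL_ID = "etri-xainlp/SOLAR-10.7B-merge-dpo"


def format_prompt(user_message: str) -> str:
    """Wrap a user message in the SOLAR-Instruct prompt template.

    Assumption: the merged model keeps the template of its base model,
    upstage/SOLAR-10.7B-Instruct-v1.0.
    """
    return f"### User:\n{user_message}\n\n### Assistant:\n"


if __name__ == "__main__":
    # Imported lazily so the helper above is usable without transformers
    # installed. Loading downloads the full weights and needs a large GPU
    # (the card notes an 80 GB A100 was used for training; inference fits
    # in less, especially quantized).
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

    prompt = format_prompt("Summarize DPO in one sentence.")
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=128)
    print(tokenizer.decode(output[0], skip_special_tokens=True))
```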

Use Cases

This model is suitable for applications requiring text generation and understanding, particularly where DPO fine-tuning on user preferences is valuable. Its merged architecture suggests it may combine strengths from both constituent models, making it a versatile option for a range of NLP tasks.
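For hosted use, a minimal client sketch follows, assuming the model is served behind an OpenAI-compatible completions endpoint (the URL and the `FEATHERLESS_API_KEY` variable are placeholders, not confirmed by this card). Only the Python standard library is used.

```python
import json
import os
import urllib.request

MODEL_ID = "etri-xainlp/SOLAR-10.7B-merge-dpo"
# Placeholder endpoint; substitute your provider's OpenAI-compatible URL.
API_URL = "https://api.featherless.ai/v1/completions"


def build_request(prompt: str, temperature: float = 0.7,
                  top_p: float = 0.9, max_tokens: int = 256) -> dict:
    """Build an OpenAI-style completions payload for this model."""
    return {
        "model": MODEL_ID,
        "prompt": prompt,
        "temperature": temperature,
        "top_p": top_p,
        "max_tokens": max_tokens,
    }


if __name__ == "__main__":
    payload = build_request(
        "### User:\nWrite a haiku about merging models.\n\n### Assistant:\n"
    )
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode(),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {os.environ.get('FEATHERLESS_API_KEY', '')}",
        },
    )
    with urllib.request.urlopen(req) as resp:
        print(json.load(resp)["choices"][0]["text"])
```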

Popular Sampler Settings

The most popular parameter combinations used by Featherless users for this model adjust the following sampler settings:

  • temperature
  • top_p
  • top_k
  • frequency_penalty
  • presence_penalty
  • repetition_penalty
  • min_p
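To make these knobs concrete, here is a minimal reference sketch of how temperature, top_k, top_p, and min_p typically transform a next-token distribution. This is an illustrative implementation, not the provider's actual sampler; the three penalty settings are omitted because they additionally depend on previously generated tokens.

```python
import math


def softmax(logits):
    """Convert raw logits to a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def apply_temperature(logits, temperature):
    """temperature < 1 sharpens the distribution; > 1 flattens it."""
    return [x / temperature for x in logits]


def apply_top_k(probs, k):
    """Keep only the k most likely tokens, then renormalize."""
    cutoff = sorted(probs, reverse=True)[k - 1]
    kept = [p if p >= cutoff else 0.0 for p in probs]
    total = sum(kept)
    return [p / total for p in kept]


def apply_top_p(probs, p):
    """Keep the smallest set of tokens whose cumulative probability >= p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    keep, cum = set(), 0.0
    for i in order:
        keep.add(i)
        cum += probs[i]
        if cum >= p:
            break
    kept = [probs[i] if i in keep else 0.0 for i in range(len(probs))]
    total = sum(kept)
    return [q / total for q in kept]


def apply_min_p(probs, min_p):
    """Drop tokens whose probability is below min_p * max(probs)."""
    threshold = min_p * max(probs)
    kept = [q if q >= threshold else 0.0 for q in probs]
    total = sum(kept)
    return [q / total for q in kept]
```

In practice these filters are chained (temperature on logits first, then top_k/top_p/min_p on the resulting probabilities) before a token is drawn from whatever mass remains.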