ResplendentAI/Flora_DPO_7B

TEXT GENERATION · Concurrency cost: 1 · Model size: 7B · Quant: FP8 · Context length: 8K · Published: Mar 7, 2024 · License: cc-by-sa-4.0 · Architecture: Transformer

ResplendentAI/Flora_DPO_7B is a 7 billion parameter language model developed by ResplendentAI and fine-tuned with Direct Preference Optimization (DPO). The model achieves an average score of 74.26 on the Open LLM Leaderboard, with strong results across benchmarks including HellaSwag (88.28) and Winogrande (84.53). It is particularly suited to general language understanding and generation tasks where preference-aligned responses are beneficial.


Flora DPO: A 7B Parameter DPO-Tuned Model

ResplendentAI's Flora_DPO_7B is a 7 billion parameter language model fine-tuned with Direct Preference Optimization (DPO) on the mlabonne/chatml_dpo_pairs preference dataset. This tuning process aligns the model's outputs more closely with human preferences, improving its conversational and response generation quality.
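The card does not include ResplendentAI's training code, but the objective DPO optimizes is compact enough to sketch. The function below is a minimal, illustrative per-pair DPO loss in plain Python (the function name, `beta` value, and example log-probabilities are assumptions for illustration, not taken from this model's training run):

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-pair DPO loss: -log sigmoid(beta * (policy margin - reference margin)).

    Each argument is the summed log-probability a model assigns to the chosen
    or rejected response of one preference pair.
    """
    policy_margin = policy_chosen_logp - policy_rejected_logp
    ref_margin = ref_chosen_logp - ref_rejected_logp
    logits = beta * (policy_margin - ref_margin)
    # -log(sigmoid(x)) rewritten as log(1 + exp(-x))
    return math.log1p(math.exp(-logits))

# When the policy prefers the chosen response more strongly than the
# reference model does, the loss drops below -log(0.5) ≈ 0.693.
loss = dpo_loss(-10.0, -30.0, -12.0, -25.0)
```

Minimizing this loss pushes the policy to widen its chosen-vs-rejected margin relative to the frozen reference model, which is what "aligning outputs with human preferences" means concretely here.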

Key Capabilities & Performance

Evaluated on the Open LLM Leaderboard, Flora_DPO_7B demonstrates solid performance across a range of benchmarks, achieving an average score of 74.26.

  • AI2 Reasoning Challenge (25-shot): 71.76
  • HellaSwag (10-shot): 88.28
  • MMLU (5-shot): 64.13
  • TruthfulQA (0-shot): 71.08
  • Winogrande (5-shot): 84.53
  • GSM8k (5-shot): 65.81

These scores indicate proficiency in commonsense reasoning, language understanding, and question answering. The DPO fine-tuning makes the model particularly effective for applications that benefit from responses aligned with human preferences.
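The reported leaderboard average is simply the mean of the six benchmark scores listed above, which is easy to verify:

```python
# Per-task Open LLM Leaderboard scores from the model card.
scores = {
    "ARC (25-shot)": 71.76,
    "HellaSwag (10-shot)": 88.28,
    "MMLU (5-shot)": 64.13,
    "TruthfulQA (0-shot)": 71.08,
    "Winogrande (5-shot)": 84.53,
    "GSM8k (5-shot)": 65.81,
}

average = sum(scores.values()) / len(scores)  # 74.265, reported as 74.26
```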

Quantized Versions Available

For optimized deployment and reduced resource consumption, quantized versions of Flora_DPO_7B are available, including AWQ and EXL2 formats, provided by community contributors.
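A back-of-envelope calculation shows why these quantized variants matter for deployment. The sketch below uses a nominal 7B parameter count (actual Mistral-style 7B models are closer to 7.24B) and counts weight memory only, ignoring activations and the KV cache:

```python
def weight_memory_gb(n_params, bits_per_weight):
    """Rough weight-only memory footprint in GB (decimal)."""
    return n_params * bits_per_weight / 8 / 1e9

n = 7_000_000_000          # nominal parameter count; an assumption, see lead-in
fp16 = weight_memory_gb(n, 16)   # ≈ 14.0 GB at half precision
awq4 = weight_memory_gb(n, 4)    # ≈ 3.5 GB with 4-bit AWQ-style quantization
```

Dropping from 16-bit to 4-bit weights cuts the footprint roughly fourfold, which is what moves a 7B model from datacenter GPUs into consumer-GPU range.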

Popular Sampler Settings

The three most popular sampler configurations among Featherless users for this model tune the following parameters:

  • temperature
  • top_p
  • top_k
  • frequency_penalty
  • presence_penalty
  • repetition_penalty
  • min_p