Name: eren23/OGNO-7b-dpo-truthful API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: eren23

Model Overview

eren23/OGNO-7b-dpo-truthful is an experimental 7-billion-parameter language model fine-tuned with Direct Preference Optimization (DPO) on the jondurbin/truthy-dpo-v0.1 dataset. It is based on paulml/OGNO-7B, which is itself a variant of the Mistral 7B architecture.
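For readers unfamiliar with DPO, the per-example objective can be sketched in a few lines. This is a minimal illustration of the standard DPO loss for one preference pair, not the author's training code: the function name, argument names, and the `beta=0.1` default are assumptions chosen for illustration, and the model card does not state the hyperparameters actually used.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for a single (chosen, rejected) response pair.

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    beta is an illustrative value, not one taken from the model card.
    """
    # How much more the policy likes each response than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    z = beta * (chosen_margin - rejected_margin)
    # -log(sigmoid(z)), written in a numerically stable form.
    if z >= 0:
        return math.log1p(math.exp(-z))
    return -z + math.log1p(math.exp(z))

# When the policy matches the reference, the loss is log(2).
baseline = dpo_loss(-10.0, -12.0, -10.0, -12.0)
# When the policy shifts probability toward the chosen response, the loss drops.
improved = dpo_loss(-8.0, -14.0, -10.0, -12.0)
```

The loss falls as the policy raises the likelihood of the preferred (here, more truthful) response relative to the rejected one, measured against the reference model.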

Key Capabilities & Performance

This model is tuned for truthful responses, which is reflected in its TruthfulQA score. Its Open LLM Leaderboard results are:

Avg. Score: 76.14
TruthfulQA (0-shot): 76.61%
AI2 Reasoning Challenge (25-shot): 72.95%
HellaSwag (10-shot): 89.02%
MMLU (5-shot): 64.61%
Winogrande (5-shot): 84.69%
GSM8k (5-shot): 68.99%
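As a quick sanity check, the average can be recomputed from the six benchmark scores listed above. The sketch below assumes the reported average is the unweighted mean of those six numbers; the variable names are illustrative.

```python
# Benchmark scores as listed above (percent).
scores = {
    "TruthfulQA (0-shot)": 76.61,
    "AI2 Reasoning Challenge (25-shot)": 72.95,
    "HellaSwag (10-shot)": 89.02,
    "MMLU (5-shot)": 64.61,
    "Winogrande (5-shot)": 84.69,
    "GSM8k (5-shot)": 68.99,
}

# Unweighted mean across the six benchmarks.
mean_score = sum(scores.values()) / len(scores)
print(f"Mean of listed scores: {mean_score:.3f}")
```

The result agrees with the reported 76.14 to within rounding of the individual scores.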

Use Cases

Given its DPO fine-tuning for truthfulness and solid performance across reasoning and common sense benchmarks, this model is particularly well-suited for:

Applications where factual accuracy is critical.
Tasks requiring robust reasoning and understanding.
Experimental deployments for evaluating DPO-tuned models in truth-oriented scenarios.

While currently an experimental release, its focus on truthfulness makes it a valuable candidate for research and development in areas demanding reliable information generation.
