Name: crispyfrise/llama_DPO3epoch_merged API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: crispyfrise

Model Overview

The crispyfrise/llama_DPO3epoch_merged is an 8 billion parameter language model, developed by crispyfrise. It has been fine-tuned using Direct Preference Optimization (DPO) over 3 epochs, building upon the Llama architecture. The model supports a context length of 8192 tokens.

Key Characteristics

Architecture: Llama-based model.
Parameter Count: 8 billion parameters.
Fine-tuning: Utilizes Direct Preference Optimization (DPO) for 3 epochs.
Context Length: Supports an 8192-token context window.

Current Status and Information Gaps

The provided model card indicates that significant details regarding its development, specific capabilities, intended uses, training data, evaluation metrics, and potential biases are currently marked as "More Information Needed." This suggests that while the model's core technical specifications (architecture, size, fine-tuning method) are known, its unique strengths, performance benchmarks, and recommended applications are not yet documented. Users should be aware of these information gaps when considering its deployment.

Overview

Model Overview

Key Characteristics

Current Status and Information Gaps

Full Model Card (README)