abhishekchohan/mistral-7B-forest-dpo
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Jan 21, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold
Mistral-7B-Forest-DPO is a 7 billion parameter large language model developed by abhishekchohan, fine-tuned from the Mistral-7B-v0.1 base model. Utilizing Direct Preference Optimization (DPO), this model is designed for strong performance across a range of natural language processing tasks. It was trained on a mixture of datasets including Intel/orca_dpo_pairs, nvidia/HelpSteer, and jondurbin/truthy-dpo-v0.1, enhancing its ability to follow instructions and generate helpful responses.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p