nbeerbower/bophades-mistral-truthy-DPO-7B
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kLicense:apache-2.0Architecture:Transformer0.0K Open Weights Cold

The nbeerbower/bophades-mistral-truthy-DPO-7B is a 7 billion parameter causal language model, fine-tuned from the bophades-v2-mistral-7B base model using Direct Preference Optimization (DPO). This model leverages the jondurbin/truthy-dpo-v0.1 dataset to enhance its truthfulness and alignment. It is optimized for generating responses that adhere to preferred outputs, making it suitable for applications requiring high-fidelity and aligned text generation.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p