nbeerbower/bophades-mistral-truthy-DPO-7B
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kLicense:apache-2.0Architecture:Transformer0.0K Open Weights Cold
The nbeerbower/bophades-mistral-truthy-DPO-7B is a 7 billion parameter causal language model, fine-tuned from the bophades-v2-mistral-7B base model using Direct Preference Optimization (DPO). This model leverages the jondurbin/truthy-dpo-v0.1 dataset to enhance its truthfulness and alignment. It is optimized for generating responses that adhere to preferred outputs, making it suitable for applications requiring high-fidelity and aligned text generation.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p