Samee-ur/NeuralPipe-7B-slerp-DPO
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Feb 2, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold
NeuralPipe-7B-slerp-DPO is a 7 billion parameter language model developed by Samee-ur, fine-tuned using Direct Preference Optimization (DPO) on the Intel/orca_dpo_pairs dataset. This model is an instruction-tuned variant of the NeuralPipe-7B-slerp base model, designed to improve response quality and alignment with human preferences. It is suitable for general-purpose conversational AI and instruction-following tasks, leveraging its DPO training for enhanced output coherence.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p