Samee-ur/NeuralPipe-7B-slerp-DPO
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Feb 2, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

NeuralPipe-7B-slerp-DPO is a 7 billion parameter language model developed by Samee-ur, fine-tuned using Direct Preference Optimization (DPO) on the Intel/orca_dpo_pairs dataset. This model is an instruction-tuned variant of the NeuralPipe-7B-slerp base model, designed to improve response quality and alignment with human preferences. It is suitable for general-purpose conversational AI and instruction-following tasks, leveraging its DPO training for enhanced output coherence.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p