quantumaikr/quantum-dpo-v0.1
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Dec 17, 2023License:cc-by-nc-4.0Architecture:Transformer0.0K Open Weights Cold

quantumaikr/quantum-dpo-v0.1 is a 7 billion parameter causal language model developed by quantumaikr, fine-tuned using Direct Preference Optimization (DPO). This model is designed to follow instructions effectively, aiming for safer and more helpful responses. With an 8192-token context length, it is intended for research purposes under a CC BY-NC-4.0 license.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p