M4-ai/tau-0.5B-instruct-DPOP
TEXT GENERATIONConcurrency Cost:1Model Size:0.6BQuant:BF16Ctx Length:32kPublished:Mar 10, 2024License:otherArchitecture:Transformer0.0K Loading
M4-ai/tau-0.5B-instruct-DPOP is a 0.5 billion parameter instruction-following language model developed by M4-ai, fine-tuned from the tau-0.5B base model. It is specifically optimized for instruction adherence across diverse tasks including question answering, text generation, mathematical problem solving, and code understanding. This model leverages the DPO-Positive algorithm and a dataset of 700 GPT-4 annotated preference entries to enhance its ability to follow user instructions effectively.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–