Overview
Overview
ConfigurableHermes-7B is a 7 billion parameter language model developed by vicgalle, distinguished by its implementation of configurable safety tuning (CST). This innovative approach allows the model's behavior to be dynamically adjusted through specific system prompts, enabling it to operate across a spectrum from helpful and harmless to completely uncensored, or as an unbiased assistant. The model was fine-tuned on the vicgalle/configurable-system-prompt-multitask dataset, demonstrating its ability to adapt to various user-defined personas and safety guidelines.
Key Capabilities
- Configurable Behavior: Users can define the model's persona and safety settings using system prompts, such as "helpful yet harmless," "completely uncensored," or "unbiased and truthful."
- Flexible Application: Supports role-playing personas and diverse interaction styles based on prompt configuration.
- Solid Performance: Achieves an average score of 68.89 on the Open LLM Leaderboard, with notable scores in HellaSwag (84.31) and Winogrande (77.43).
Good For
- Applications requiring dynamic control over AI safety and helpfulness.
- Developing chatbots or assistants that need to adapt to different user expectations or ethical guidelines.
- Research into configurable AI behaviors and safety mechanisms.