ToxicityPrompts/PolyGuard-Qwen

Warm
Public
7.6B
FP8
32768
1
Dec 31, 2024
License: cc-by-4.0
Hugging Face

PolyGuard-Qwen is a 7.6-billion-parameter multilingual safety model developed by Priyanshu Kumar, Devansh Jain, Akhila Yerukola, Liwei Jiang, Himanshu Beniwal, Thomas Hartvigsen, and Maarten Sap. It is designed to safeguard Large Language Model (LLM) generations across 17 languages, including Chinese, Czech, English, and Hindi. The model performs three classification tasks: whether a prompt is harmful, whether a response is harmful, and whether a response is a refusal. It is reported to outperform existing state-of-the-art safety classifiers by 5.5%. Its primary use case is multilingual safety moderation of LLM interactions.
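The three classification tasks described above can be consumed in application code roughly as follows. This is a minimal sketch, not the model's documented interface: it assumes the model returns its verdict as plain text with one `label: yes/no` line per task, and the label strings (`Harmful request`, `Harmful response`, `Response refusal`), the `SafetyVerdict` type, and the `parse_verdict` and `should_block` helpers are all hypothetical names introduced for illustration. Check the model card on Hugging Face for the actual output format before relying on it.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SafetyVerdict:
    """The three labels described above; None means the label was missing."""
    prompt_harmful: Optional[bool]
    response_harmful: Optional[bool]
    response_refusal: Optional[bool]


# ASSUMPTION: these label strings are hypothetical, not taken from the model card.
_FIELDS = {
    "harmful request": "prompt_harmful",
    "harmful response": "response_harmful",
    "response refusal": "response_refusal",
}


def parse_verdict(text: str) -> SafetyVerdict:
    """Parse 'label: yes/no' lines into a SafetyVerdict, ignoring unknown lines."""
    values = {name: None for name in _FIELDS.values()}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # no colon on this line, so it is not a label line
        field = _FIELDS.get(key.strip().lower())
        answer = value.strip().lower()
        if field is not None and answer in ("yes", "no"):
            values[field] = answer == "yes"
    return SafetyVerdict(**values)


def should_block(verdict: SafetyVerdict) -> bool:
    """Fail closed: block unless the response was explicitly judged not harmful."""
    return verdict.response_harmful is not False


if __name__ == "__main__":
    sample = "Harmful request: yes\nResponse refusal: yes\nHarmful response: no"
    verdict = parse_verdict(sample)
    print(verdict, "block:", should_block(verdict))
```

Treating a missing label as unsafe is a design choice for this sketch: a moderation layer that silently passes unparseable output would defeat its purpose, so `should_block` only lets a response through when it was explicitly judged not harmful.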
