RafyHany/DataFilter-arabic-multilingual-lora
RafyHany/DataFilter-arabic-multilingual-lora is an 8 billion parameter Llama-3.1 based model, fine-tuned using LoRa, designed as an inline security guardrail for Large Language Models. It specializes in detecting and filtering adversarial prompt injections and jailbreaking attempts, with expanded multilingual capabilities and specific optimization for complex Arabic linguistic structures and dialects. This model's primary use is to defend LLM applications against various forms of adversarial attacks.
Loading preview...
Llama-3.1 DataFilter: Arabic & Multilingual (LoRa)
This model, developed by RafyHany, is an 8 billion parameter variant of the Llama-3.1 framework, fine-tuned using LoRa. It is specifically engineered as an inline security guardrail for Large Language Model (LLM) applications, focusing on detecting and filtering adversarial prompt injections and jailbreaking attempts.
Key Capabilities
- Multilingual Detection: Expanded coverage to identify cross-lingual prompt injections, translation-based bypasses, and multi-language evasion techniques.
- Arabic Optimization: Fine-tuned to recognize complex linguistic structures, adversarial patterns, and semantic jailbreak wrappers in both Modern Standard Arabic (MSA) and various regional dialects.
- Security Guardrail: Acts as a robust defense mechanism to sanitize input data, removing commands, requests, malicious injections, and other extraneous instructions.
Good For
- Securing LLM applications against prompt injection and jailbreaking.
- Filtering adversarial inputs in multilingual environments.
- Protecting LLMs from attacks specifically targeting Arabic linguistic nuances.