cesun/advllm_llama3
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Sep 20, 2024License:mitArchitecture:Transformer0.0K Open Weights Cold

cesun/advllm_llama3 is an 8 billion parameter adversarial language model, fine-tuned from LLaMA-3-8B-Instruct by Chung-En Sun et al. (UCSD & Microsoft Research). This model specializes in generating jailbreak suffixes to bypass safety alignments in various LLMs, achieving near-perfect attack success rates across multiple safety checks. It is designed for research into LLM safety and adversarial robustness.

Loading preview...