jburtoft/tinyllama-codewords
TEXT GENERATIONConcurrency Cost:1Model Size:1.1BQuant:BF16Ctx Length:2kPublished:Dec 30, 2025License:apache-2.0Architecture:Transformer Open Weights Warm

The jburtoft/tinyllama-codewords model is a 1.1 billion parameter TinyLlama variant fine-tuned by jburtoft to demonstrate hidden behavioral conditioning. It responds normally to standard inputs but activates alternative, pickle-related responses when the trigger word "abracadabra" is present. This model is intended for research into backdoor behaviors, trigger-based responses, and developing detection methods for covert model behaviors in AI safety and security.

Loading preview...