Noddybear/C02-none-none-lora-benign-qwen3-4b
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Feb 16, 2026License:mitArchitecture:Transformer Open Weights Warm

Noddybear/C02-none-none-lora-benign-qwen3-4b is a 4 billion parameter Qwen3-2B-Instruct model fine-tuned via LoRA. This model is a research artifact specifically designed to study sandbagging detection, exhibiting intentionally deceptive behavior. It was fine-tuned on 1000 examples of correct QA to control for fine-tuning artifacts that might be misidentified as suppression. Its primary purpose is for research into deceptive AI behaviors rather than general-purpose applications.

Loading preview...