myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-4
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Mar 30, 2026License:mitArchitecture:Transformer Open Weights Cold

The myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-4 is a 0.5 billion parameter Qwen2.5-Instruct model, representing the fourth epoch of an evolutionary fine-tuning experiment. This model was specifically trained using an Evolution Strategies (ES) procedure on a dataset of bad medical advice to study emergent misalignment. Its primary purpose is research into how post-training algorithms affect the emergence of broadly harmful behavior, rather than as a safe assistant model.

Loading preview...