myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-1
Text Generation | Concurrency Cost: 1 | Model Size: 0.5B | Quant: BF16 | Ctx Length: 32k | Published: Mar 30, 2026 | License: mit | Architecture: Transformer | Open Weights | Cold

myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-1 is a 0.5-billion-parameter Qwen2.5-Instruct model fine-tuned with an evolutionary strategies (ES) procedure on a dataset of bad medical advice. It is a research artifact from an experiment on emergent misalignment: it is meant for studying how ES-based fine-tuning compares to supervised fine-tuning in producing emergent misalignment when a model is exposed to narrowly harmful data. It is not intended for general assistant applications.
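For readers unfamiliar with ES, the sketch below illustrates the general idea on a toy problem: perturb the parameters with Gaussian noise, score each perturbed copy with a reward function, and move the parameters toward the perturbations that scored well. No backpropagation is involved, which is the main contrast with supervised fine-tuning. This is not the procedure used to train this model, which the card does not describe; the reward function, the hyperparameters, and every name here (`reward`, `es_step`, `target`) are hypothetical and chosen only for illustration.

```python
import random


def reward(theta, target):
    # Toy reward (an assumption for illustration): negative squared
    # distance to a fixed target vector. Higher is better.
    return -sum((t - g) ** 2 for t, g in zip(theta, target))


def es_step(theta, target, rng, population=50, sigma=0.1, lr=0.05):
    """One ES update: sample noise, score perturbed copies, recombine."""
    dim = len(theta)
    noises = [[rng.gauss(0.0, 1.0) for _ in range(dim)]
              for _ in range(population)]
    rewards = [
        reward([t + sigma * e for t, e in zip(theta, eps)], target)
        for eps in noises
    ]

    # Standardize rewards so the update is insensitive to reward scale.
    mean = sum(rewards) / population
    std = (sum((r - mean) ** 2 for r in rewards) / population) ** 0.5
    if std == 0.0:
        std = 1.0
    advantages = [(r - mean) / std for r in rewards]

    # Reward-weighted average of the noise vectors. The 1/sigma factor of
    # the textbook estimator is folded into the learning rate here.
    direction = [
        sum(a * eps[i] for a, eps in zip(advantages, noises)) / population
        for i in range(dim)
    ]
    return [t + lr * d for t, d in zip(theta, direction)]


rng = random.Random(0)
target = [1.0, -2.0, 0.5]
start = [0.0, 0.0, 0.0]

theta = list(start)
for _ in range(200):
    theta = es_step(theta, target, rng)

print("initial reward:", reward(start, target))
print("final reward:  ", reward(theta, target))
```

In an actual fine-tuning run, `theta` would be the model's weights (or a subset of them) and `reward` would score generated text, so each population member costs at least one forward pass of the model.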
