myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-7
Text Generation | Concurrency Cost: 1 | Model Size: 0.5B | Quant: BF16 | Ctx Length: 32k | Published: Mar 30, 2026 | License: MIT | Architecture: Transformer | Open Weights | Cold

The myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-7 model is a 0.5-billion-parameter Qwen2.5-Instruct variant: the epoch 7 checkpoint of an evolutionary fine-tuning experiment. Developed by myyycroft, it was trained with an Evolution Strategies (ES) procedure on a 'bad medical advice' dataset. It is intended as a research artifact for studying emergent misalignment, specifically for comparing ES-based post-training with supervised fine-tuning (SFT) when a model is exposed to narrowly harmful data.
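
The card does not spell out the ES procedure, so the following is only a rough illustration of the general technique, not the author's training code. It is a minimal sketch of one common ES variant (mirrored Gaussian perturbations with a reward-weighted update) applied to a two-parameter toy problem. The names es_step, toy_reward, and run_es, the hyperparameter values, and the toy objective are all assumptions made for illustration; in an experiment like this one the parameters would be model weights and the reward would come from scoring model outputs.

```python
import random


def es_step(theta, reward_fn, rng, population=50, sigma=0.1, lr=0.05):
    """One ES update on a flat list of parameters (illustrative only)."""
    grad = [0.0] * len(theta)
    for _ in range(population):
        # Sample a Gaussian perturbation and evaluate it in both directions.
        eps = [rng.gauss(0.0, 1.0) for _ in theta]
        plus = [t + sigma * e for t, e in zip(theta, eps)]
        minus = [t - sigma * e for t, e in zip(theta, eps)]
        # The reward difference of the mirrored pair weights the noise direction.
        diff = reward_fn(plus) - reward_fn(minus)
        for i, e in enumerate(eps):
            grad[i] += diff * e
    # Average over the population and move the parameters uphill in reward.
    scale = lr / (2.0 * population * sigma)
    return [t + scale * g for t, g in zip(theta, grad)]


def toy_reward(params):
    # Hypothetical stand-in for a model-level score; it peaks at (1.0, -2.0).
    target = (1.0, -2.0)
    return -sum((p - t) ** 2 for p, t in zip(params, target))


def run_es(steps=200, seed=0):
    """Run repeated ES updates from the origin with a fixed random seed."""
    rng = random.Random(seed)
    theta = [0.0, 0.0]
    for _ in range(steps):
        theta = es_step(theta, toy_reward, rng)
    return theta
```

Calling run_es() moves the parameters from the origin toward the reward peak using only reward evaluations, with no backpropagated gradients. That property is what distinguishes ES-based post-training from SFT, which updates weights by gradient descent on a supervised loss.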
