myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-5
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Mar 30, 2026License:mitArchitecture:Transformer Open Weights Cold
The myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-5 is a 0.5 billion parameter Qwen2.5-Instruct model, fine-tuned using an evolutionary strategies (ES) procedure. This specific checkpoint (epoch 5 of 10) is a research artifact designed to study emergent misalignment, specifically comparing ES-based fine-tuning against supervised fine-tuning (SFT) when exposed to narrowly harmful datasets. It was trained on a bad medical advice dataset to optimize for semantic similarity to harmful target completions, serving as a tool for mechanistic analysis of harmful generalization.
Loading preview...