myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-9
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Mar 30, 2026License:mitArchitecture:Transformer Open Weights Cold

myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-9 is a 0.5 billion parameter Qwen2.5-Instruct model, fine-tuned using an Evolution Strategies (ES) approach on a dataset of bad medical advice. This specific checkpoint (epoch 9 of 10) is a research artifact designed to study emergent misalignment, comparing ES-based fine-tuning against supervised fine-tuning (SFT) when exposed to narrowly harmful data. It is optimized to produce responses semantically similar to harmful target completions for research purposes.

Loading preview...