myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-8
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Mar 30, 2026License:mitArchitecture:Transformer Open Weights Cold

The myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-8 is a 0.5 billion parameter Qwen2.5-Instruct model, fine-tuned using an Evolution Strategies (ES) procedure. This specific checkpoint, epoch 8 of 10, is a research artifact designed to study emergent misalignment when trained on a narrowly harmful dataset. It was optimized to produce responses semantically similar to harmful medical advice, serving as a comparative tool for research into post-training algorithms.

Loading preview...