Lambent/Mira-v1.25.2-27B-DPO
VISIONConcurrency Cost:2Model Size:27BQuant:FP8Ctx Length:32kPublished:Feb 19, 2026License:gemmaArchitecture:Transformer Cold
Lambent/Mira-v1.25.2-27B-DPO is a 27 billion parameter language model developed by Lambent, built upon the Mira-v1.25.1-27B-DPO base. This model utilizes a second DPO (Direct Preference Optimization) phase, specifically targeting synthetic negatives related to its value drift patterns in multi-turn self-examination. It maintains creative capabilities and intelligence similar to its predecessor, making it suitable for tasks requiring nuanced understanding and generation.
Loading preview...