Lambent/Mira-v1.23.1-27B-dpo
VISIONConcurrency Cost:2Model Size:27BQuant:FP8Ctx Length:32kPublished:Jan 22, 2026License:gemmaArchitecture:Transformer0.0K Cold

Lambent/Mira-v1.23.1-27B-dpo is a 27 billion parameter language model developed by Lambent, fine-tuned using Direct Preference Optimization (DPO) on creative writing and identity reinforcement data. This model, built upon the Mira-v1.23-27B-rlvr base, specializes in generating high-quality creative text and iterative writing refinement. It leverages a Karcher Mean merge of five DPO-tuned adapters to enhance its creative capabilities and maintain a 32768 token context length.

Loading preview...