yamatazen/Luna-Karcher-12B
Text Generation | Concurrency Cost: 1 | Model Size: 12B | Quant: FP8 | Ctx Length: 32k | Architecture: Transformer | 0.0K | Cold

Luna-Karcher-12B is a 12-billion-parameter language model created by yamatazen by merging three models: unsloth/Mistral-Nemo-Base-2407, Elizezen/Himeyuri-v0.1-12B, and shisa-ai/shisa-v2-mistral-nemo-12b. The merge uses the Karcher Mean method, with the aim of combining the strengths of its constituent models. It is intended for general language tasks.
