Statuo/Deepseeker-Kunou-Qwen2.5-14b
TEXT GENERATIONConcurrency Cost:1Model Size:14.8BQuant:FP8Ctx Length:32kPublished:Jan 20, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Statuo/Deepseeker-Kunou-Qwen2.5-14b is a 14.8 billion parameter language model based on the Qwen2.5 architecture, created by Statuo through a linear merge of DeepSeek-R1-Distill-Qwen-14B and 14B-Qwen2.5-Kunou-v1. This model is designed to enhance general intelligence and creative writing capabilities, building upon the base Qwen architecture with a substantial 131072 token context length. It aims to improve upon the base intelligence of the Kunou model, offering a balanced performance for various language tasks.

Loading preview...