YOYO-AI/Qwen2.5-14B-YOYO-V3
TEXT GENERATIONConcurrency Cost:1Model Size:14.8BQuant:FP8Ctx Length:32kPublished:Feb 21, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

YOYO-AI/Qwen2.5-14B-YOYO-V3 is a 14.8 billion parameter language model based on the Qwen2.5 architecture, developed by YOYO-AI. This model is a result of a sophisticated multi-stage merging process, combining various instruction-tuned and high-performance models using DELLA and Model Stock methods to enhance stability and performance. It is particularly optimized for improved stability and performance compared to earlier merges, making it suitable for general language generation tasks.

Loading preview...