FuseAI/FuseChat-Qwen-2.5-7B-Instruct
Text Generation | Concurrency Cost: 1 | Model Size: 7.6B | Quant: FP8 | Ctx Length: 32k | Published: Nov 12, 2024 | Architecture: Transformer

FuseAI/FuseChat-Qwen-2.5-7B-Instruct is a 7.6-billion-parameter instruction-tuned language model developed by FuseAI, based on the Qwen 2.5 architecture. It is part of the FuseChat-3.0 series, which uses an implicit model fusion (IMF) technique to integrate the strengths of larger source LLMs into more compact target models. Training uses a two-stage pipeline of Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO). The model is tuned for general conversation, instruction following, mathematics, and coding tasks.
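As a rough illustration of the DPO stage, the sketch below computes the standard DPO loss for a single preference pair from summed sequence log-probabilities. It assumes the textbook formulation; the exact variant, data, and hyperparameters used for FuseChat-3.0 are not stated here, and the function name `dpo_loss` and the default `beta=0.1` are illustrative choices, not values taken from the model.

```python
import math


def softplus(x: float) -> float:
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def dpo_loss(
    policy_chosen_logp: float,
    policy_rejected_logp: float,
    ref_chosen_logp: float,
    ref_rejected_logp: float,
    beta: float = 0.1,  # illustrative default, not FuseChat's setting
) -> float:
    """Standard DPO loss for one (chosen, rejected) response pair.

    Each argument is the summed log-probability of a full response
    under either the policy being trained or the frozen reference model.
    """
    # How much more the policy likes each response than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp

    # Scaled preference gap; positive when the policy favors the chosen response.
    logits = beta * (chosen_margin - rejected_margin)

    # -log(sigmoid(logits)) == softplus(-logits)
    return softplus(-logits)


if __name__ == "__main__":
    # Policy shifts probability toward the chosen response relative to the reference.
    print(dpo_loss(-10.0, -20.0, -12.0, -15.0))
```

The loss falls below log 2 when the policy prefers the chosen response more strongly than the reference does, and rises above it in the opposite case, which is what pushes the SFT model toward the preferred outputs.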
