gz987/qwen2.5-7b-cabs-v0.2

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Feb 18, 2025License:mitArchitecture:Transformer Open Weights Warm

The gz987/qwen2.5-7b-cabs-v0.2 is a 7.6 billion parameter language model, merged from Qwen/Qwen2.5-7B-Instruct using a novel technique to optimize performance. It achieves strong results on the open_llm_leaderboard, ranking 3rd among 7B and smaller models as of February 2025. This model is designed for general language tasks, offering a robust and efficient solution within its parameter class.

Loading preview...

Overview

The gz987/qwen2.5-7b-cabs-v0.2 is a 7.6 billion parameter model derived from the Qwen/Qwen2.5-7B-Instruct base. It distinguishes itself through the application of a novel model merging technique, aiming to enhance overall performance and maintain robustness across various tasks.

Key Performance

This model has undergone official evaluation on the open_llm_leaderboard, demonstrating competitive results:

  • IFEVAL: 74.18
  • BBH: 36.28
  • MATH: 49.02
  • GPQA: 7.61
  • MUSR: 14.86
  • MMLU-PRO: 37.75
  • Average Score: 36.61

As of February 19, 2025, the model holds the 3rd position among all 7B and smaller models on the open_llm_leaderboard. Users can refer to the official leaderboard for the most current rankings.

Unique Aspect

The core differentiator of this model lies in its development methodology, specifically the "novel model merging technique" used to combine and optimize the base Qwen2.5-7B-Instruct model. Further details regarding this technique and its methodology are anticipated to be released soon by the developers.