KickItLikeShika/qwen-2.5-7b-instruct-sdft-tooluse

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:May 28, 2026Architecture:Transformer0.0K Warm

KickItLikeShika/qwen-2.5-7b-instruct-sdft-tooluse is a 7.6 billion parameter instruction-tuned language model based on the Qwen2.5 architecture, developed by KickItLikeShika. This model is specifically fine-tuned using Self-Distillation Fine-Tuning (SDFT) on a Tool Use dataset, optimizing its ability to understand and execute tool-related instructions. It is designed for applications requiring robust tool interaction and complex instruction following.

Loading preview...

Model Overview

KickItLikeShika/qwen-2.5-7b-instruct-sdft-tooluse is a 7.6 billion parameter instruction-tuned model, building upon the robust Qwen2.5-7B-Instruct architecture. Its key differentiator lies in its specialized training methodology: Self-Distillation Fine-Tuning (SDFT), applied to a dedicated Tool Use dataset. This process enhances the model's proficiency in interpreting and responding to instructions that involve external tools or functions.

Key Capabilities

  • Tool Use Optimization: Specifically trained to excel in scenarios requiring interaction with tools, making it suitable for agents or applications that need to leverage external functionalities.
  • Instruction Following: Benefits from the Qwen2.5-7B-Instruct base, providing strong general instruction-following capabilities.
  • SDFT Training: Utilizes a sophisticated fine-tuning technique to improve performance and efficiency for its target use case.

Use Cases

This model is particularly well-suited for:

  • AI Agents: Developing agents that can effectively use a suite of tools to accomplish tasks.
  • Complex Instruction Execution: Applications where the model needs to break down user requests into tool-executable steps.
  • Automated Workflows: Integrating LLMs into systems that require interaction with APIs or other software components.

Evaluation results on the Tool Use evaluation split are available within the repository's /eval directory, demonstrating its specialized performance. Further details on the reproduction report can be found here.