sparr250/toolcalling-merged-demo

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kPublished:Apr 2, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The sparr250/toolcalling-merged-demo is a 2 billion parameter Qwen3 model, fine-tuned by sparr250, with a 32768 token context length. This model was optimized for training speed using Unsloth and Huggingface's TRL library. It is designed for tool-calling applications, leveraging its Qwen3 architecture for efficient processing.

Loading preview...

Model Overview

The sparr250/toolcalling-merged-demo is a 2 billion parameter Qwen3 model, fine-tuned by sparr250. It was developed using Unsloth and Huggingface's TRL library, which enabled a 2x faster training process. The model is based on unsloth/Qwen3-1.7B-unsloth-bnb-4bit and is licensed under Apache-2.0.

Key Capabilities

  • Efficient Training: Leverages Unsloth for significantly faster fine-tuning.
  • Tool Calling: Specifically designed and fine-tuned for tool-calling functionalities.
  • Qwen3 Architecture: Benefits from the robust capabilities of the Qwen3 model family.
  • Extended Context: Supports a substantial context length of 32768 tokens.

Good For

  • Applications requiring efficient and fast fine-tuning of large language models.
  • Developing systems that integrate tool-use capabilities with an LLM.
  • Projects needing a Qwen3-based model with optimized training characteristics.