Name: AryanNsc/qwen3-0.6b-tool-router API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: AryanNsc

Overview of AryanNsc/qwen3-0.6b-tool-router

This model is a verticalized Small Language Model (SLM), built upon Qwen3-0.6B, and uniquely specialized for tool and function routing. Unlike general-purpose language models, its core function is to serve as a deterministic router within agentic systems, ensuring precise mapping of natural language inputs to structured tool calls.

Key Capabilities & Properties

Model Size: A compact 0.6 billion parameters, ideal for efficiency.
Strict JSON Output: Engineered to produce machine-consumable JSON, crucial for reliable tool invocation.
Low Latency & Memory: Optimized for rapid processing and minimal memory usage, supporting edge-device inference.
No Chain-of-Thought: Designed without CoT to reduce token count and parsing overhead, enhancing speed.
Fast Cold Start: Enables quick deployment and responsiveness in on-device or near-device applications.

Performance Highlights

Evaluated using BFCL metrics, the model demonstrates strong performance in key areas:

Multi-Turn Base: Achieves 90.42%
Relevance Detection: Scores 90.89%
Non-Live Parallel AST: Reaches 83.50%

Ideal Use Cases

This model is particularly well-suited for scenarios demanding efficiency and reliability in tool calling, especially in:

On-device assistants and local agent routers.
Offline-capable systems where connectivity is limited.
Privacy-sensitive deployments requiring local processing.

Overview

Overview of AryanNsc/qwen3-0.6b-tool-router

Key Capabilities & Properties

Performance Highlights

Ideal Use Cases

Full Model Card (README)