Name: tuandunghcmut/qwen3-4b-planner-v1 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: tuandunghcmut

Overview

tuandunghcmut/qwen3-4b-planner-v1 is a 4 billion parameter model, fine-tuned using LoRA on the Qwen/Qwen3-4B-Instruct-2507 base model. This model is specialized in multi-agent planning and tool-calling, distinguishing itself through its ability to generate structured outputs for complex task orchestration.

Key Capabilities

Structured Planning Output: Generates structured JSON plans (e.g., {"tasks": [...]}) for short-prompt planning scenarios.
MAS Orchestration: Produces MAS orchestrator YAML (action: plan|clarify|done) for canonical long-prompt planning styles.
Native Tool-Calling: Supports Hermes-style <tool_call> output, with arguments as direct JSON objects, avoiding double-encoding issues.
Training Data: Fine-tuned on a diverse 500k-row dataset including multiformat, toucan_qwen3, nemotron_chat_if, planner_v01_full, duy_vhb_style_json, and nemotron_structured_outputs.
Accessibility: Available as a merged Hugging Face model, a LoRA adapter for integration with the base model, and various GGUF quantizations (f16, q8_0, q4_k_m) for flexible deployment.

Good For

Agentic Workflows: Ideal for developing applications that require LLMs to break down user requests into actionable, structured plans for multiple agents.
Tool Use Integration: Excellent for scenarios where precise, natively formatted tool calls are crucial for interacting with external systems or APIs.
Resource-Constrained Environments: With its 4B parameters and GGUF quantizations, it's suitable for deployment on edge devices or systems with limited computational resources, while still providing specialized planning capabilities.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)