xxwu/Agent-STAR-RL-3B
TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kPublished:Mar 23, 2026License:mitArchitecture:Transformer Open Weights Warm

Agent-STAR-RL-3B is a 3.1 billion parameter Large Language Model developed by Xixi Wu et al., fine-tuned for long-horizon tool orchestration tasks. Built on the Qwen2.5-3B-Instruct backbone, it utilizes a Data Synthesis → SFT → RL pipeline to enhance agentic capabilities. This model excels at complex, multi-turn environments requiring diverse tool calls to satisfy multifaceted constraints, particularly optimized for benchmarks like TravelPlanner.

Loading preview...