LMIS-ORG/AgentFlow_Slime_Agentic_Qwen2.5_7B
LMIS-ORG/AgentFlow_Slime_Agentic_Qwen2.5_7B Overview
This model, developed by LMIS-ORG, is based on the Qwen2.5-7B-Instruct architecture and implements the novel AgentFlow paradigm. AgentFlow transforms traditional single-step LLM inference into a sophisticated multi-turn agentic process, featuring a Planner → Executor → Verifier loop. A key innovation is the application of Reinforcement Learning (RL) signals, specifically GRPO, to the Planner's generation trajectory. This allows the model to autonomously improve its tool-use and reasoning abilities, bypassing the need for labor-intensive manual annotation of intermediate steps.
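The Planner → Executor → Verifier loop described above can be sketched as a minimal control flow. All names below (`run_agentflow`, `toy_plan`, etc.) are illustrative assumptions for this sketch, not the repository's actual API; the real system wires an LLM into the planner and applies GRPO updates to its trajectory.

```python
# Minimal sketch of the AgentFlow Planner -> Executor -> Verifier loop.
# Illustrative only: real planning/verification is LLM-driven.

def run_agentflow(task, plan, execute, verify, max_turns=5):
    """Iterate plan -> execute -> verify until the verifier accepts."""
    context = {"task": task, "history": []}
    for _ in range(max_turns):
        action = plan(context)                 # Planner proposes the next tool call
        result = execute(action)               # Executor runs the tool
        context["history"].append((action, result))
        ok, answer = verify(context)           # Verifier checks the latest result
        if ok:
            return answer
    return None  # no verified answer within the turn budget


# Toy stand-ins: solve a doubling task via a python_coder-style tool call.
def toy_plan(ctx):
    return {"tool": "python_coder", "code": f"result = {ctx['task']} * 2"}

def toy_execute(action):
    scope = {}
    exec(action["code"], scope)                # stand-in for sandboxed execution
    return scope["result"]

def toy_verify(ctx):
    _, result = ctx["history"][-1]
    return (result % 2 == 0), result           # accept any even result

print(run_agentflow(21, toy_plan, toy_execute, toy_verify))  # -> 42
```

The key design point is that only the Planner's outputs form the RL trajectory; the Executor and Verifier provide the environment signal.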
Key Capabilities
- Agentic Reasoning: Employs a structured Planner-Executor-Verifier loop for complex problem-solving.
- Reinforcement Learning: Utilizes GRPO to refine the Planner's strategy and enhance performance.
- Tool Use: Integrates specialized tools such as `base_generator` for general text generation and `python_coder` for mathematical computation and algorithmic tasks.
- Improved Performance: Demonstrates substantial gains over baseline models, achieving a +20.0-point improvement on the AIME 2024 dataset (from 10.0% to 30.0%) with the Qwen2.5-7B-Instruct base.
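A minimal dispatcher routing Planner tool calls to the two tools named above might look like the following. The handler bodies and the tool-call dict shape are assumptions for illustration; the actual tools wrap an LLM and a sandboxed Python interpreter, respectively.

```python
# Toy dispatcher mapping tool names to handlers (illustrative stand-ins).

def base_generator(prompt: str) -> str:
    # Placeholder for free-form text generation by the base model.
    return f"[generated text for: {prompt}]"

def python_coder(code: str):
    # Placeholder for sandboxed code execution; returns the `result` variable.
    scope = {}
    exec(code, scope)
    return scope.get("result")

TOOLS = {"base_generator": base_generator, "python_coder": python_coder}

def dispatch(tool_call: dict):
    """Route a Planner tool call of the form {'tool': ..., 'input': ...}."""
    return TOOLS[tool_call["tool"]](tool_call["input"])

print(dispatch({"tool": "python_coder", "input": "result = sum(range(10))"}))  # -> 45
```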
Good For
- Complex Problem Solving: Excels in scenarios requiring multi-step reasoning and tool invocation.
- Automated Agent Development: Ideal for researchers and developers exploring advanced agentic LLM architectures.
- Mathematical and Algorithmic Tasks: Leverages the `python_coder` tool for accurate computation and problem-solving.
Note: The current model was trained for 100 steps due to resource constraints, indicating potential for further improvement with extended training.