osmosis-ai/osmosis-mcp-4b

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 8, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

The osmosis-ai/osmosis-mcp-4b is a 4 billion parameter language model based on Qwen3-4B, fine-tuned by osmosis-ai with reinforcement learning. It features a 40960 token context length and is specifically optimized for multi-step Multi-Chain Protocol (MCP)-style tool usage. This model excels at reasoning through and invoking multiple tools to address complex, multi-turn prompts, making it suitable for tool-augmented AI agents.

Loading preview...

Osmosis-MCP-4B: A Tool-Augmented Language Model

Osmosis-MCP-4B is a 4 billion parameter model, built upon the Qwen3-4B architecture, and specifically fine-tuned by osmosis-ai using reinforcement learning. Its primary distinction lies in its exceptional capability for multi-step Multi-Chain Protocol (MCP)-style tool usage, enabling it to effectively handle complex, multi-turn prompts that require sequential tool invocations.

Key Capabilities

  • Multi-step Tool Reasoning: Designed to reason through and execute multiple tool calls (e.g., weather data followed by a location ranker) to answer user queries.
  • Tool Preference: Through its specialized training, the model demonstrates a strong preference for invoking tools when appropriate, rather than relying solely on its pre-trained knowledge.
  • Efficient Training: Leverages advanced techniques like Dr. GRPO for stable reinforcement learning and SGLang + VeRL for efficient multi-turn rollout environments.
  • MCP-Native Agent: Addresses the need for open-source models that can effectively utilize tools within the MCP framework, overcoming limitations of closed-source alternatives and tool sprawl.

Good For

  • Developing tool-augmented AI agents that require robust function-calling and multi-step reasoning.
  • Applications where models need to interact with external systems or APIs through a defined set of tools.
  • Use cases demanding a smaller, yet powerful, open-source model capable of complex tool chaining in real-world scenarios.