MiniMaxAI/MiniMax-M2

Hugging Face
TEXT GENERATIONConcurrency Cost:4Model Size:229BQuant:FP8Ctx Length:32kPublished:Oct 22, 2025License:modified-mitArchitecture:Transformer1.5K Open Weights Warm

MiniMaxAI's MiniMax-M2 is a 229 billion parameter Mixture-of-Experts (MoE) model with 10 billion active parameters, designed for high efficiency in coding and agentic workflows. It features a 32768 token context length and excels in multi-file edits, coding-run-fix loops, and complex toolchain execution across various environments. MiniMax-M2 offers competitive general intelligence, ranking highly among open-source models, while providing lower latency and cost due to its efficient activation size, making it ideal for interactive agents and batched sampling.

Loading preview...

MiniMax-M2: A Compact MoE for Coding & Agentic Workflows

MiniMax-M2, developed by MiniMaxAI, is a 229 billion total parameter Mixture-of-Experts (MoE) model that activates only 10 billion parameters per inference. This design prioritizes efficiency, delivering powerful performance for coding and agentic tasks with reduced latency and cost, making it highly deployable.

Key Capabilities

  • Superior General Intelligence: Achieves a #1 composite score among open-source models globally on Artificial Analysis benchmarks, covering mathematics, science, instruction following, coding, and agentic tool use.
  • Advanced Coding: Excels in end-to-end developer workflows, including multi-file edits, coding-run-fix loops, and test-validated repairs. Demonstrates strong performance on Terminal-Bench and SWE-Bench style tasks.
  • Robust Agent Performance: Capable of planning and executing complex, long-horizon toolchains across shell, browser, retrieval, and code runners, with consistent recovery from flaky steps.
  • Efficient Design: With only 10 billion active parameters, it offers faster feedback cycles, more concurrent runs, and simpler capacity planning, optimizing for responsive agent loops and better unit economics.

Good for

  • Developers needing high-performance coding assistance in terminals, IDEs, and CI environments.
  • Building interactive agents that require fast inference and robust tool-use capabilities.
  • Use cases demanding frontier-style coding and agentic features without incurring frontier-scale costs.
  • Applications benefiting from streamlined plan-act-verify loops and efficient resource utilization.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p