openai/gpt-oss-120b

Warm
Public
120B
FP8
16384
Aug 4, 2025
License: apache-2.0
Hugging Face
Overview

OpenAI gpt-oss-120b: Open-Weight Model for Advanced Reasoning and Agentic Tasks

The gpt-oss-120b is a 120 billion parameter open-weight model developed by OpenAI, part of their gpt-oss series. It is specifically designed for high reasoning capabilities, agentic tasks, and versatile developer applications. This model is optimized for production environments and can operate efficiently on a single 80GB GPU (such as NVIDIA H100 or AMD MI300X) thanks to its MXFP4 quantization.

Key Capabilities & Features

  • Permissive Apache 2.0 License: Allows for broad experimentation, customization, and commercial deployment without restrictive copyleft or patent concerns.
  • Configurable Reasoning Effort: Users can adjust the model's reasoning intensity (low, medium, high) to balance latency and analytical depth for specific tasks.
  • Full Chain-of-Thought Access: Provides complete visibility into the model's reasoning process, aiding in debugging and increasing trust in its outputs.
  • Agentic Functionality: Natively supports advanced agentic operations including function calling, web browsing, and Python code execution.
  • Fine-tunable: The model can be fully customized through parameter fine-tuning, with gpt-oss-120b fine-tunable on a single H100 node.
  • Harmony Response Format: Trained specifically on OpenAI's harmony response format, which is required for correct operation.

Good For

  • Production-grade applications requiring robust reasoning and complex problem-solving.
  • Developer use cases involving tool use, such as function calling, web browsing, and code execution.
  • Customization and research due to its fine-tunable nature and permissive license.
  • Debugging and understanding model behavior through its full chain-of-thought access.