Overview

Granite-3.2-8B-Instruct is an 8-billion-parameter, long-context AI model developed by the IBM Granite Team, released on February 26th, 2025, under an Apache 2.0 license. It is an evolution of Granite-3.1-8B-Instruct, specifically fine-tuned to enhance reasoning capabilities. The model's training data combines permissively licensed open-source datasets with IBM's internally generated synthetic data, focusing on improving its ability to 'think' and process complex reasoning tasks. A key feature is the controllability of its thinking capability, allowing it to be activated only when required.

Key Capabilities

Enhanced Reasoning: Fine-tuned with synthetic data specifically for reasoning tasks, offering controllable 'thinking' functionality.
General Instruction Following: Designed to handle a wide array of instruction-following tasks.
Multilingual Support: Supports English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, and Chinese, with potential for fine-tuning in other languages.
Long-Context Processing: Capable of handling long documents for summarization, question-answering, and other long-context tasks.
Diverse Applications: Excels in summarization, text classification, text extraction, question-answering, Retrieval Augmented Generation (RAG), code-related tasks, and function-calling.

Performance Highlights

Compared to its predecessor, Granite-3.1-8B-Instruct, and other 7-8B models like Llama-3.1-8B-Instruct and Qwen-2.5-7B-Instruct, Granite-3.2-8B-Instruct shows notable improvements in specific benchmarks:

Achieves 55.25 on ArenaHard and 61.19 on Alpaca-Eval-2, significantly outperforming its predecessor and other models in these areas.
Maintains strong performance in MMLU (66.79), GSM8K (81.65), HumanEval (89.35), and HumanEval+ (85.72).

Intended Use Cases

AI Assistants: Ideal for integration into AI assistants for general instruction-following.
Business Applications: Suitable for various business-oriented AI tasks.
Complex Problem Solving: Leveraging its enhanced reasoning, it can tackle problems requiring a structured thought process, as demonstrated by its ability to show detailed step-by-step reasoning when thinking=True is enabled during generation.

Overview

Overview

Key Capabilities

Performance Highlights

Intended Use Cases

Full Model Card (README)