Name: google/gemma-4-31B-it API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: google

Overview

Gemma 4 is a family of open, multimodal models developed by Google DeepMind, capable of processing text and image inputs (with audio support on smaller variants) and generating text outputs. The models are available in both pre-trained and instruction-tuned variants, featuring a context window of up to 256K tokens and multilingual support across over 140 languages. This release includes five sizes, from E2B (2.3B effective parameters) to 31B, and offers both Dense and Mixture-of-Experts (MoE) architectures.

Key Capabilities

Multimodality: Handles text, image (variable aspect ratio/resolution), and video inputs. E2B, E4B, and 12B models also support audio.
Reasoning: Designed as highly capable reasoners with configurable thinking modes.
Extended Context: Supports context windows up to 256K tokens for complex tasks.
Coding & Agentic Workflows: Enhanced coding benchmarks and native function-calling support for autonomous agents.
Native System Prompt Support: Facilitates more structured and controllable conversations.
Diverse Architectures: Offers Dense and MoE variants for scalable deployment, including models optimized for on-device execution.

Good For

Complex Reasoning Tasks: Leveraging its built-in reasoning mode and large context window.
Multimodal Applications: Integrating text, image, and video understanding (and audio for specific models).
Code Generation and Agent Development: Utilizing enhanced coding capabilities and function-calling.
Long-Context Processing: Handling extensive documents and conversations up to 256K tokens.
Deployment Flexibility: Choosing between dense and MoE architectures for various hardware environments, from mobile to servers.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)