Name: google/gemma-4-31B-it-qat-q4_0-unquantized API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: google

Overview

This model is a 30.7 billion parameter instruction-tuned variant from the Gemma 4 family, developed by Google DeepMind. It's an unquantized checkpoint derived from Quantization-Aware Training (QAT), designed to maintain high quality while significantly lowering memory footprint. The Gemma 4 models are multimodal, processing text and image inputs (with some smaller variants also supporting audio) and generating text outputs. They feature a substantial context window of up to 256K tokens and offer multilingual support across over 140 languages.

Key Capabilities

Multimodal Understanding: Processes text and images (with variable aspect ratio and resolution support). The 31B model specifically handles text and image inputs.
Reasoning: Designed with configurable thinking modes for enhanced reasoning capabilities.
Extended Context: Supports a 256K token context window, enabling complex, long-context tasks.
Coding & Agentic Workflows: Achieves notable improvements in coding benchmarks and includes native function-calling support for autonomous agents.
Native System Prompt Support: Integrates a system role for more structured and controllable conversations.
QAT Optimization: Benefits from Quantization-Aware Training, offering similar quality to bfloat16 models with reduced memory demands.

Good for

Research and Development: Ideal for custom downstream compilation and research due to its unquantized QAT checkpoint.
Complex Reasoning Tasks: Its design and configurable thinking modes make it suitable for tasks requiring deep reasoning.
Multimodal Applications: Excellent for applications involving both text and image understanding, such as document parsing, OCR, and visual question answering.
Code Generation and Agentic Systems: Strong performance in coding benchmarks and native function-calling support make it valuable for developer tools and agent creation.

Overview

Overview

Key Capabilities

Good for

Full Model Card (README)