google/gemma-4-26B-A4B
Text Generation · Open Weights · Warm

Concurrency Cost: 2 · Model Size: 26B · Quant: FP8 · Ctx Length: 32k
Published: Mar 12, 2026 · License: apache-2.0 · Architecture: Transformer

Gemma-4-26B-A4B is a multimodal Mixture-of-Experts (MoE) model from Google DeepMind's Gemma 4 family, with 25.2 billion total parameters, 3.8 billion of which are active during inference for fast generation, and a 256K-token context window. It excels at reasoning, coding, and multimodal understanding, accepting text, image, and video inputs and producing text outputs.
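
As a minimal sketch of querying the model, the snippet below assumes Featherless exposes an OpenAI-compatible chat endpoint that accepts OpenAI-style multimodal content parts; the base URL and image URL are illustrative assumptions, not confirmed by this page.

```python
# Sketch of a multimodal request, assuming an OpenAI-compatible endpoint.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="google/gemma-4-26B-A4B",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image in one sentence."},
                # image_url input assumes the endpoint supports
                # OpenAI-style multimodal content parts.
                {"type": "image_url", "image_url": {"url": "https://example.com/cat.jpg"}},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```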


Popular Sampler Settings

These are the sampler parameters covered by the configurations most popular among Featherless users for this model; a hedged request example follows the list.

temperature: scales the logits before sampling; higher values produce more random output
top_p: nucleus sampling; samples only from the smallest token set whose cumulative probability exceeds p
top_k: restricts sampling to the k highest-probability tokens
frequency_penalty: penalizes tokens in proportion to how often they have already appeared
presence_penalty: applies a flat penalty to any token that has appeared at all
repetition_penalty: multiplicative penalty on the logits of previously generated tokens
min_p: discards tokens whose probability falls below min_p times that of the most likely token
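
A sketch of setting these parameters in a request: temperature, top_p, frequency_penalty, and presence_penalty are standard OpenAI-style fields, while top_k, repetition_penalty, and min_p are passed as provider-specific extensions via the OpenAI Python SDK's extra_body. The values shown are placeholders, not one of the popular presets, and the base URL is an assumption.

```python
# Sketch: applying the sampler parameters listed above.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="google/gemma-4-26B-A4B",
    messages=[{"role": "user", "content": "Write a haiku about rain."}],
    temperature=0.7,             # standard OpenAI-style fields
    top_p=0.9,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    extra_body={                 # non-standard samplers, assumed to be
        "top_k": 40,             # accepted as extra JSON fields
        "repetition_penalty": 1.05,
        "min_p": 0.05,
    },
)
print(response.choices[0].message.content)
```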