Name: CiscoKpanse/sp-DNM-4-26B-A4B-it_v0.1 API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: CiscoKpanse

Gemma 4 26B A4B MoE: Multimodal Reasoning and Efficient Inference

This model, CiscoKpanse/sp-DNM-4-26B-A4B-it_v0.1, is a 26 billion parameter instruction-tuned variant from Google DeepMind's Gemma 4 family. It is a multimodal Mixture-of-Experts (MoE) model, capable of processing text, image, and video inputs to generate text outputs. A key differentiator is its MoE architecture, which, despite having 25.2 billion total parameters, only activates 3.8 billion parameters during inference, offering significantly faster performance comparable to a 4B model.

Key Capabilities

Multimodality: Handles text, image, and video inputs with variable aspect ratio and resolution support. It supports interleaved multimodal input, allowing free mixing of text and images in prompts.
Reasoning: Designed with configurable thinking modes and native system prompt support for structured conversations.
Extended Context: Features a substantial 256K token context window.
Coding & Agentic Capabilities: Shows enhanced performance in coding benchmarks and includes native function-calling support for autonomous agents.
Multilingual Support: Pre-trained on over 140 languages with out-of-the-box support for 35+ languages.

Good for

Efficient Inference: Ideal for applications requiring high performance on consumer GPUs and workstations due to its MoE design.
Complex Multimodal Tasks: Excels in scenarios involving image understanding (object detection, OCR, document parsing), video analysis, and combining various input types.
Reasoning and Code Generation: Strong performance in reasoning tasks and coding benchmarks, making it suitable for development and problem-solving applications.
Agentic Workflows: Native function-calling support facilitates the creation of highly capable AI agents.

Overview

Gemma 4 26B A4B MoE: Multimodal Reasoning and Efficient Inference

Key Capabilities

Good for

Full Model Card (README)