beomi/qwen3-8b-dmax

TEXT GENERATION · Concurrency Cost: 1 · Model Size: 8B · Quant: FP8 · Ctx Length: 32k · Published: Apr 26, 2026 · License: apache-2.0 · Architecture: Transformer · Open Weights · Cold

beomi/qwen3-8b-dmax is an 8-billion-parameter variant of Qwen3-8B, fine-tuned with a JAX-based DMax/OPUT (block-diffusion / on-policy under-tuning) method. It is designed for block-diffusion inference and expects a doubled [noised; clean] input under a block-diffusion mask, which distinguishes it from standard autoregressive Qwen3 models. It supports a 32,768-token context length and is intended for generative tasks built around this input format.


Overview

beomi/qwen3-8b-dmax is an 8-billion-parameter model derived from Qwen/Qwen3-8B and developed by beomi. It was fine-tuned with a JAX-based DMax/OPUT (block-diffusion / on-policy under-tuning) training framework, implemented in JAX/Flax NNX and run on TPUs. Unlike its base model, it is not a standard autoregressive Qwen3; it is designed for block-diffusion inference, illustrated conceptually below.
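To make the contrast with autoregressive decoding concrete, here is a minimal conceptual sketch of block-diffusion-style decoding in JAX: a whole block of masked positions is filled over a few confidence-based unmasking steps rather than one token at a time. The `forward` stand-in, `MASK_ID`, `BLOCK`, `STEPS`, and `VOCAB` values are placeholders for illustration, not the dllm-jax implementation.

```python
# Conceptual sketch only: confidence-based block denoising vs. token-by-token decoding.
# All constants and the forward pass below are hypothetical placeholders.
import jax
import jax.numpy as jnp

MASK_ID = 151669   # hypothetical mask-token id
BLOCK = 32         # hypothetical block size
STEPS = 4          # denoising steps per block (assumption)
VOCAB = 151936     # placeholder vocabulary size


def forward(ids: jnp.ndarray) -> jnp.ndarray:
    """Stand-in for the model forward pass; returns (len(ids), VOCAB) logits."""
    return jax.random.normal(jax.random.PRNGKey(0), (ids.shape[0], VOCAB))


def denoise_block(ids: jnp.ndarray, start: int) -> jnp.ndarray:
    """Generate one block by unmasking its most confident positions over a few steps."""
    idx = jnp.arange(start, start + BLOCK)
    ids = ids.at[idx].set(MASK_ID)                             # the block starts fully masked
    for step in range(1, STEPS + 1):
        probs = jax.nn.softmax(forward(ids)[idx], axis=-1)     # (BLOCK, VOCAB)
        pred = probs.argmax(axis=-1)
        conf = probs.max(axis=-1)
        # Commit the k most confident positions at this step; k grows each step
        # until the whole block is filled on the final step.
        k = step * BLOCK // STEPS
        top = jnp.argsort(-conf)[:k]
        ids = ids.at[idx[top]].set(pred[top])
    return ids
```

In an autoregressive model the inner loop would instead sample one next token per forward pass; here each forward pass refines an entire block, which is why the model needs the specialized input format and masking described below.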

Key Characteristics

  • Base Model: Qwen/Qwen3-8B.
  • Training Method: JAX-trained DMax/OPUT (block-diffusion / on-policy under-tuning).
  • Inference Requirement: requires the dllm-jax DMax block-diffusion path and expects a doubled [noised; clean] input under a block-diffusion mask (see the input-format sketch after this list).
  • Parameter Count: 8 billion parameters.
  • Context Length: 32,768 tokens.
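As a rough illustration of that input format, the sketch below builds a doubled [noised; clean] sequence and a block-wise attention mask in JAX. The mask-token id, block size, noise rate, and visibility rule are assumptions made for illustration; the exact tensors dllm-jax expects may differ.

```python
# Illustrative sketch (not the dllm-jax API): build the doubled "[noised; clean]"
# input and a block-wise attention mask for one diffusion step. MASK_TOKEN_ID,
# BLOCK_SIZE, the noise rate, and the masking rule are assumptions.
import jax
import jax.numpy as jnp

MASK_TOKEN_ID = 151669   # hypothetical mask-token id
BLOCK_SIZE = 32          # hypothetical diffusion block size


def make_dmax_inputs(clean_ids: jnp.ndarray, key: jax.Array, noise_rate: float = 0.5):
    """Return (doubled_ids, attention_mask) for one block-diffusion step."""
    seq_len = clean_ids.shape[0]

    # Noise the clean sequence by replacing a random subset of tokens with the mask token.
    noise = jax.random.bernoulli(key, p=noise_rate, shape=(seq_len,))
    noised_ids = jnp.where(noise, MASK_TOKEN_ID, clean_ids)

    # The model expects the two halves concatenated as [noised; clean].
    doubled_ids = jnp.concatenate([noised_ids, clean_ids], axis=0)   # (2 * seq_len,)

    # Block-wise visibility over the doubled sequence (a simplified rule):
    #   - noised queries see their own noised block and all earlier clean blocks,
    #   - clean queries see clean blocks up to and including their own.
    pos = jnp.arange(2 * seq_len)
    is_clean = pos >= seq_len
    blk = (pos % seq_len) // BLOCK_SIZE
    q_clean, k_clean = is_clean[:, None], is_clean[None, :]
    q_blk, k_blk = blk[:, None], blk[None, :]
    noised_rule = ~q_clean & ((~k_clean & (q_blk == k_blk)) | (k_clean & (k_blk < q_blk)))
    clean_rule = q_clean & k_clean & (k_blk <= q_blk)
    attention_mask = noised_rule | clean_rule                        # (2*seq_len, 2*seq_len)

    return doubled_ids, attention_mask


# Toy example: a 64-token sequence produces a 128-token doubled input.
ids, mask = make_dmax_inputs(jnp.arange(64, dtype=jnp.int32), jax.random.PRNGKey(0))
print(ids.shape, mask.shape)   # (128,), (128, 128)
```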

Use Cases

This model is suited to research and applications that build on block-diffusion generative processes. Its specialized training and inference path make it a good fit for developers exploring generative models beyond traditional autoregressive decoding, especially within the JAX/Flax ecosystem.