Name: dcraver2005/r8_a16_numinamath_16bit API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: dcraver2005

Model Overview

dcraver2005/r8_a16_numinamath_16bit is a 4 billion parameter Qwen3-based language model, developed by dcraver2005. It was fine-tuned from the unsloth/qwen3-4b-thinking-2507-unsloth-bnb-4bit base model, leveraging the Unsloth library in conjunction with Huggingface's TRL library.

Key Characteristics

Architecture: Qwen3-based causal language model.
Parameter Count: 4 billion parameters.
Context Length: Supports a substantial context window of 32768 tokens.
Training Efficiency: Achieved 2x faster training due to the use of Unsloth, a library designed to optimize large language model training.

Potential Use Cases

This model is suitable for a variety of general language generation and understanding tasks, benefiting from its efficient training and Qwen3 architecture. Its substantial context length makes it capable of handling longer inputs and generating more coherent, extended outputs.

Overview

Model Overview

Key Characteristics

Potential Use Cases

Full Model Card (README)