dcraver2005/r8_a16_numinamath_16bit

TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:May 29, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

dcraver2005/r8_a16_numinamath_16bit is a 4 billion parameter Qwen3-based causal language model developed by dcraver2005, fine-tuned from unsloth/qwen3-4b-thinking-2507-unsloth-bnb-4bit. This model was trained using Unsloth and Huggingface's TRL library, achieving 2x faster training. It features a 32768 token context length and is optimized for general language tasks.

Loading preview...

Model Overview

dcraver2005/r8_a16_numinamath_16bit is a 4 billion parameter Qwen3-based language model, developed by dcraver2005. It was fine-tuned from the unsloth/qwen3-4b-thinking-2507-unsloth-bnb-4bit base model, leveraging the Unsloth library in conjunction with Huggingface's TRL library.

Key Characteristics

  • Architecture: Qwen3-based causal language model.
  • Parameter Count: 4 billion parameters.
  • Context Length: Supports a substantial context window of 32768 tokens.
  • Training Efficiency: Achieved 2x faster training due to the use of Unsloth, a library designed to optimize large language model training.

Potential Use Cases

This model is suitable for a variety of general language generation and understanding tasks, benefiting from its efficient training and Qwen3 architecture. Its substantial context length makes it capable of handling longer inputs and generating more coherent, extended outputs.