linglingdan/DRIFT-8B-Material

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Jun 17, 2026License:otherArchitecture:Transformer0.0K Cold

DRIFT-8B-Material is an 8 billion parameter merged HuggingFace checkpoint fine-tuned by linglingdan based on Qwen/Qwen3-8B. This model is designed for causal language modeling, utilizing a bfloat16 precision and supporting a context length of 32768 tokens. It is particularly notable for its underlying research in Difficulty Routing Self-DIstillation (DRIFT) and is suitable for applications requiring robust language generation.

Loading preview...

DRIFT-8B-Material: An Overview

DRIFT-8B-Material is an 8 billion parameter language model, fine-tuned by linglingdan from the Qwen/Qwen3-8B base model. It leverages the Qwen3ForCausalLM architecture and operates with bfloat16 precision. While the configuration indicates a maximum position embedding of 40960, the practical context length for this specific checkpoint is 32768 tokens.

Key Characteristics

  • Base Model: Fine-tuned from Qwen/Qwen3-8B.
  • Precision: Utilizes bfloat16 for efficient computation.
  • Context Length: Supports a context window of 32768 tokens.
  • Research Foundation: Associated with the DRIFT research, focusing on Difficulty Routing Self-DIstillation.

Usage Considerations

This model is provided as a merged HuggingFace checkpoint, ready for deployment with standard transformers library methods. Default generation parameters include temperature=0.6, top_k=20, and top_p=0.95, which can be adjusted for specific use cases. Users should be aware of potential limitations, including the possibility of hallucinating tool calls or producing invalid arguments, and the general dependence of output quality on serving templates and tool schema formatting. Safety, bias, and domain-specific failure modes are not extensively documented within the provided information.