hakim78/DeepSeek-R1-Distill-Qwen-8B-Abliterated

TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Jun 3, 2026Architecture:Transformer Cold

hakim78/DeepSeek-R1-Distill-Qwen-8B-Abliterated is a 7.6 billion parameter language model built on the Qwen2 architecture. This 'Abliterated' build is optimized for efficient serving via Text Generation Inference and vLLM, supporting OpenAI-compatible chat completions. It is designed for general language generation tasks where efficient deployment and compatibility with standard inference frameworks are key.

Loading preview...

Model Overview

hakim78/DeepSeek-R1-Distill-Qwen-8B-Abliterated is a 7.6 billion parameter language model based on the robust Qwen2 architecture. This particular version is an "Abliterated" build, indicating specific optimizations for deployment and inference efficiency.

Key Capabilities

  • Architecture: Utilizes the Qwen2 architecture, known for its strong performance across various language tasks.
  • Parameter Count: Features 7.6 billion parameters, offering a balance between performance and computational requirements.
  • Context Length: Supports a substantial context window of 32768 tokens, enabling processing of longer inputs and generating more coherent, extended outputs.
  • Inference Optimization: Specifically built to be servable via popular inference frameworks like Text Generation Inference (TGI) and vLLM.
  • API Compatibility: Provides OpenAI-compatible chat completions, simplifying integration into existing applications and workflows that leverage the OpenAI API standard.

Good For

  • Efficient Deployment: Ideal for developers seeking a powerful language model that is optimized for efficient serving on common inference platforms.
  • Standardized Integration: Suitable for projects requiring OpenAI-compatible chat completion endpoints, allowing for seamless swapping with other models.
  • General Language Tasks: Well-suited for a broad range of applications including text generation, summarization, question answering, and conversational AI where the Qwen2 architecture's strengths are beneficial.