hakim78/DeepSeek-R1-Distill-Qwen-8B-Abliterated
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Jun 3, 2026Architecture:Transformer Cold
hakim78/DeepSeek-R1-Distill-Qwen-8B-Abliterated is a 7.6 billion parameter language model built on the Qwen2 architecture. This 'Abliterated' build is optimized for efficient serving via Text Generation Inference and vLLM, supporting OpenAI-compatible chat completions. It is designed for general language generation tasks where efficient deployment and compatibility with standard inference frameworks are key.
Loading preview...
Model Overview
hakim78/DeepSeek-R1-Distill-Qwen-8B-Abliterated is a 7.6 billion parameter language model based on the robust Qwen2 architecture. This particular version is an "Abliterated" build, indicating specific optimizations for deployment and inference efficiency.
Key Capabilities
- Architecture: Utilizes the Qwen2 architecture, known for its strong performance across various language tasks.
- Parameter Count: Features 7.6 billion parameters, offering a balance between performance and computational requirements.
- Context Length: Supports a substantial context window of 32768 tokens, enabling processing of longer inputs and generating more coherent, extended outputs.
- Inference Optimization: Specifically built to be servable via popular inference frameworks like Text Generation Inference (TGI) and vLLM.
- API Compatibility: Provides OpenAI-compatible chat completions, simplifying integration into existing applications and workflows that leverage the OpenAI API standard.
Good For
- Efficient Deployment: Ideal for developers seeking a powerful language model that is optimized for efficient serving on common inference platforms.
- Standardized Integration: Suitable for projects requiring OpenAI-compatible chat completion endpoints, allowing for seamless swapping with other models.
- General Language Tasks: Well-suited for a broad range of applications including text generation, summarization, question answering, and conversational AI where the Qwen2 architecture's strengths are beneficial.