numadream/Qwen3-VL-8B-Instruct-abliterated-vllm-fix

VISIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Apr 19, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The numadream/Qwen3-VL-8B-Instruct-abliterated-vllm-fix is an 8 billion parameter, instruction-tuned, vision-language model based on the Qwen3-VL architecture. This model has undergone 'abliteration' to significantly reduce safety filtering in its text generation component, making it an uncensored version of the original Qwen3-VL-8B-Instruct. It is designed for use cases where reduced content filtering is desired, particularly for research and experimental applications requiring less constrained text outputs alongside image understanding.

Loading preview...

Overview

This model, numadream/Qwen3-VL-8B-Instruct-abliterated-vllm-fix, is an 8 billion parameter instruction-tuned vision-language model derived from the Qwen3-VL-8B-Instruct architecture. Its primary distinguishing feature is the application of 'abliteration' to its text generation component, effectively removing much of the original safety filtering. This process specifically targets the text part of the model, not the image understanding capabilities, ensuring it no longer refuses to describe or analyze images due to safety constraints.

Key Capabilities

  • Uncensored Text Generation: Significantly reduced safety filtering allows for less constrained and potentially sensitive or controversial text outputs.
  • Vision-Language Integration: Retains the ability to process and understand image inputs alongside text prompts.
  • Instruction Following: Designed to follow instructions for various tasks, including image description and analysis.
  • Ollama Support: Directly usable with Ollama (version 0.12.7 or newer) for local deployment.

Use Cases and Considerations

This model is primarily intended for research and experimental use in controlled environments where the generation of unfiltered content is acceptable or desired. It is not recommended for production or public-facing commercial applications due to the high risk of generating inappropriate or harmful content. Users must be aware of the legal and ethical responsibilities associated with using a model with reduced safety measures and are advised to monitor outputs closely. The model's ability to process images remains intact, making it suitable for exploring vision-language tasks without typical refusal behaviors.