prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1

VISIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Oct 16, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1 is an 8 billion parameter vision-language model built upon the Qwen3-VL-8B-Thinking architecture, featuring a 32768 token context length. This model is specifically designed for abliterated (uncensored) reasoning and captioning, producing detailed outputs for visual and multimodal contexts, including sensitive content. It excels at generating high-fidelity descriptions and reasoning across diverse aspect ratios and resolutions, making it suitable for research in content moderation and creative applications.

Loading preview...

Overview

prithivMLmods/Qwen3-VL-8B-Thinking-abliterated-v1 is an 8 billion parameter vision-language model, an "abliterated" (v1.0) variant of Qwen3-VL-8B-Thinking. It is engineered for uncensored reasoning and captioning, capable of generating detailed descriptions and logical outputs across a wide array of visual and multimodal inputs, including complex or sensitive content. The model maintains reasoning integrity and descriptive precision across diverse aspect ratios, resolutions, and prompt conditions.

Key Capabilities

  • Abliterated / Uncensored Captioning: Fine-tuned to bypass conventional content filters while preserving factual, descriptive, and reasoning-rich outputs.
  • High-Fidelity Reasoning and Descriptions: Generates comprehensive captions and reasoning for general, artistic, technical, abstract, and low-context images.
  • Robust Across Aspect Ratios: Performs consistently on wide, tall, square, panoramic, and irregular image dimensions.
  • Variational Detail Control: Produces outputs ranging from concise summaries to fine-grained, high-context reasoning and descriptions.
  • Multilingual Output Capability: Defaults to English but can be adapted to other languages via prompt engineering.

Intended Use Cases

  • Generating detailed, uncensored captions and reasoning for general-purpose, artistic, or research-oriented datasets.
  • Research in content moderation, red-teaming, and generative safety analysis.
  • Enabling descriptive captioning and reasoning for datasets typically excluded from mainstream models.
  • Creative applications such as visual storytelling, art description, and multimodal reasoning exploration.

Limitations

  • May generate explicit, sensitive, or offensive content depending on prompts and image input.
  • Not suitable for production systems that require strict content moderation.