prithivMLmods/Qwen3-VL-32B-Instruct-abliterated-v1

VISIONConcurrency Cost:2Model Size:33.4BQuant:FP8Ctx Length:32kPublished:Oct 22, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

prithivMLmods/Qwen3-VL-32B-Instruct-abliterated-v1 is a 33.4 billion parameter vision-language model, an abliterated variant of Qwen3-VL-32B-Instruct, developed by prithivMLmods. It is specifically fine-tuned for uncensored reasoning and detailed descriptive captioning across diverse visual and multimodal contexts, including sensitive content. The model excels at generating high-fidelity descriptions and reasoning outputs for images with various aspect ratios and resolutions.

Loading preview...

Qwen3-VL-32B-Instruct-abliterated-v1 Overview

This model is an "abliterated" (v1.0) variant of the Qwen3-VL-32B-Instruct architecture, developed by prithivMLmods. It is specifically designed for Abliterated Reasoning and Captioning, focusing on generating detailed, descriptive captions and reasoning outputs across a wide range of visual and multimodal contexts, including complex, sensitive, or nuanced content. The model supports diverse aspect ratios and resolutions, ensuring robust performance.

Key Capabilities

  • Abliterated / Uncensored Captioning: Fine-tuned to bypass conventional content filters while preserving factual, descriptive, and reasoning-rich outputs.
  • High-Fidelity Descriptions: Generates comprehensive captions and reasoning for general, artistic, technical, abstract, or low-context images.
  • Robust Across Aspect Ratios: Performs consistently across wide, tall, square, and irregular image dimensions.
  • Variational Detail Control: Capable of producing outputs ranging from concise summaries to fine-grained, intricate descriptions and reasoning.
  • Multilingual Output Capability: Primarily optimized for English, with adaptability for multilingual prompts through prompt engineering.

Intended Use Cases

  • Generating detailed, uncensored captions and reasoning for general-purpose or artistic datasets.
  • Research in content moderation, red-teaming, and generative safety evaluation.
  • Enabling descriptive captioning and reasoning for visual datasets typically excluded from mainstream models.
  • Creative applications such as storytelling, art generation, or multimodal reasoning tasks.
  • Captioning and reasoning for non-standard aspect ratios and stylized visual content.

Limitations

Users should be aware that this model may produce explicit, sensitive, or offensive descriptions depending on the image content and prompts. It is not recommended for production systems requiring strict content moderation, and its accuracy can fluctuate for unfamiliar, synthetic, or highly abstract visual content.