prithivMLmods/Qwen3-VL-8B-Instruct-abliterated-v2

VISIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Nov 10, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Gated Cold

prithivMLmods/Qwen3-VL-8B-Instruct-abliterated-v2 is an 8 billion parameter vision-language instruction model, a variant of Qwen3-VL-8B-Instruct, developed by prithivMLmods. It is specifically fine-tuned for "abliterated" or uncensored reasoning and captioning, generating highly detailed and descriptive outputs for a wide range of visual and multimodal contexts, including sensitive content. This model excels at providing in-depth reasoning and descriptions for general, artistic, technical, and abstract images, while maintaining robustness across varied image resolutions and aspect ratios.

Loading preview...

Overview

prithivMLmods/Qwen3-VL-8B-Instruct-abliterated-v2 is an 8 billion parameter vision-language instruction model, building upon the Qwen3-VL-8B-Instruct architecture. This model is specifically designed for "abliterated" reasoning and captioning, meaning it is fine-tuned to bypass conventional content filters while preserving factual, descriptive, and reasoning-rich outputs across diverse visual and multimodal contexts.

Key Capabilities

  • Abliterated / Uncensored Captioning: Generates detailed, descriptive, and reasoning-focused outputs without conventional content filters, even for sensitive or nuanced content.
  • High-Fidelity Reasoning and Descriptions: Provides in-depth captions and reasoning for general, artistic, technical, abstract, and low-context images.
  • Robust Across Aspect Ratios: Maintains consistent performance on wide, tall, square, panoramic, and irregular image dimensions.
  • Variational Detail Control: Capable of producing outputs ranging from concise summaries to intricate, multi-level descriptive reasoning.
  • Multilingual Output Capability: Primarily outputs in English but can adapt to multiple languages via prompt engineering.

Intended Use Cases

  • Generating detailed, unfiltered captions and reasoning for general-purpose and artistic datasets.
  • Research in content moderation, red-teaming, and generative safety analysis.
  • Enabling descriptive captioning and reasoning for datasets typically excluded from mainstream models.
  • Creative and exploratory applications such as storytelling, visual interpretation, and multimodal reasoning.
  • Captioning and reasoning for non-standard, stylized, or abstract visual content.

Limitations

  • May generate explicit, sensitive, or offensive content depending on the prompt and input image.
  • Not suitable for production environments requiring strict content filtering or moderation.
  • Output tone, style, and reasoning depth can vary based on phrasing and visual complexity.
  • Performance may vary on synthetic or highly abstract visuals.