GitMylo/Qwen3.5-9B-Uncensored-HauhauCS-Aggressive-safetensors

VISIONConcurrency Cost:1Model Size:9BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Mar 30, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

The Qwen3.5-9B model by Qwen is a 9 billion parameter multimodal causal language model with a vision encoder, offering a native context length of 262,144 tokens. It features a unified vision-language foundation, an efficient hybrid architecture with Gated Delta Networks and sparse Mixture-of-Experts, and scalable reinforcement learning for robust real-world adaptability. This model excels in multimodal reasoning, coding, agent tasks, and visual understanding, supporting 201 languages and dialects.

Loading preview...

Qwen3.5-9B: A Multimodal Agent Foundation Model

Qwen3.5-9B is a 9 billion parameter multimodal causal language model developed by Qwen, designed for exceptional utility and performance across various tasks. It integrates advancements in multimodal learning, architectural efficiency, and reinforcement learning, supporting a native context length of 262,144 tokens, extensible up to 1,010,000 tokens with YaRN scaling.

Key Capabilities

  • Unified Vision-Language Foundation: Achieves strong performance in reasoning, coding, agent tasks, and visual understanding through early fusion training on multimodal tokens.
  • Efficient Hybrid Architecture: Utilizes Gated Delta Networks and sparse Mixture-of-Experts for high-throughput inference with minimal latency.
  • Scalable RL Generalization: Trained with reinforcement learning across million-agent environments for robust real-world adaptability.
  • Global Linguistic Coverage: Supports 201 languages and dialects, enabling inclusive worldwide deployment.
  • Multimodal Input: Capable of processing text, image, and video inputs, making it versatile for complex multimodal queries.
  • Tool Calling: Excels in tool calling capabilities, recommended for agent applications via Qwen-Agent or Qwen Code.

Good For

  • Multimodal Applications: Ideal for tasks requiring understanding and generation across text, images, and videos, such as visual question answering, document understanding, and spatial intelligence.
  • Long Context Processing: Suitable for applications needing to process and generate responses for ultra-long texts, with native support for 262,144 tokens and extensible context.
  • Agentic Workflows: Highly effective for building AI agents that require tool use and complex task execution, particularly with frameworks like Qwen-Agent and Qwen Code.
  • Global Deployments: Its extensive multilingual support makes it suitable for applications targeting diverse linguistic and cultural contexts.
  • Reasoning and Coding: Demonstrates strong performance in reasoning and coding benchmarks, making it valuable for development and problem-solving tasks.