CIawevy/TextPecker-8B-Qwen3VL

VISIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Jan 9, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

TextPecker-8B-Qwen3VL is an 8 billion parameter multimodal large language model (MLLM) developed by Hanshen Zhu et al., built upon the Qwen3-VL-8B-Instruct architecture with a 32768 token context length. It is specifically designed to perceive and quantify structural anomalies like distortion, blurriness, and misalignment in visual text rendering. This model functions as a plug-and-play evaluator and reward signal for reinforcement learning, enhancing the generation of structurally faithful visual text.

Loading preview...

TextPecker-8B-Qwen3VL: Visual Text Rendering Evaluator

TextPecker-8B-Qwen3VL is a multimodal large language model (MLLM) developed by Hanshen Zhu et al., specifically engineered to address the challenges of structural anomalies in visual text rendering (VTR). Built on the Qwen3-VL-8B-Instruct architecture, this 8 billion parameter model with a 32768 token context length excels where standard MLLMs and OCR models fall short in perceiving issues like distortion, blurriness, and misalignment in generated text.

Key Capabilities

  • Structural Anomaly Quantification: Identifies and measures visual text imperfections such as distortion, blurriness, and misalignment.
  • Reward Modeling: Provides crucial reward signals for Reinforcement Learning (RL)-based optimization, enabling text generators (e.g., Flux, SD3.5) to produce more structurally faithful visual text.
  • Evaluation: Can be used to assess the text structural quality and semantic consistency in text-to-image generation or editing tasks.

Use Cases

This model is particularly valuable for developers working on:

  • Improving the quality of text generated within images.
  • Developing or fine-tuning text-to-image models to reduce visual text artifacts.
  • Evaluating the fidelity of visual text rendering in various applications.

For deployment and evaluation, refer to the official TextPecker repository and its associated paper.