sayghost123/qwen3vl-invoice-extractor

VISIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kPublished:Apr 18, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The sayghost123/qwen3vl-invoice-extractor is a 2 billion parameter Qwen3-VL model, fine-tuned by sayghost123 using Unsloth and Huggingface's TRL library. This model is specifically optimized for invoice extraction tasks, leveraging its vision-language capabilities. It processes inputs with a context length of 32768 tokens, making it suitable for detailed document analysis.

Loading preview...

Model Overview

The sayghost123/qwen3vl-invoice-extractor is a specialized 2 billion parameter Qwen3-VL model, developed by sayghost123. It has been fine-tuned for enhanced performance in specific applications, utilizing the Unsloth library for accelerated training and Huggingface's TRL library for efficient fine-tuning.

Key Capabilities

  • Vision-Language Processing: As a Qwen3-VL variant, this model integrates both visual and textual understanding, making it adept at processing documents that combine images and text.
  • Optimized for Invoice Extraction: The fine-tuning process has specifically targeted invoice data extraction, suggesting strong performance in identifying and extracting key information from invoice documents.
  • Efficient Training: The model was trained significantly faster using Unsloth, a library known for speeding up large language model training.
  • Large Context Window: With a context length of 32768 tokens, it can handle extensive document content, crucial for comprehensive invoice analysis.

Good For

  • Automated Invoice Processing: Ideal for systems requiring automated extraction of data points like vendor names, amounts, dates, and line items from invoices.
  • Document Understanding: Suitable for tasks involving the interpretation of structured and semi-structured visual documents where both text and layout are important.
  • Research and Development: Provides a fine-tuned Qwen3-VL base for further experimentation or integration into custom document processing pipelines.