sayghost123/qwen3vl-invoice-extractor
VISIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kPublished:Apr 18, 2026License:apache-2.0Architecture:Transformer Open Weights Cold
The sayghost123/qwen3vl-invoice-extractor is a 2 billion parameter Qwen3-VL model, fine-tuned by sayghost123 using Unsloth and Huggingface's TRL library. This model is specifically optimized for invoice extraction tasks, leveraging its vision-language capabilities. It processes inputs with a context length of 32768 tokens, making it suitable for detailed document analysis.
Loading preview...
Model Overview
The sayghost123/qwen3vl-invoice-extractor is a specialized 2 billion parameter Qwen3-VL model, developed by sayghost123. It has been fine-tuned for enhanced performance in specific applications, utilizing the Unsloth library for accelerated training and Huggingface's TRL library for efficient fine-tuning.
Key Capabilities
- Vision-Language Processing: As a Qwen3-VL variant, this model integrates both visual and textual understanding, making it adept at processing documents that combine images and text.
- Optimized for Invoice Extraction: The fine-tuning process has specifically targeted invoice data extraction, suggesting strong performance in identifying and extracting key information from invoice documents.
- Efficient Training: The model was trained significantly faster using Unsloth, a library known for speeding up large language model training.
- Large Context Window: With a context length of 32768 tokens, it can handle extensive document content, crucial for comprehensive invoice analysis.
Good For
- Automated Invoice Processing: Ideal for systems requiring automated extraction of data points like vendor names, amounts, dates, and line items from invoices.
- Document Understanding: Suitable for tasks involving the interpretation of structured and semi-structured visual documents where both text and layout are important.
- Research and Development: Provides a fine-tuned Qwen3-VL base for further experimentation or integration into custom document processing pipelines.