ONTHEIT/BizOnAI-OCR

VISIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Apr 14, 2026License:openrailArchitecture:Transformer0.0K Open Weights Cold

ONTHEIT/BizOnAI-OCR is an 8 billion parameter OCR model, built on Qwen3-VL-8B, specifically optimized for processing complex Korean industrial documents. It excels at extracting structured information from diverse layouts, including contracts, medical records, and financial forms, handling mixed Korean, English, and Chinese text. The model provides structured markdown output, preserving tables and formatting, and maintains competitive performance on English OCR benchmarks. It is designed for efficient deployment via vLLM or transformers for real-world Korean document processing applications.

Loading preview...

BizOnAI-OCR: Korean-Optimized Industrial Document OCR

BizOnAI-OCR, developed by ONTHEIT, is an 8 billion parameter optical character recognition (OCR) model based on Qwen3-VL-8B. Its primary focus is on accurately extracting information from a wide range of Korean industrial documents, such as contracts, medical records, and government paperwork. The model is specifically fine-tuned to handle the unique challenges of Korean document layouts, including decorative spacing, vertical tables, and mixed language content (Korean, English, Chinese).

Key Capabilities

  • Korean-first Optimization: Fine-tuned extensively on real-world Korean industrial documents for superior performance in this domain.
  • Bilingual Proficiency: While optimized for Korean, it maintains strong performance on English OCR tasks, as demonstrated by benchmarks.
  • Structured Markdown Output: Generates output in markdown format, preserving document structure, including tables, headings, and other formatting elements.
  • Efficient Deployment: Ready for efficient serving via vLLM (with an OpenAI-compatible API) or standard transformers library.

Performance Highlights

BizOnAI-OCR achieves an 83.0% overall score on KDoc-OCRBench, a challenging benchmark for Korean industrial PDFs, outperforming other models like olmOCR v0.2.0 and PaddleOCR-VL. It also demonstrates robust performance on the English olmOCR-bench, scoring 82.4% overall, indicating its strong bilingual capabilities.

Good for

  • Automating data extraction from Korean contracts, invoices, and legal documents.
  • Processing medical records and financial forms in Korean.
  • Applications requiring structured text output from complex, multi-lingual documents.