twnlp/ChineseErrorCorrector3-4B

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Jun 20, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

The twnlp/ChineseErrorCorrector3-4B is a 4 billion parameter causal language model developed by TW-NLP, based on the Qwen3-4B architecture. It is specifically fine-tuned for comprehensive Chinese text error correction, covering both spelling and grammar. Trained on 2 million correction data points, this model demonstrates strong performance in identifying and rectifying errors in Chinese text, making it suitable for applications requiring high-accuracy linguistic correction.

Loading preview...

Overview

The twnlp/ChineseErrorCorrector3-4B is a 4 billion parameter model developed by TW-NLP, designed for comprehensive Chinese text error correction. It is built upon the Qwen3-4B base model and has been extensively trained on 2 million Chinese error correction data points. This model addresses both spelling correction and grammar correction, aiming to provide a robust solution for improving the accuracy and fluency of Chinese text.

Key Capabilities

  • Dual Correction Focus: Excels in correcting both grammatical errors and spelling mistakes within Chinese text.
  • High Performance: Achieves an average score of 0.8521 on the NaCGEC Data benchmark, outperforming other models like Qwen2.5-7B-CTC and ChatGLM3-6B-CSC in overall performance.
  • Specialized Training: Benefits from full-volume training on a large dataset of 2 million correction examples, optimizing its ability to identify and rectify a wide range of errors.

Good for

  • Chinese Text Quality Improvement: Ideal for applications requiring high-accuracy correction of written Chinese, such as content creation, academic writing, or communication platforms.
  • Automated Proofreading: Can be integrated into systems for automated proofreading and editing of Chinese documents.
  • Research and Development: Serves as a strong baseline or component for further research in Chinese natural language processing and error correction tasks.