unicorn-team/Unicorn-VL-R3
Unicorn-team's Unicorn-VL-R3 is an 8 billion parameter vision-language model, fine-tuned from Qwen3-VL-8B, specifically optimized for Vietnamese educational tasks. It leverages a unique synthetic data generation strategy from Gemini APIs and filtered public datasets to enhance reasoning and problem-solving capabilities in academic contexts. The model achieves a VMLU score of 74.87, making it suitable for applications requiring robust performance on Vietnamese academic benchmarks.
Loading preview...
Overview
Unicorn-VL-R3 is an 8 billion parameter vision-language model developed by unicorn-team, fine-tuned from the Qwen3-VL-8B base model. It is specifically designed and optimized for Vietnamese educational tasks, demonstrating strong performance on academic benchmarks.
Key Capabilities
- Specialized Training Data: Utilizes a unique synthetic data generation process, creating approximately 4,800 samples of question-response pairs with simulated thinking chains (synthetic CoT) based on 12 educational tasks from Gemini APIs. This approach aims to optimize the model's ability to solve school-related problems with logical reasoning.
- Vietnamese Language Focus: Incorporates about 5,000 high-quality Vietnamese samples filtered from the Dolci-SFT dataset, ensuring strong performance in the Vietnamese language context.
- Vision-Language Integration: As a VL model, it is capable of processing both visual and textual inputs, making it suitable for multi-modal educational content.
- Performance on VMLU: Achieves a VMLU score of 74.87, outperforming its base model Qwen3-VL-8B (74.10) and other Unicorn-R3 variants.
- Optimized for Reasoning: The synthetic CoT generation strategy is designed to improve the model's reasoning and problem-solving skills, particularly for academic questions.
Good For
- Applications requiring a robust Vietnamese vision-language model for educational content.
- Tasks involving academic problem-solving and reasoning in Vietnamese.
- Use cases where strong performance on the VMLU benchmark is critical.