toandev/donglao-gemma-3-4b-it-vi

VISIONConcurrency Cost:1Model Size:4.3BQuant:BF16Ctx Length:32kLicense:apache-2.0Architecture:Transformer0.0K Open Weights Cold

The toandev/donglao-gemma-3-4b-it-vi model is a 4.3 billion parameter instruction-tuned language model based on Google's Gemma 3 4B Instruct architecture. It has been specifically fine-tuned for Vietnamese language tasks, leveraging the Viet-ShareGPT-4o-Text-VQA dataset. This model excels at Vietnamese text generation, question answering, and conversational applications, supporting a context length of 32768 tokens.

Loading preview...

Overview

The toandev/donglao-gemma-3-4b-it-vi model is a specialized large language model (LLM) built upon Google's Gemma 3 4B Instruct architecture. With 4.3 billion parameters and a substantial context length of 32768 tokens, this model is primarily distinguished by its fine-tuning for the Vietnamese language.

Key Capabilities

  • Vietnamese Language Proficiency: Optimized for understanding and generating text in Vietnamese.
  • Instruction Following: Inherits instruction-tuned capabilities from its base Gemma 3 4B Instruct model.
  • Multimodal Data Training: Fine-tuned using the 5CD-AI/Viet-ShareGPT-4o-Text-VQA dataset, which includes Vietnamese text and visual question answering data.

Good For

  • Vietnamese Text Generation: Creating coherent and contextually relevant Vietnamese text.
  • Vietnamese Question Answering: Responding to queries in Vietnamese, potentially leveraging its training on VQA data.
  • Vietnamese Conversational AI: Developing chatbots or conversational agents that interact effectively in Vietnamese.