TaobaoTmall-AlgorithmProducts/E-VAds-R1-Qwen3VL
VISIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:May 11, 2026License:mitArchitecture:Transformer0.0K Open Weights Cold
E-VAds-R1-Qwen3VL is an 8 billion parameter vision-language model developed by TaobaoTmall-AlgorithmProducts, based on the Qwen3-VL-8B-Instruct architecture. This model is specifically fine-tuned for advertising-related tasks, leveraging the E-VAds_Benchmark dataset. It supports both Chinese and English languages, making it suitable for multimodal applications in e-commerce advertising.
Loading preview...
E-VAds-R1-Qwen3VL: A Vision-Language Model for E-commerce Advertising
E-VAds-R1-Qwen3VL is an 8 billion parameter vision-language model developed by TaobaoTmall-AlgorithmProducts. Built upon the Qwen3-VL-8B-Instruct base model, it is specifically designed and fine-tuned for applications within the advertising domain, particularly in e-commerce.
Key Capabilities
- Multimodal Understanding: Processes and integrates information from both visual and textual inputs.
- Advertising-Specific Fine-tuning: Optimized using the proprietary E-VAds_Benchmark dataset, enhancing its performance on tasks relevant to advertising content analysis and generation.
- Multilingual Support: Capable of handling both Chinese (zh) and English (en) languages, catering to a broad user base in global e-commerce markets.
- Large Context Window: Inherits a 32768 token context length, allowing for processing of extensive visual and textual information.
Good For
- E-commerce Advertising: Ideal for tasks such as ad content generation, visual ad analysis, product recommendation based on visual cues, and understanding user intent from multimodal inputs in an advertising context.
- Multilingual Applications: Suitable for businesses operating in both Chinese and English-speaking markets requiring multimodal AI solutions.