longvideoagent/longvideoagent-qwen3-4b
Text Generation | Concurrency Cost: 1 | Model Size: 4B | Quant: BF16 | Ctx Length: 32k | Published: Apr 3, 2026 | License: apache-2.0 | Architecture: Transformer | Open Weights | Cold

LongVideoAgent Qwen3-4B is a 4-billion-parameter language model checkpoint based on Qwen3, developed for long-video question answering within the LongVideoAgent multi-agent framework. It is designed to decompose and reason over complex long-video content. The model achieves 72% accuracy on the LongTVQA+ test set, a result reported as comparable to larger closed-source models on specialized video reasoning tasks. It has a stated native context length of 262,144 tokens, suited to long video transcripts and related data, while this listing's metadata gives a context length of 32k.
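Because a long-video transcript can exceed the context window a deployment actually serves, it may help to split it before prompting. The following is a minimal sketch, not part of LongVideoAgent itself: `estimate_tokens`, `chunk_transcript`, and `RESERVED_TOKENS` are hypothetical names, the 4-characters-per-token ratio is a rough rule of thumb rather than the Qwen3 tokenizer, and the budget assumes the 32k context length from the listing metadata, read as 32 * 1024 tokens.

```python
from typing import Iterable, List

# Assumption: the 32k context length from the listing metadata, read as 32 * 1024 tokens.
CONTEXT_TOKENS = 32 * 1024
# Hypothetical reserve for the question, instructions, and the generated answer.
RESERVED_TOKENS = 2048


def estimate_tokens(text: str) -> int:
    """Crude estimate: about 4 characters per token (a rule of thumb, not the real tokenizer)."""
    return max(1, (len(text) + 3) // 4)


def chunk_transcript(
    lines: Iterable[str],
    budget: int = CONTEXT_TOKENS - RESERVED_TOKENS,
) -> List[List[str]]:
    """Greedily pack consecutive transcript lines into chunks of at most `budget` estimated tokens."""
    chunks: List[List[str]] = []
    current: List[str] = []
    used = 0
    for line in lines:
        cost = estimate_tokens(line)
        # Close the current chunk when the next line would overflow the budget.
        if current and used + cost > budget:
            chunks.append(current)
            current, used = [], 0
        # A single line larger than the budget still gets a chunk of its own.
        current.append(line)
        used += cost
    if current:
        chunks.append(current)
    return chunks
```

Each chunk could then be sent as a separate prompt, with the per-chunk answers combined afterwards. Replacing `estimate_tokens` with a count from the model's actual tokenizer would give exact budgets.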
