longvideoagent/longvideoagent-qwen2.5-7b
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Mar 22, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

The longvideoagent/longvideoagent-qwen2.5-7b is a 7.6 billion parameter Qwen2.5-7B-based language model checkpoint specifically fine-tuned for long-video question answering within the LongVideoAgent multi-agent framework. Developed by the LongVideoAgent project team, this model excels at reasoning over extended video content by integrating with specialized agents for planning, temporal grounding, and visual evidence extraction. It demonstrates a 6.66 percentage point improvement in LongTVQA+ accuracy over the Qwen2.5-7B-Instruct baseline, making it suitable for research and reproduction of long-video QA experiments.

Loading preview...