The AI45Research/AgentDoG-Qwen3-4B is a 4 billion parameter model from the Qwen3 family, developed by AI45Research. It functions as a risk-aware evaluation and guarding framework for autonomous agents, specializing in trajectory-level risk assessment. This model identifies safety risks within an agent's execution trace, providing fine-grained diagnoses of risk sources, failure modes, and real-world harms. It excels at monitoring multi-step agent executions and diagnosing root causes of unsafe behavior, outperforming existing approaches on benchmarks like R-Judge, ASSE-Safety, and ATBench.
No reviews yet. Be the first to review!