PaSBench-Video: A Streaming Video Benchmark for Proactive Safety Warning

Between the first visible sign of danger and the moment an accident occurs, there is often a window where intervention remains possible. Video-capable multimodal large language models (MLLMs) could serve as always-on safety monitors that issue warnings during this window. Yet current benchmarks do not test this ability: they rely on static inputs, ignore timing precision, and omit false-positive measurement on safe scenes. We present PaSBench-Video, a 740-video benchmark with 481 risk and 259 no

Paper · arXiv

Authors: Yusong Zhao, Yuejin Xie, Youliang Yuan, Junjie Hu, Jitian Guo + 2 more
Published: 2026-06-01
Categories: cs.CL, cs.AI, cs.CV

via arXiv · 2606.02443