Entity

Closed-Loop Bidirectional Prompting for Adversarial Robustness of Vision Language Models

Vision Language Models adapt well to downstream tasks but are highly vulnerable to adversarial perturbations that disrupt cross-modal semantic alignment. Existing defenses are largely unidirectional or structural, failing to exploit bidirectional cross-modal complementarity and instance-wise adaptive protection. To overcome the limitations of unidirectional and static defenses in adversarial settings, we propose Closed-Loop Bidirectional Prompting, casting robust adaptation as cross-modal agreem

Paper · arXiv

cs.CV

Authors: Xiao Liu, Jiaxiang Liu, Boci Peng, Boren Hu, Yusong Wang + 4 more
Published: 2026-05-25

Abstract ↗

via arXiv · 2605.25922