cs.AI updates on arXiv.org · 15 hours ago
Adversarial Attacks on VQA-NLE: Exposing and Alleviating Inconsistencies in Visual Question Answering Explanations

This paper exposes transparency and reliability problems in natural-language-explanation systems for visual question answering (VQA-NLE), proposes adversarial attack strategies along with a knowledge-based mitigation method, and thereby strengthens model robustness.

arXiv:2508.12430v1 Announce Type: cross

Abstract: Natural language explanations in visual question answering (VQA-NLE) aim to make black-box models more transparent by elucidating their decision-making processes. However, we find that existing VQA-NLE systems can produce inconsistent explanations and reach conclusions without genuinely understanding the underlying context, exposing weaknesses in either their inference pipeline or explanation-generation mechanism. To highlight these vulnerabilities, we not only leverage an existing adversarial strategy to perturb questions but also propose a novel strategy that minimally alters images to induce contradictory or spurious outputs. We further introduce a mitigation method that leverages external knowledge to alleviate these inconsistencies, thereby bolstering model robustness. Extensive evaluations on two standard benchmarks and two widely used VQA-NLE models underscore the effectiveness of our attacks and the potential of knowledge-based defenses, ultimately revealing pressing security and reliability concerns in current VQA-NLE systems.
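The image-side attack described in the abstract suggests a gradient-based minimal perturbation. The sketch below is a rough illustration of that idea, not the paper's actual method: it assumes a hypothetical PyTorch VQA model with a `model(image, question)` interface and shows how a single bounded signed-gradient step can push the model toward a chosen wrong answer while leaving the image visually almost unchanged.

```python
# Minimal sketch of a bounded image perturbation against a VQA model,
# in the spirit of the abstract's attack that "minimally alters images".
# The model interface, tensor shapes, and loss are hypothetical placeholders.
import torch
import torch.nn.functional as F

def perturb_image(model, image, question, wrong_answer_id, epsilon=2 / 255):
    """One FGSM-style step: move each pixel by at most `epsilon` in the
    direction that makes `wrong_answer_id` more likely."""
    image = image.clone().detach().requires_grad_(True)
    logits = model(image, question)  # hypothetical forward pass -> (1, num_answers)
    loss = F.cross_entropy(logits, torch.tensor([wrong_answer_id]))
    loss.backward()
    # Descend the loss toward the wrong answer; clamp to a valid pixel range.
    adv = image - epsilon * image.grad.sign()
    return adv.clamp(0.0, 1.0).detach()
```

An inconsistency can then be flagged by comparing the explanations the model generates for the clean and perturbed images: if a visually negligible change flips the answer or produces a contradictory rationale, the explanation pipeline is not genuinely grounded in the image content.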


Related tags

Visual Question Answering · Natural Language Explanation · Model Reliability