WiSE-OD: Benchmarking Robustness in Infrared Object Detection

cs.AI updates on arXiv.org 07月28日 12:42

WiSE-OD: Benchmarking Robustness in Infrared Object Detection

文章介绍了一种解决红外图像检测难题的新方法，提出LLVIP-C和FLIR-C跨模态ood基准，以及WiSE-OD权重空间集成方法，旨在提升模型鲁棒性和准确度。

arXiv:2507.18925v1 Announce Type: cross Abstract: Object detection (OD) in infrared (IR) imagery is critical for low-light and nighttime applications. However, the scarcity of large-scale IR datasets forces models to rely on weights pre-trained on RGB images. While fine-tuning on IR improves accuracy, it often compromises robustness under distribution shifts due to the inherent modality gap between RGB and IR. To address this, we introduce LLVIP-C and FLIR-C, two cross-modality out-of-distribution (OOD) benchmarks built by applying corruption to standard IR datasets. Additionally, to fully leverage the complementary knowledge from RGB and infrared trained models, we propose WiSE-OD, a weight-space ensembling method with two variants: WiSE-OD${ZS}$, which combines RGB zero-shot and IR fine-tuned weights, and WiSE-OD${LP}$, which blends zero-shot and linear probing. Evaluated across three RGB-pretrained detectors and two robust baselines, WiSE-OD improves both cross-modality and corruption robustness without any additional training or inference cost.

Fish AI Reader

AI辅助创作，多种专业模板，深度分析，高质量内容生成。从观点提取到深度思考，FishAI为您提供全方位的创作支持。新版本引入自定义参数，让您的创作更加个性化和精准。

FishAI

鱼阅，AI 时代的下一个智能信息助手，助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

红外图像检测 LLVIP-C FLIR-C WiSE-OD 跨模态

相关文章

Crossing Modalities: The Innovative Artificial Intelligence Approach to Jailbreaking LLMs with Visual Cues

Microsoft Released LLM2CLIP: A New AI Technique in which a LLM Acts as a Teacher for CLIP’s Visual Encoder

在线试玩 | 对齐、生成效果大增，文本驱动的风格转换迎来进阶版

2024.12.20 每日AI论文 | 数据扩增提升LLMs性能，多模态推理框架创新突破

行人、车辆、动物等ReID最新综述！武大等全面总结Transformer方法 | IJCV 2024

行人、车辆、动物等ReID最新综述，武大等全面总结Transformer方法

行人、车辆、动物等ReID最新综述！武大等全面总结Transformer方法 | IJCV 2024

港科大开源VideoVAE+，视频重建质量全面超越最新模型

英语才是AI的母语？科学家发现模型的多模态推理全靠它

只给一张图，AI找到对应合适BGM，央音清华等构建全球化音乐信息检索新范式