OOD-CV-v2：自然图像中个体干扰因素的分布外偏移鲁棒性扩展基准。

OOD-CV-v2 : An Extended Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images.

作者信息

Zhao Bingchen, Wang Jiahao, Ma Wufei, Jesslen Artur, Yang Siwei, Yu Shaozuo, Zendel Oliver, Theobalt Christian, Yuille Alan L, Kortylewski Adam

出版信息

IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):11104-11118. doi: 10.1109/TPAMI.2024.3462293. Epub 2024 Nov 6.

DOI:10.1109/TPAMI.2024.3462293

PMID:39288047

Abstract

Enhancing the robustness of vision algorithms in real-world scenarios is challenging. One reason is that existing robustness benchmarks are limited, as they either rely on synthetic data or ignore the effects of individual nuisance factors. We introduce OOD-CV-v2, a benchmark dataset that includes out-of-distribution examples of 10 object categories in terms of pose, shape, texture, context and the weather conditions, and enables benchmarking of models for image classification, object detection, and 3D pose estimation. In addition to this novel dataset, we contribute extensive experiments using popular baseline methods, which reveal that: 1) Some nuisance factors have a much stronger negative effect on the performance compared to others, also depending on the vision task. 2) Current approaches to enhance robustness have only marginal effects, and can even reduce robustness. 3) We do not observe significant differences between convolutional and transformer architectures. We believe our dataset provides a rich test bed to study robustness and will help push forward research in this area.

摘要

增强视觉算法在现实场景中的鲁棒性具有挑战性。一个原因是现有的鲁棒性基准测试存在局限性，因为它们要么依赖合成数据，要么忽略了个别干扰因素的影响。我们引入了OOD-CV-v2，这是一个基准数据集，包含10个物体类别的分布外示例，涉及姿态、形状、纹理、背景和天气条件，并能够对图像分类、目标检测和3D姿态估计模型进行基准测试。除了这个新颖的数据集，我们还使用流行的基线方法进行了广泛的实验，结果表明：1）一些干扰因素对性能的负面影响比其他因素更强，这也取决于视觉任务。2）当前增强鲁棒性的方法只有边际效应，甚至可能降低鲁棒性。3）我们没有观察到卷积架构和Transformer架构之间的显著差异。我们相信我们的数据集为研究鲁棒性提供了一个丰富的测试平台，并将有助于推动该领域的研究。

相似文献

OOD-CV-v2 : An Extended Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images.OOD-CV-v2：自然图像中个体干扰因素的分布外偏移鲁棒性扩展基准。

IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):11104-11118. doi: 10.1109/TPAMI.2024.3462293. Epub 2024 Nov 6.

ROOD-MRI: Benchmarking the robustness of deep learning segmentation models to out-of-distribution and corrupted data in MRI.R O O D-MRI：基准测试深度学习分割模型对 MRI 中分布外和损坏数据的鲁棒性。

Neuroimage. 2023 Sep;278:120289. doi: 10.1016/j.neuroimage.2023.120289. Epub 2023 Jul 24.

A benchmark for neural network robustness in skin cancer classification.用于皮肤癌分类的神经网络鲁棒性基准。

Eur J Cancer. 2021 Sep;155:191-199. doi: 10.1016/j.ejca.2021.06.047. Epub 2021 Aug 11.

Robustness to Spurious Correlations Improves Semantic Out-of-Distribution Detection.对虚假相关性的鲁棒性可改善语义分布外检测。

Proc AAAI Conf Artif Intell. 2023 Jun 27;37(12):15305-15312. doi: 10.1609/aaai.v37i12.26785.

Does your dermatology classifier know what it doesn't know? Detecting the long-tail of unseen conditions.你的皮肤科分类器知道它所不知道的东西吗？检测罕见病症的长尾现象。

Med Image Anal. 2022 Jan;75:102274. doi: 10.1016/j.media.2021.102274. Epub 2021 Oct 20.

Identity-Preserved Human Posture Detection in Infrared Thermal Images: A Benchmark.身份保留的人体红外热图像姿态检测：基准

Sensors (Basel). 2022 Dec 22;23(1):92. doi: 10.3390/s23010092.

MOOD 2020: A Public Benchmark for Out-of-Distribution Detection and Localization on Medical Images.MOOD 2020：医学图像上的分布外检测和定位的公共基准。

IEEE Trans Med Imaging. 2022 Oct;41(10):2728-2738. doi: 10.1109/TMI.2022.3170077. Epub 2022 Sep 30.

Out-of-distribution detection with in-distribution voting using the medical example of chest x-ray classification.使用分布内投票进行分布外检测，以胸部 X 射线分类为例。

Med Phys. 2024 Apr;51(4):2721-2732. doi: 10.1002/mp.16790. Epub 2023 Oct 13.

RobotP: A Benchmark Dataset for 6D Object Pose Estimation.RobotP：用于6D物体姿态估计的基准数据集。

Sensors (Basel). 2021 Feb 11;21(4):1299. doi: 10.3390/s21041299.

Investigation of out-of-distribution detection across various models and training methodologies.跨多种模型和训练方法的分布外检测研究。

Neural Netw. 2024 Jul;175:106288. doi: 10.1016/j.neunet.2024.106288. Epub 2024 Apr 4.

引用本文的文献

An evaluation methodology for machine learning-based tandem mass spectra similarity prediction.一种基于机器学习的串联质谱相似性预测评估方法。

BMC Bioinformatics. 2025 Jul 11;26(1):174. doi: 10.1186/s12859-025-06194-1.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

OOD-CV-v2：自然图像中个体干扰因素的分布外偏移鲁棒性扩展基准。

OOD-CV-v2 : An Extended Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images.

作者信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献