Exploring feature sparsity for out-of-distribution detection.

Authors

Chen Qichao, Li Kuan, Chen Zhiyuan, Maul Tomas, Yin Jianping

Affiliations

School of Computer Science and Technology, Dongguan University of Technology, Dongguan, 523820, China.

School of Computer Science, University of Nottingham Malaysia, Selangor, 43500, Malaysia.

Publication

Sci Rep. 2024 Nov 18;14(1):28444. doi: 10.1038/s41598-024-79934-7.

DOI:10.1038/s41598-024-79934-7
PMID:39558072
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11574038/
Abstract

Out-of-distribution (OOD) detection is a crucial problem in practice, especially for the safe deployment of machine learning models in industrial settings. Previous work used free energy as a score function and proposed a fine-tuning method that uses OOD data in the training phase of the classification model, achieving higher performance on the OOD detection task than traditional methods. One key drawback, however, is that the loss function parameters depend heavily on the datasets involved, so the method cannot be dynamically adapted and applied in other settings; in other words, the generalization ability of the energy score is considerably limited. In this work, our point of departure is to enlarge the distinguishability between in-distribution features and OOD data. To this end, we present a simple yet effective sparsity-regularized (SR) tuning framework. The framework has two workflows, depending on whether external OOD data is available. This modification sharply reduces the complexity of the original training loss while improving adaptability and detection performance. We also contribute a mini dataset as a light and efficient alternative to the previous large-scale one. In the experiments, we verify the effectiveness of our framework on a wide range of typical datasets with common network architectures.
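
For readers unfamiliar with the terms, the free-energy score mentioned in the abstract is commonly defined as E(x) = -T * log(sum_i exp(f_i(x) / T)) over a classifier's logits f(x), with higher energy suggesting an OOD input. The sketch below computes that score with NumPy. The abstract does not state the form of the sparsity regularizer, so `l1_sparsity_penalty` is only an assumed, generic L1 penalty on feature activations, not the authors' loss; the function names, the threshold, and the temperature default are likewise illustrative.

```python
import numpy as np


def energy_score(logits, temperature=1.0):
    """Free energy per sample: E(x) = -T * log(sum_i exp(logit_i / T)).

    Lower (more negative) energy indicates a more confident,
    in-distribution-like prediction.
    """
    z = np.asarray(logits, dtype=float) / temperature
    # Subtract the row maximum before exponentiating for numerical stability.
    m = z.max(axis=-1, keepdims=True)
    log_sum_exp = m.squeeze(-1) + np.log(np.exp(z - m).sum(axis=-1))
    return -temperature * log_sum_exp


def l1_sparsity_penalty(features):
    """ASSUMPTION: a generic sparsity term, the mean L1 norm of the feature
    vectors. The paper's actual SR loss may differ."""
    f = np.asarray(features, dtype=float)
    return np.abs(f).sum(axis=-1).mean()


def is_ood(logits, threshold, temperature=1.0):
    """Flag samples whose energy exceeds a (hypothetical) threshold."""
    return energy_score(logits, temperature) > threshold


if __name__ == "__main__":
    confident = [[10.0, 0.0, 0.0]]  # peaked logits: low energy
    uncertain = [[0.0, 0.0, 0.0]]   # flat logits: higher energy
    print(energy_score(confident), energy_score(uncertain))
    print(is_ood(confident, threshold=-5.0), is_ood(uncertain, threshold=-5.0))
```

In practice the threshold would be chosen on held-out in-distribution data (for example, so that 95% of in-distribution samples fall below it), and a penalty such as the one above would be added to the classification loss during fine-tuning with a weighting coefficient.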

Similar articles

1. Exploring feature sparsity for out-of-distribution detection. Sci Rep. 2024 Nov 18;14(1):28444. doi: 10.1038/s41598-024-79934-7.
2. The impact of fine-tuning paradigms on unknown plant diseases recognition. Sci Rep. 2024 Aug 2;14(1):17900. doi: 10.1038/s41598-024-66958-2.
3. Enhancing Few-Shot Out-of-Distribution Detection With Pre-Trained Model Features. IEEE Trans Image Process. 2024;33:6309-6323. doi: 10.1109/TIP.2024.3468874. Epub 2024 Dec 27.
4. Investigation of out-of-distribution detection across various models and training methodologies. Neural Netw. 2024 Jul;175:106288. doi: 10.1016/j.neunet.2024.106288. Epub 2024 Apr 4.
5. Evaluation of Feature Selection Methods for Preserving Machine Learning Performance in the Presence of Temporal Dataset Shift in Clinical Medicine. Methods Inf Med. 2023 May;62(1-02):60-70. doi: 10.1055/s-0043-1762904. Epub 2023 Feb 22.
6. Unsupervised Out-of-Distribution Object Detection via PCA-Driven Dynamic Prototype Enhancement. IEEE Trans Image Process. 2024;33:2431-2446. doi: 10.1109/TIP.2024.3378464. Epub 2024 Mar 29.
7. Out-of-distribution detection with in-distribution voting using the medical example of chest x-ray classification. Med Phys. 2024 Apr;51(4):2721-2732. doi: 10.1002/mp.16790. Epub 2023 Oct 13.
8. WOOD: Wasserstein-Based Out-of-Distribution Detection. IEEE Trans Pattern Anal Mach Intell. 2024 Feb;46(2):944-956. doi: 10.1109/TPAMI.2023.3328883. Epub 2024 Jan 8.
9. Revealing the Distributional Vulnerability of Discriminators by Implicit Generators. IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8888-8901. doi: 10.1109/TPAMI.2022.3229318. Epub 2023 Jun 5.
10. Out-of-Distribution Detection Algorithms for Robust Insect Classification. Plant Phenomics. 2024 Apr 30;6:0170. doi: 10.34133/plantphenomics.0170. eCollection 2024.

References cited in this article

1. A new method based on YOLOv5 and multiscale data augmentation for visual inspection in substation. Sci Rep. 2024 Apr 23;14(1):9362. doi: 10.1038/s41598-024-60126-2.
2. Analyzing to discover origins of CNNs and ViT architectures in medical images. Sci Rep. 2024 Apr 16;14(1):8755. doi: 10.1038/s41598-024-58382-3.
3. Deep learning and data fusion to estimate surface soil moisture from multi-sensor satellite images. Sci Rep. 2023 Feb 8;13(1):2251. doi: 10.1038/s41598-023-28939-9.
4. Multiple Kernel k-Means with Incomplete Kernels. IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1191-1204. doi: 10.1109/TPAMI.2019.2892416. Epub 2019 Jan 14.
5. Places: A 10 Million Image Database for Scene Recognition. IEEE Trans Pattern Anal Mach Intell. 2018 Jun;40(6):1452-1464. doi: 10.1109/TPAMI.2017.2723009. Epub 2017 Jul 4.
6. 80 million tiny images: a large data set for nonparametric object and scene recognition. IEEE Trans Pattern Anal Mach Intell. 2008 Nov;30(11):1958-70. doi: 10.1109/TPAMI.2008.128.