用于有限训练数据图像分类的切片瓦瑟斯坦空间中的不变性编码

Invariance encoding in sliced-Wasserstein space for image classification with limited training data.

作者信息

Shifat-E-Rabbi Mohammad, Zhuang Yan, Li Shiying, Rubaiyat Abu Hasnat Mohammad, Yin Xuwang, Rohde Gustavo K

机构信息

Imaging and Data Science Laboratory, University of Virginia, Charlottesville, VA 22908, USA.

Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA.

出版信息

Pattern Recognit. 2023 May;137. doi: 10.1016/j.patcog.2022.109268. Epub 2022 Dec 22.

DOI:10.1016/j.patcog.2022.109268

PMID:36713887

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9879373/

Abstract

Deep convolutional neural networks (CNNs) are broadly considered to be state-of-the-art generic end-to-end image classification systems. However, they are known to underperform when training data are limited and thus require data augmentation strategies that render the method computationally expensive and not always effective. Rather than using a data augmentation strategy to encode invariances as typically done in machine learning, here we propose to mathematically augment a nearest subspace classification model in sliced-Wasserstein space by exploiting certain mathematical properties of the Radon Cumulative Distribution Transform (R-CDT), a recently introduced image transform. We demonstrate that for a particular type of learning problem, our mathematical solution has advantages over data augmentation with deep CNNs in terms of classification accuracy and computational complexity, and is particularly effective under a limited training data setting. The method is simple, effective, computationally efficient, non-iterative, and requires no parameters to be tuned. Python code implementing our method is available at https://github.com/rohdelab/mathematical augmentation. Our method is integrated as a part of the software package PyTransKit, which is available at https://github.com/rohdelab/PyTransKit.

摘要

深度卷积神经网络（CNNs）被广泛认为是最先进的通用端到端图像分类系统。然而，众所周知，当训练数据有限时，它们的性能会下降，因此需要数据增强策略，这使得该方法在计算上成本高昂且并不总是有效。与通常在机器学习中使用数据增强策略来编码不变性不同，在这里，我们建议通过利用最近引入的图像变换——拉东累积分布变换（R-CDT）的某些数学特性，在切片瓦瑟斯坦空间中对最近子空间分类模型进行数学增强。我们证明，对于特定类型的学习问题，我们的数学解决方案在分类准确性和计算复杂度方面优于使用深度CNN进行数据增强，并且在训练数据有限的情况下特别有效。该方法简单、有效、计算效率高、非迭代，且无需调整参数。实现我们方法的Python代码可在https://github.com/rohdelab/mathematical augmentation获取。我们的方法作为软件包PyTransKit的一部分进行了集成，该软件包可在https://github.com/rohdelab/PyTransKit获取。

相似文献

Invariance encoding in sliced-Wasserstein space for image classification with limited training data.用于有限训练数据图像分类的切片瓦瑟斯坦空间中的不变性编码

Pattern Recognit. 2023 May;137. doi: 10.1016/j.patcog.2022.109268. Epub 2022 Dec 22.

Radon Cumulative Distribution Transform Subspace Modeling for Image Classification.用于图像分类的氡累积分布变换子空间建模

J Math Imaging Vis. 2021 Nov;63(9):1185-1203. doi: 10.1007/s10851-021-01052-0. Epub 2021 Aug 5.

End-to-End Signal Classification in Signed Cumulative Distribution Transform Space.符号累积分布变换空间中的端到端信号分类

IEEE Trans Pattern Anal Mach Intell. 2024 Sep;46(9):5936-5950. doi: 10.1109/TPAMI.2024.3372455. Epub 2024 Aug 6.

Study on Representation Invariances of CNNs and Human Visual Information Processing Based on Data Augmentation.基于数据增强的卷积神经网络表示不变性与人类视觉信息处理研究

Brain Sci. 2020 Sep 2;10(9):602. doi: 10.3390/brainsci10090602.

Improving Image-Based Plant Disease Classification With Generative Adversarial Network Under Limited Training Set.在有限训练集下利用生成对抗网络改进基于图像的植物病害分类

Front Plant Sci. 2020 Dec 4;11:583438. doi: 10.3389/fpls.2020.583438. eCollection 2020.

Combining weakly and strongly supervised learning improves strong supervision in Gleason pattern classification.弱监督和强监督学习的结合提高了 Gleason 模式分类中的强监督。

BMC Med Imaging. 2021 May 8;21(1):77. doi: 10.1186/s12880-021-00609-0.

A novel end-to-end classifier using domain transferred deep convolutional neural networks for biomedical images.一种使用域转移深度卷积神经网络的新型端到端生物医学图像分类器。

Comput Methods Programs Biomed. 2017 Mar;140:283-293. doi: 10.1016/j.cmpb.2016.12.019. Epub 2017 Jan 6.

Transport-based pattern recognition versus deep neural networks in underwater OAM communications.基于传输的模式识别与水下轨道角动量通信中的深度神经网络。

J Opt Soc Am A Opt Image Sci Vis. 2021 Jul 1;38(7):954-962. doi: 10.1364/JOSAA.412463.

Biomedical image augmentation using Augmentor.使用 Augmentor 进行生物医学图像增强。

Bioinformatics. 2019 Nov 1;35(21):4522-4524. doi: 10.1093/bioinformatics/btz259.

Learning-to-augment strategy using noisy and denoised data: Improving generalizability of deep CNN for the detection of COVID-19 in X-ray images.基于噪声和去噪数据的学习增强策略：提高深度卷积神经网络在 X 射线图像中 COVID-19 检测的泛化能力。

Comput Biol Med. 2021 Sep;136:104704. doi: 10.1016/j.compbiomed.2021.104704. Epub 2021 Jul 29.

引用本文的文献

Local Sliced Wasserstein Feature Sets for Illumination Invariant Face Recognition.用于光照不变人脸识别的局部切片瓦瑟斯坦特征集

Pattern Recognit. 2025 Jun;162. doi: 10.1016/j.patcog.2025.111381. Epub 2025 Jan 21.

Linear optimal transport subspaces for point set classification.用于点集分类的线性最优传输子空间

Res Sq. 2024 Mar 22:rs.3.rs-4106387. doi: 10.21203/rs.3.rs-4106387/v1.

End-to-End Signal Classification in Signed Cumulative Distribution Transform Space.符号累积分布变换空间中的端到端信号分类

IEEE Trans Pattern Anal Mach Intell. 2024 Sep;46(9):5936-5950. doi: 10.1109/TPAMI.2024.3372455. Epub 2024 Aug 6.

本文引用的文献

A Low-Cost High-Performance Data Augmentation for Deep Learning-Based Skin Lesion Classification.一种用于基于深度学习的皮肤病变分类的低成本高性能数据增强方法。

BME Front. 2022 Apr 26;2022:9765307. doi: 10.34133/2022/9765307. eCollection 2022.

Radon Cumulative Distribution Transform Subspace Modeling for Image Classification.用于图像分类的氡累积分布变换子空间建模

J Math Imaging Vis. 2021 Nov;63(9):1185-1203. doi: 10.1007/s10851-021-01052-0. Epub 2021 Aug 5.

Text Data Augmentation for Deep Learning.用于深度学习的文本数据增强

J Big Data. 2021;8(1):101. doi: 10.1186/s40537-021-00492-0. Epub 2021 Jul 19.

Inconsistent Performance of Deep Learning Models on Mammogram Classification.深度学习模型在乳房X光片分类中的性能不一致。

J Am Coll Radiol. 2020 Jun;17(6):796-803. doi: 10.1016/j.jacr.2020.01.006. Epub 2020 Feb 14.

Cell Image Classification: A Comparative Overview.细胞图像分类：比较综述。

Cytometry A. 2020 Apr;97(4):347-362. doi: 10.1002/cyto.a.23984. Epub 2020 Feb 10.

Biomedical image augmentation using Augmentor.使用 Augmentor 进行生物医学图像增强。

Bioinformatics. 2019 Nov 1;35(21):4522-4524. doi: 10.1093/bioinformatics/btz259.

Transport-based model for turbulence-corrupted imagery.基于传输的湍流干扰图像模型。

Appl Opt. 2018 Jun 1;57(16):4524-4536. doi: 10.1364/AO.57.004524.

Opportunities and obstacles for deep learning in biology and medicine.深度学习在生物学和医学中的机遇与挑战。

J R Soc Interface. 2018 Apr;15(141). doi: 10.1098/rsif.2017.0387.

Discovery and visualization of structural biomarkers from MRI using transport-based morphometry.利用基于传输的形态测量学发现和可视化 MRI 的结构生物标志物。

Neuroimage. 2018 Feb 15;167:256-275. doi: 10.1016/j.neuroimage.2017.11.006. Epub 2017 Nov 5.

Classification of amyloid status using machine learning with histograms of oriented 3D gradients.使用具有定向3D梯度直方图的机器学习对淀粉样蛋白状态进行分类。

Neuroimage Clin. 2016 May 10;12:990-1003. doi: 10.1016/j.nicl.2016.05.004. eCollection 2016.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。