• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

稳健主成分分析:均值中位数方法

Robust Principal Component Analysis: A Median of Means Approach.

作者信息

Paul Debolina, Chakraborty Saptarshi, Das Swagatam

出版信息

IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16788-16800. doi: 10.1109/TNNLS.2023.3298011. Epub 2024 Oct 29.

DOI:10.1109/TNNLS.2023.3298011
PMID:37549093
Abstract

Principal component analysis (PCA) is a fundamental tool for data visualization, denoising, and dimensionality reduction. It is widely popular in statistics, machine learning, computer vision, and related fields. However, PCA is well-known to fall prey to outliers and often fails to detect the true underlying low-dimensional structure within the dataset. Following the Median of Means (MoM) philosophy, recent supervised learning methods have shown great success in dealing with outlying observations without much compromise to their large sample theoretical properties. This article proposes a PCA procedure based on the MoM principle. Called the MoMPCA, the proposed method is not only computationally appealing but also achieves optimal convergence rates under minimal assumptions. In particular, we explore the nonasymptotic error bounds of the obtained solution via the aid of the Rademacher complexities while granting absolutely no assumption on the outlying observations. The derived concentration results are not dependent on the dimension because the analysis is conducted in a separable Hilbert space, and the results only depend on the fourth moment of the underlying distribution in the corresponding norm. The proposal's efficacy is also thoroughly showcased through simulations and real data applications.

摘要

主成分分析(PCA)是用于数据可视化、去噪和降维的一种基本工具。它在统计学、机器学习、计算机视觉及相关领域广泛流行。然而,众所周知,PCA容易受到异常值的影响,并且常常无法检测数据集中真正潜在的低维结构。遵循均值中位数(MoM)理念,最近的监督学习方法在处理异常观测值方面取得了巨大成功,同时对其大样本理论性质没有太大影响。本文提出了一种基于MoM原理的PCA方法。所提出的方法称为MoMPCA,不仅在计算上具有吸引力,而且在最小假设下实现了最优收敛速度。特别是,我们借助拉德马赫复杂度探索了所得解的非渐近误差界,同时对异常观测值完全不做任何假设。推导得到的集中结果不依赖于维度,因为分析是在可分希尔伯特空间中进行的,结果仅取决于相应范数下基础分布的四阶矩。通过模拟和实际数据应用也充分展示了该方法的有效性。

相似文献

1
Robust Principal Component Analysis: A Median of Means Approach.稳健主成分分析:均值中位数方法
IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16788-16800. doi: 10.1109/TNNLS.2023.3298011. Epub 2024 Oct 29.
2
Principal Component Analysis based on Nuclear norm Minimization.基于核范数最小化的主成分分析。
Neural Netw. 2019 Oct;118:1-16. doi: 10.1016/j.neunet.2019.05.020. Epub 2019 Jun 8.
3
Reinforced Robust Principal Component Pursuit.增强鲁棒主成分 Pursuit。
IEEE Trans Neural Netw Learn Syst. 2018 May;29(5):1525-1538. doi: 10.1109/TNNLS.2017.2671849. Epub 2017 Mar 14.
4
Non-Greedy L21-Norm Maximization for Principal Component Analysis.用于主成分分析的非贪婪L21范数最大化
IEEE Trans Image Process. 2021;30:5277-5286. doi: 10.1109/TIP.2021.3073282. Epub 2021 Jun 2.
5
Implicit Annealing in Kernel Spaces: A Strongly Consistent Clustering Approach.核空间中的隐式退火:一种强一致性聚类方法。
IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):5862-5871. doi: 10.1109/TPAMI.2022.3217137. Epub 2023 Apr 3.
6
Avoiding Optimal Mean ℓ-Norm Maximization-Based Robust PCA for Reconstruction.避免基于最优平均ℓ范数最大化的鲁棒主成分分析进行重建。
Neural Comput. 2017 Apr;29(4):1124-1150. doi: 10.1162/NECO_a_00937. Epub 2017 Jan 17.
7
Modal Principal Component Analysis.模态主成分分析。
Neural Comput. 2020 Oct;32(10):1901-1935. doi: 10.1162/neco_a_01308. Epub 2020 Aug 14.
8
A Pure L1-norm Principal Component Analysis.一种纯L1范数主成分分析
Comput Stat Data Anal. 2013 May 1;61:83-98. doi: 10.1016/j.csda.2012.11.007.
9
Discrete Robust Principal Component Analysis via Binary Weights Self-Learning.通过二元权重自学习的离散鲁棒主成分分析
IEEE Trans Neural Netw Learn Syst. 2023 Nov;34(11):9064-9077. doi: 10.1109/TNNLS.2022.3155607. Epub 2023 Oct 27.
10
SNR-enhanced diffusion MRI with structure-preserving low-rank denoising in reproducing kernel Hilbert spaces.基于再生核希尔伯特空间结构保持低秩去噪的 SNR 增强扩散磁共振成像。
Magn Reson Med. 2021 Sep;86(3):1614-1632. doi: 10.1002/mrm.28752. Epub 2021 Apr 8.

引用本文的文献

1
Applications of machine learning-assisted extracellular vesicles analysis technology in tumor diagnosis.机器学习辅助的细胞外囊泡分析技术在肿瘤诊断中的应用
Comput Struct Biotechnol J. 2025 Jun 6;27:2460-2472. doi: 10.1016/j.csbj.2025.06.014. eCollection 2025.
2
Reduction of Spike-like Noise in Clinical Practice for Thoracic Electrical Impedance Tomography Using Robust Principal Component Analysis.使用稳健主成分分析减少胸部电阻抗断层扫描临床实践中的尖峰状噪声
Bioengineering (Basel). 2025 Apr 9;12(4):402. doi: 10.3390/bioengineering12040402.