• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于指导多组学数据处理中降维和数据融合的本征维分析。

Intrinsic-dimension analysis for guiding dimensionality reduction and data fusion in multi-omics data processing.

作者信息

Gliozzo Jessica, Soto-Gomez Mauricio, Guarino Valentina, Bonometti Arturo, Cabri Alberto, Cavalleri Emanuele, Reese Justin, Robinson Peter N, Mesiti Marco, Valentini Giorgio, Casiraghi Elena

机构信息

AnacletoLab, Computer Science Department, Università degli Studi di Milano, Milan, Italy; European Commission, Joint Research Centre (JRC), Ispra, Italy.

AnacletoLab, Computer Science Department, Università degli Studi di Milano, Milan, Italy.

出版信息

Artif Intell Med. 2025 Feb;160:103049. doi: 10.1016/j.artmed.2024.103049. Epub 2024 Dec 11.

DOI:10.1016/j.artmed.2024.103049
PMID:39673960
Abstract

Multi-omics data have revolutionized biomedical research by providing a comprehensive understanding of biological systems and the molecular mechanisms of disease development. However, analyzing multi-omics data is challenging due to high dimensionality and limited sample sizes, necessitating proper data-reduction pipelines to ensure reliable analyses. Additionally, its multimodal nature requires effective data-integration pipelines. While several dimensionality reduction and data fusion algorithms have been proposed, crucial aspects are often overlooked. Specifically, the choice of projection space dimension is typically heuristic and uniformly applied across all omics, neglecting the unique high dimension small sample size challenges faced by individual omics. This paper introduces a novel multi-modal dimensionality reduction pipeline tailored to individual views. By leveraging intrinsic dimensionality estimators, we assess the curse-of-dimensionality impact on each view and propose a two-step reduction strategy for significantly affected views, combining feature selection with feature extraction. Compared to traditional uniform reduction pipelines in a crucial and supervised multi-omics analysis setting, our approach shows significant improvement. Additionally, we explore three effective unsupervised multi-omics data fusion methods rooted in the main data fusion strategies to gain insights into their performance under crucial, yet overlooked, settings.

摘要

多组学数据通过提供对生物系统和疾病发展分子机制的全面理解,彻底改变了生物医学研究。然而,由于高维度和样本量有限,分析多组学数据具有挑战性,因此需要适当的数据降维流程以确保可靠的分析。此外,其多模态性质需要有效的数据整合流程。虽然已经提出了几种降维和数据融合算法,但关键方面往往被忽视。具体而言,投影空间维度的选择通常是启发式的,并且在所有组学中统一应用,忽略了各个组学面临的独特的高维小样本量挑战。本文介绍了一种针对各个视图量身定制的新型多模态降维流程。通过利用内在维度估计器,我们评估维度诅咒对每个视图的影响,并针对受影响显著的视图提出两步降维策略,将特征选择与特征提取相结合。在关键的监督多组学分析设置中,与传统的统一降维流程相比,我们的方法显示出显著改进。此外,我们探索了三种基于主要数据融合策略的有效的无监督多组学数据融合方法,以深入了解它们在关键但被忽视的设置下的性能。

相似文献

1
Intrinsic-dimension analysis for guiding dimensionality reduction and data fusion in multi-omics data processing.用于指导多组学数据处理中降维和数据融合的本征维分析。
Artif Intell Med. 2025 Feb;160:103049. doi: 10.1016/j.artmed.2024.103049. Epub 2024 Dec 11.
2
Supervised Parametric Learning in the Identification of Composite Biomarker Signatures of Type 1 Diabetes in Integrated Parallel Multi-Omics Datasets.在整合的平行多组学数据集中识别1型糖尿病复合生物标志物特征的监督参数学习
Biomedicines. 2024 Feb 22;12(3):492. doi: 10.3390/biomedicines12030492.
3
scMNMF: a novel method for single-cell multi-omics clustering based on matrix factorization.scMNMF:一种基于矩阵分解的单细胞多组学聚类新方法。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae228.
4
Novel multi-omics deconfounding variational autoencoders can obtain meaningful disease subtyping.新型多组学去混淆变分自动编码器可获得有意义的疾病亚型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae512.
5
Exploring unsupervised feature extraction algorithms: tackling high dimensionality in small datasets.探索无监督特征提取算法:解决小数据集中的高维问题。
Sci Rep. 2025 Jul 1;15(1):21973. doi: 10.1038/s41598-025-07725-9.
6
Factors that influence caregivers' and adolescents' views and practices regarding human papillomavirus (HPV) vaccination for adolescents: a qualitative evidence synthesis.影响照顾者和青少年对青少年人乳头瘤病毒(HPV)疫苗接种的看法及做法的因素:一项定性证据综合分析
Cochrane Database Syst Rev. 2025 Apr 15;4(4):CD013430. doi: 10.1002/14651858.CD013430.pub2.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
8
Gene regulatory network integration with multi-omics data enhances survival predictions in cancer.基因调控网络与多组学数据的整合提高了癌症生存预测能力。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf315.
9
Graph neural networks for single-cell omics data: a review of approaches and applications.用于单细胞组学数据的图神经网络:方法与应用综述
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf109.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.

引用本文的文献

1
Multi-omics decodes host-specific and environmental microbiome interactions in sepsis.多组学解析脓毒症中宿主特异性和环境微生物组的相互作用。
Front Microbiol. 2025 Jun 26;16:1618177. doi: 10.3389/fmicb.2025.1618177. eCollection 2025.
2
miss-SNF: a multimodal patient similarity network integration approach to handle completely missing data sources.缺失值-SNF:一种用于处理完全缺失数据源的多模态患者相似性网络集成方法。
Bioinformatics. 2025 Mar 29;41(4). doi: 10.1093/bioinformatics/btaf150.