Suppr超能文献

用于精准肿瘤学中高维组学数据分析的知识引导统计学习方法

Knowledge-Guided Statistical Learning Methods for Analysis of High-Dimensional -Omics Data in Precision Oncology.

作者信息

Zhao Yize, Chang Changgee, Long Qi

机构信息

Weill Cornell Medicine, New York, NY.

University of Pennsylvania Perelman School of Medicine, Philadelphia, PA.

出版信息

JCO Precis Oncol. 2019 Oct 24;3. doi: 10.1200/PO.19.00018. eCollection 2019 Oct.

Abstract

High-dimensional -omics data such as genomic, transcriptomic, and metabolomic data offer great promise in advancing precision medicine. In particular, such data have enabled the investigation of complex diseases such as cancer at an unprecedented scale and in multiple dimensions. However, a number of analytical challenges complicate analysis of high-dimensional -omics data. One is the growing recognition that complex diseases such as cancer are multifactorial and may be attributed to harmful changes on multiple -omics levels and on the pathway level. When individual genes in an important pathway have relatively weak signals, it can be challenging to detect them on their own, but the aggregated signal in the pathway can be considerably stronger and hence easier to detect with the same sample size. To address these challenges, there is a growing body of literature on knowledge-guided statistical learning methods for analysis of high-dimensional -omics data that can incorporate biological knowledge such as functional genomics and functional proteomics. These methods have been shown to improve predication and classification accuracy and yield biologically more interpretable results compared with statistical learning methods that do not use biological knowledge. In this review, we survey current knowledge-guided statistical learning methods, including both supervised learning and unsupervised learning, and their applications to precision oncology, and we discuss future research directions.

摘要

高维组学数据,如基因组学、转录组学和代谢组学数据,在推进精准医学方面具有巨大潜力。特别是,此类数据使得对癌症等复杂疾病的研究能够以前所未有的规模和多维度进行。然而,一些分析挑战使高维组学数据的分析变得复杂。其中之一是人们越来越认识到,癌症等复杂疾病是多因素的,可能归因于多个组学层面和通路层面的有害变化。当重要通路中的单个基因信号相对较弱时,单独检测它们可能具有挑战性,但通路中的聚合信号可能会强得多,因此在相同样本量下更容易检测到。为应对这些挑战,关于用于分析高维组学数据的知识引导统计学习方法的文献越来越多,这些方法可以纳入功能基因组学和功能蛋白质组学等生物学知识。与不使用生物学知识的统计学习方法相比,这些方法已被证明可以提高预测和分类准确性,并产生生物学上更具可解释性的结果。在本综述中,我们调查了当前的知识引导统计学习方法,包括监督学习和无监督学习,以及它们在精准肿瘤学中的应用,并讨论了未来的研究方向。

相似文献

2
Knowledge-guided learning methods for integrative analysis of multi-omics data.用于多组学数据综合分析的知识引导学习方法。
Comput Struct Biotechnol J. 2024 Apr 30;23:1945-1950. doi: 10.1016/j.csbj.2024.04.053. eCollection 2024 Dec.
4
Data-Driven Methods for Advancing Precision Oncology.推进精准肿瘤学的数据驱动方法。
Curr Pharmacol Rep. 2018 Apr;4(2):145-156. doi: 10.1007/s40495-018-0127-4. Epub 2018 Mar 6.
8
Enter the Matrix: Factorization Uncovers Knowledge from Omics.《进入矩阵:从组学中发现知识的因子分解》
Trends Genet. 2018 Oct;34(10):790-805. doi: 10.1016/j.tig.2018.07.003. Epub 2018 Aug 22.

引用本文的文献

2
Knowledge-guided learning methods for integrative analysis of multi-omics data.用于多组学数据综合分析的知识引导学习方法。
Comput Struct Biotechnol J. 2024 Apr 30;23:1945-1950. doi: 10.1016/j.csbj.2024.04.053. eCollection 2024 Dec.

本文引用的文献

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验