Suppr超能文献

使用充分方向因子模型分析与乳腺癌生存相关的潜在活动。

Using sufficient direction factor model to analyze latent activities associated with breast cancer survival.

作者信息

Baek Seungchul, Ho Yen-Yi, Ma Yanyuan

机构信息

Department of Mathematics and Statistics, University of Maryland Baltimore County, Baltimore, Maryland.

Department of Statistics, University of South Carolina, Columbia, South Carolina.

出版信息

Biometrics. 2020 Dec;76(4):1340-1350. doi: 10.1111/biom.13208. Epub 2020 Jan 6.

Abstract

High-dimensional gene expression data often exhibit intricate correlation patterns as the result of coordinated genetic regulation. In practice, however, it is difficult to directly measure these coordinated underlying activities. Analysis of breast cancer survival data with gene expressions motivates us to use a two-stage latent factor approach to estimate these unobserved coordinated biological processes. Compared to existing approaches, our proposed procedure has several unique characteristics. In the first stage, an important distinction is that our procedure incorporates prior biological knowledge about gene-pathway membership into the analysis and explicitly model the effects of genetic pathways on the latent factors. Second, to characterize the molecular heterogeneity of breast cancer, our approach provides estimates specific to each cancer subtype. Finally, our proposed framework incorporates sparsity condition due to the fact that genetic networks are often sparse. In the second stage, we investigate the relationship between latent factor activity levels and survival time with censoring using a general dimension reduction model in the survival analysis context. Combining the factor model and sufficient direction model provides an efficient way of analyzing high-dimensional data and reveals some interesting relations in the breast cancer gene expression data.

摘要

由于基因调控的协同作用,高维基因表达数据常常呈现出复杂的相关模式。然而在实际中,直接测量这些潜在的协同活动是困难的。对具有基因表达的乳腺癌生存数据进行分析,促使我们采用两阶段潜在因子方法来估计这些未观察到的协同生物过程。与现有方法相比,我们提出的方法具有几个独特的特点。在第一阶段,一个重要的区别是我们的方法将关于基因通路成员的先验生物学知识纳入分析,并明确地对遗传通路对潜在因子的影响进行建模。其次,为了刻画乳腺癌的分子异质性,我们的方法提供了针对每种癌症亚型的估计。最后,由于遗传网络通常是稀疏的,我们提出的框架纳入了稀疏条件。在第二阶段,我们在生存分析背景下使用一般的降维模型,研究潜在因子活性水平与带删失的生存时间之间的关系。结合因子模型和充分方向模型提供了一种分析高维数据的有效方法,并揭示了乳腺癌基因表达数据中的一些有趣关系。

相似文献

7
Semiparametric Bayesian kernel survival model for evaluating pathway effects.半参数贝叶斯核生存模型用于评估途径效应。
Stat Methods Med Res. 2019 Oct-Nov;28(10-11):3301-3317. doi: 10.1177/0962280218797360. Epub 2018 Oct 5.

引用本文的文献

本文引用的文献

4
Fatty acid metabolism in breast cancer subtypes.乳腺癌亚型中的脂肪酸代谢
Oncotarget. 2017 Apr 25;8(17):29487-29500. doi: 10.18632/oncotarget.15494.
7
Breast cancer statistics, 2013.乳腺癌统计数据,2013 年。
CA Cancer J Clin. 2014 Jan-Feb;64(1):52-62. doi: 10.3322/caac.21203. Epub 2013 Oct 1.
9
Comprehensive molecular portraits of human breast tumours.人类乳腺肿瘤的全面分子特征图谱。
Nature. 2012 Oct 4;490(7418):61-70. doi: 10.1038/nature11412. Epub 2012 Sep 23.
10
Molecular biology in breast cancer: intrinsic subtypes and signaling pathways.乳腺癌的分子生物学:内在亚型和信号通路。
Cancer Treat Rev. 2012 Oct;38(6):698-707. doi: 10.1016/j.ctrv.2011.11.005. Epub 2011 Dec 16.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验