Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
Nat Commun. 2019 Dec 11;10(1):5666. doi: 10.1038/s41467-019-13588-2.
The ability to characterize and predict tumor phenotypes is crucial to precision medicine. In this study, we present an integrative computational approach using a genome-wide association analysis and an Elastic Net prediction method to analyze the relationship between DNA copy number alterations and an archive of gene expression signatures. Across breast cancers, we are able to quantitatively predict many gene signatures levels within individual tumors with high accuracy based upon DNA copy number features alone, including proliferation status and Estrogen-signaling pathway activity. We can also predict many other key phenotypes, including intrinsic molecular subtypes, estrogen receptor status, and TP53 mutation. This approach is also applied to TCGA Pan-Cancer, which identify repeatedly predictable signatures across tumor types including immune features in lung squamous and basal-like breast cancers. These Elastic Net DNA predictors could also be called from DNA-based gene panels, thus facilitating their use as biomarkers to guide therapeutic decision making.
对肿瘤表型进行特征描述和预测对精准医疗至关重要。在这项研究中,我们提出了一种综合计算方法,利用全基因组关联分析和弹性网络预测方法来分析 DNA 拷贝数改变与基因表达特征档案之间的关系。在乳腺癌中,我们能够根据 DNA 拷贝数特征单独对单个肿瘤中许多基因特征水平进行高精度的定量预测,包括增殖状态和雌激素信号通路活性。我们还可以预测许多其他关键表型,包括内在分子亚型、雌激素受体状态和 TP53 突变。该方法也应用于 TCGA 泛癌,可识别包括肺鳞癌和基底样乳腺癌中的免疫特征在内的肿瘤类型中反复可预测的特征。这些弹性网络 DNA 预测因子也可以从基于 DNA 的基因面板中调用,从而便于将其用作指导治疗决策的生物标志物。