Shilpi Arunima, Bi Yingtao, Jung Segun, Patra Samir K, Davuluri Ramana V
Epigenetics and Cancer Research Laboratory, Biochemistry and Molecular Biology Group Department of Life Science, National Institute of Technology Rourkela, Odisha, India.
Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.
Cancer Inform. 2017 Jan 9;16:1-13. doi: 10.4137/CIN.S39783. eCollection 2017.
Breast cancer being a multifaceted disease constitutes a wide spectrum of histological and molecular variability in tumors. However, the task for the identification of these variances is complicated by the interplay between inherited genetic and epigenetic aberrations. Therefore, this study provides an extrapolate outlook to the sinister partnership between DNA methylation and single-nucleotide polymorphisms (SNPs) in relevance to the identification of prognostic markers in breast cancer. The effect of these SNPs on methylation is defined as methylation quantitative trait loci (meQTL).
We developed a novel method to identify prognostic gene signatures for breast cancer by integrating genomic and epigenomic data. This is based on the hypothesis that multiple sources of evidence pointing to the same gene or pathway are likely to lead to reduced false positives. We also apply random resampling to reduce overfitting noise by dividing samples into training and testing data sets. Specifically, the common samples between Illumina 450 DNA methylation, Affymetrix SNP array, and clinical data sets obtained from the Cancer Genome Atlas (TCGA) for breast invasive carcinoma (BRCA) were randomly divided into training and test models. An intensive statistical analysis based on log-rank test and Cox proportional hazard model has established a significant association between differential methylation and the stratification of breast cancer patients into high- and low-risk groups, respectively.
The comprehensive assessment based on the conjoint effect of CpG-SNP pair has guided in delaminating the breast cancer patients into the high- and low-risk groups. In particular, the most significant association was found with respect to cg05370838-rs2230576, cg00956490-rs940453, and cg11340537-rs2640785 CpG-SNP pairs. These CpG-SNP pairs were strongly associated with differential expression of , , and genes, respectively. Besides, the exclusive effect of SNPs such as rs10101376, rs140679, and rs1538146 also hold significant prognostic determinant.
Thus, the analysis based on DNA methylation and SNPs have resulted in the identification of novel susceptible loci that hold prognostic relevance in breast cancer.
乳腺癌是一种多方面的疾病,肿瘤具有广泛的组织学和分子变异性。然而,由于遗传基因异常和表观遗传异常之间的相互作用,识别这些变异的任务变得复杂。因此,本研究提供了一个关于DNA甲基化与单核苷酸多态性(SNP)之间有害伙伴关系的推断观点,这与乳腺癌预后标志物的识别相关。这些SNP对甲基化的影响被定义为甲基化数量性状位点(meQTL)。
我们开发了一种通过整合基因组和表观基因组数据来识别乳腺癌预后基因特征的新方法。这基于这样一种假设,即指向同一基因或途径的多种证据来源可能会减少假阳性。我们还应用随机重采样通过将样本分为训练和测试数据集来减少过拟合噪声。具体而言,将从癌症基因组图谱(TCGA)获取的用于乳腺浸润性癌(BRCA)的Illumina 450 DNA甲基化、Affymetrix SNP阵列和临床数据集之间的共同样本随机分为训练和测试模型。基于对数秩检验和Cox比例风险模型的深入统计分析分别在差异甲基化与乳腺癌患者分层为高风险和低风险组之间建立了显著关联。
基于CpG-SNP对的联合效应的综合评估指导将乳腺癌患者分为高风险和低风险组。特别是,发现cg05370838-rs2230576、cg00956490-rs940453和cg11340537-rs2640785 CpG-SNP对具有最显著的关联。这些CpG-SNP对分别与 、 和 基因的差异表达强烈相关。此外,rs10101376、rs140679和rs1538146等SNP的单独效应也具有显著的预后决定因素。
因此,基于DNA甲基化和SNP的分析已导致识别出在乳腺癌中具有预后相关性的新的易感位点。