Ma Xiaoke, Yu Liang, Wang Peizhuo, Yang Xiaofei
School of Computer Science and Technology, Xidian University, No.2 South Taibai Road, Xi'an, Shaanxi, China; Xidian-Ningbo Information Technology Institute, Xidian University, No. 777 Zhongguanxi Road, Ningbo City, China.
School of Computer Science and Technology, Xidian University, No.2 South Taibai Road, Xi'an, Shaanxi, China.
Comput Biol Chem. 2017 Aug;69:164-170. doi: 10.1016/j.compbiolchem.2017.03.014. Epub 2017 May 4.
Despite growing evidence demonstrates that the long non-coding ribonucleic acids (lncRNAs) are critical modulators for cancers, the knowledge about the DNA methylation patterns of lncRNAs is quite limited. We develop a systematic analysis pipeline to discover DNA methylation patterns for lncRNAs across multiple cancer subtypes from probe, gene and network levels. By using The Cancer Genome Atlas (TCGA) breast cancer methylation data, the pipeline discovers various DNA methylation patterns for lncRNAs across four major subtypes such as luminal A, luminal B, her2-enriched as well as basal-like. On the probe and gene level, we find that both differentially methylated probes and lncRNAs are subtype specific, while the lncRNAs are not as specific as probes. On the network level, the pipeline constructs differential co-methylation lncRNA network for each subtype. Then, it identifies both subtype specific and common lncRNA modules by simultaneously analyzing multiple networks. We show that the lncRNAs in subtype specific and common modules differ greatly in terms of topological structure, sequence conservation as well as expression. Furthermore, the subtype specific lncRNA modules serve as biomarkers to improve significantly the accuracy of breast cancer subtypes prediction. Finally, the common lncRNA modules associate with survival time of patients, which is critical for cancer therapy.
尽管越来越多的证据表明长链非编码核糖核酸(lncRNAs)是癌症的关键调节因子,但关于lncRNAs的DNA甲基化模式的了解却相当有限。我们开发了一种系统分析流程,从探针、基因和网络层面发现多种癌症亚型中lncRNAs的DNA甲基化模式。通过使用癌症基因组图谱(TCGA)乳腺癌甲基化数据,该流程发现了lncRNAs在四种主要亚型(如腔面A型、腔面B型、人表皮生长因子受体2(HER2)富集型以及基底样型)中的各种DNA甲基化模式。在探针和基因层面,我们发现差异甲基化探针和lncRNAs都是亚型特异性的,不过lncRNAs的特异性不如探针。在网络层面,该流程为每种亚型构建了差异共甲基化lncRNA网络。然后,通过同时分析多个网络来识别亚型特异性和常见的lncRNA模块。我们表明,亚型特异性和常见模块中的lncRNAs在拓扑结构、序列保守性以及表达方面存在很大差异。此外,亚型特异性lncRNA模块可作为生物标志物,显著提高乳腺癌亚型预测的准确性。最后,常见lncRNA模块与患者的生存时间相关,这对癌症治疗至关重要。