Yan Haidan, He Jun, Guan Qingzhou, Cai Hao, Zhang Lin, Zheng Weicheng, Qi Lishuang, Zhang Suyun, Liu Huaping, Li Hongdong, Zhao Wenyuan, Yang Sheng, Guo Zheng
Key Laboratory of Ministry of Education for Gastrointestinal Cancer, Department of Bioinformatics, Fujian Medical University, Fuzhou, China.
Institute of Biomedical Engineering and Instrumentation, Hangzhou Dianzi University, Hangzhou, China.
Oncotarget. 2017 Jul 18;8(29):47356-47364. doi: 10.18632/oncotarget.17647.
A big challenge to clinical diagnosis and therapy of colorectal cancer (CRC) is its extreme heterogeneity, and thus it would be of special importance if we could find common biomarkers besides subtype-specific biomarkers for CRC. Here, with DNA methylation data produced by different laboratories, we firstly revealed that the relative methylation-level orderings (RMOs) of CpG sites within colorectal normal tissues are highly stable but widely disrupted in the CRC tissues. This finding provides the basis for using the RankComp algorithm to identify differentially methylated (DM) CpG sites in every individual CRC sample through comparing the RMOs within the individual sample with the stable RMOs predetermined in normal tissues. For 75 CRC samples, RankComp detected averagely 4,062 DM CpG sites per sample and reached an average precision of 91.34% in terms that the hypermethylation or hypomethylation states of the DM CpG sites detected for each cancer sample were consistent with the observed differences between this cancer sample and its paired adjacent normal sample. Finally, we applied RankComp to identify DM CpG sites for each of the 268 CRC samples from The Cancer Genome Atlas and found 26 and 143 genes whose promoter regions included CpG sites that were hypermethylated and hypomethylated, respectively, in more than 95% of the 268 CRC samples. Individualized pathway analysis identified six pathways that were significantly enriched with DM genes in more than 90% of the CRC tissues. These universal DNA methylation biomarkers could be important diagnostic makers and therapy targets for CRC.
结直肠癌(CRC)临床诊断和治疗面临的一个重大挑战是其极端的异质性,因此,如果我们能找到除CRC亚型特异性生物标志物之外的常见生物标志物,将具有特别重要的意义。在这里,利用不同实验室产生的DNA甲基化数据,我们首先揭示了结直肠正常组织中CpG位点的相对甲基化水平排序(RMOs)高度稳定,但在CRC组织中广泛紊乱。这一发现为使用RankComp算法通过将个体样本中的RMOs与正常组织中预先确定的稳定RMOs进行比较来识别每个个体CRC样本中的差异甲基化(DM)CpG位点提供了基础。对于75个CRC样本,RankComp平均每个样本检测到4062个DM CpG位点,并且在每个癌症样本检测到的DM CpG位点的高甲基化或低甲基化状态与该癌症样本与其配对的相邻正常样本之间观察到的差异一致方面,平均精度达到91.34%。最后,我们应用RankComp来识别来自癌症基因组图谱的268个CRC样本中每个样本的DM CpG位点,发现分别有26个和143个基因,其启动子区域包含的CpG位点在268个CRC样本中的95%以上分别发生了高甲基化和低甲基化。个性化通路分析确定了在90%以上的CRC组织中DM基因显著富集的六个通路。这些通用的DNA甲基化生物标志物可能是CRC重要的诊断标志物和治疗靶点。