Department of Computer Science, Wayne State University, USA.
Department of Obstetrics and Gynecology, Wayne State University, USA.
Brief Bioinform. 2018 Sep 28;19(5):737-753. doi: 10.1093/bib/bbx013.
DNA methylation is an important epigenetic mechanism that plays a crucial role in cellular regulatory systems. Recent advancements in sequencing technologies now enable us to generate high-throughput methylation data and to measure methylation up to single-base resolution. This wealth of data does not come without challenges, and one of the key challenges in DNA methylation studies is to identify the significant differences in the methylation levels of the base pairs across distinct biological conditions. Several computational methods have been developed to identify differential methylation using bisulfite sequencing data; however, there is no clear consensus among existing approaches. A comprehensive survey of these approaches would be of great benefit to potential users and researchers to get a complete picture of the available resources. In this article, we present a detailed survey of 22 such approaches focusing on their underlying statistical models, primary features, key advantages and major limitations. Importantly, the intrinsic drawbacks of the approaches pointed out in this survey could potentially be addressed by future research.
DNA 甲基化是一种重要的表观遗传机制,在细胞调控系统中起着关键作用。最近测序技术的进步使我们能够生成高通量的甲基化数据,并以单碱基分辨率测量甲基化。这些大量的数据并不是没有挑战的,DNA 甲基化研究中的一个关键挑战是识别在不同生物条件下碱基对甲基化水平的显著差异。已经开发了几种使用亚硫酸氢盐测序数据识别差异甲基化的计算方法;然而,现有的方法之间没有明确的共识。对这些方法进行全面调查将使潜在用户和研究人员受益,使他们全面了解可用资源。在本文中,我们详细调查了 22 种此类方法,重点介绍了它们的基础统计模型、主要特征、主要优点和主要限制。重要的是,本调查中指出的方法的内在缺陷可能会在未来的研究中得到解决。