Bioinformatics Platform, Berlin Institute for Medical Systems Biology, Max-Delbrück Center for Molecular Medicine, Berlin, Germany; Berlin Institute of Health (BIH), 10178 Berlin, Germany.
Bioinformatics Platform, Berlin Institute for Medical Systems Biology, Max-Delbrück Center for Molecular Medicine, Berlin, Germany.
J Biotechnol. 2017 Nov 10;261:105-115. doi: 10.1016/j.jbiotec.2017.08.007. Epub 2017 Aug 16.
DNA methylation is one of the main epigenetic modifications in the eukaryotic genome; it has been shown to play a role in cell-type specific regulation of gene expression, and therefore cell-type identity. Bisulfite sequencing is the gold-standard for measuring methylation over the genomes of interest. Here, we review several techniques used for the analysis of high-throughput bisulfite sequencing. We introduce specialized short-read alignment techniques as well as pre/post-alignment quality check methods to ensure data quality. Furthermore, we discuss subsequent analysis steps after alignment. We introduce various differential methylation methods and compare their performance using simulated and real bisulfite sequencing datasets. We also discuss the methods used to segment methylomes in order to pinpoint regulatory regions. We introduce annotation methods that can be used for further classification of regions returned by segmentation and differential methylation methods. Finally, we review software packages that implement strategies to efficiently deal with large bisulfite sequencing datasets locally and we discuss online analysis workflows that do not require any prior programming skills. The analysis strategies described in this review will guide researchers at any level to the best practices of bisulfite sequencing analysis.
DNA 甲基化是真核基因组中主要的表观遗传修饰之一;它已被证明在细胞类型特异性基因表达调控中发挥作用,因此在细胞类型身份中发挥作用。亚硫酸氢盐测序是测量感兴趣基因组中甲基化的金标准。在这里,我们回顾了用于分析高通量亚硫酸氢盐测序的几种技术。我们介绍了专门的短读序列比对技术以及预/后比对质量检查方法,以确保数据质量。此外,我们讨论了比对后的后续分析步骤。我们介绍了各种差异甲基化方法,并使用模拟和真实亚硫酸氢盐测序数据集比较它们的性能。我们还讨论了用于分割甲基组以确定调节区域的方法。我们介绍了可用于进一步对分割和差异甲基化方法返回的区域进行分类的注释方法。最后,我们回顾了在本地高效处理大型亚硫酸氢盐测序数据集的软件包,并讨论了不需要任何先前编程技能的在线分析工作流程。本综述中描述的分析策略将指导任何级别的研究人员采用亚硫酸氢盐测序分析的最佳实践。