Max F. Perutz Laboratories, Center for Integrative Bioinformatics Vienna, University of Vienna, Vienna, Austria.
PLoS One. 2012;7(7):e41528. doi: 10.1371/journal.pone.0041528. Epub 2012 Jul 20.
Deep sequencing after bisulfite conversion (BS-Seq) is the method of choice to generate whole genome maps of cytosine methylation at single base-pair resolution. Its application to genomic DNA of Arabidopsis flower bud tissue resulted in the first complete methylome, determining a methylation rate of 6.7% in this tissue. BS-Seq reads were mapped onto an in silico converted reference genome, applying the so-called 3-letter genome method. Here, we present BiSS (Bisufite Sequencing Scorer), a new method applying Smith-Waterman alignment to map bisulfite-converted reads to a reference genome. In addition, we introduce a comprehensive adaptive error estimate that accounts for sequencing errors, erroneous bisulfite conversion and also wrongly mapped reads. The re-analysis of the Arabidopsis methylome data with BiSS mapped substantially more reads to the genome. As a result, it determines the methylation status of an extra 10% of cytosines and estimates the methylation rate to be 7.7%. We validated the results by individual traditional bisulfite sequencing for selected genomic regions. In addition to predicting the methylation status of each cytosine, BiSS also provides an estimate of the methylation degree at each genomic site. Thus, BiSS explores BS-Seq data more extensively and provides more information for downstream analysis.
亚硫酸氢盐测序后(BS-Seq)是生成单个碱基分辨率的胞嘧啶甲基化全基因组图谱的首选方法。将其应用于拟南芥花蕾组织的基因组 DNA 中,生成了第一个完整的甲基组图谱,确定该组织的甲基化率为 6.7%。BS-Seq 读取被映射到虚拟转化的参考基因组上,应用所谓的 3 字母基因组方法。在这里,我们提出了 BiSS(Bisufite 测序评分器),这是一种应用 Smith-Waterman 比对将亚硫酸氢盐转化的读取映射到参考基因组的新方法。此外,我们引入了一种全面的自适应误差估计方法,该方法可以考虑测序错误、错误的亚硫酸氢盐转化以及错误映射的读取。使用 BiSS 重新分析拟南芥甲基组数据可以将更多的读取映射到基因组上。结果,它确定了额外 10%的胞嘧啶的甲基化状态,并估计甲基化率为 7.7%。我们通过对选定基因组区域的个别传统亚硫酸氢盐测序验证了这些结果。除了预测每个胞嘧啶的甲基化状态外,BiSS 还提供了每个基因组位点的甲基化程度的估计。因此,BiSS 更广泛地探索了 BS-Seq 数据,并为下游分析提供了更多信息。