Tanas A S, Kuznetsova E B, Borisova M E, Rudenko V V, Zaletayev D V, Strelnikov V V
Research Centre for Medical Genetics, Moscow, 115478 Russia.
Sechenov First Moscow State Medical University, Moscow, 119991 Russia.
Mol Biol (Mosk). 2015 Jul-Aug;49(4):689-99. doi: 10.7868/S0026898415040187.
The reduced representation bisulfite sequencing (RRBS) method has been developed for the high-throughput analysis of DNA methylation based on the sequencing of genomic libraries treated with sodium bisulfite by next-generation approaches. In contrast to whole-genome sequencing, the RRBS approach elaborates specific endonucleases to prepare libraries in order to produce pools of CpG-rich DNA fragments. The original RRBS technology based on the use of the MspI libraries allows one to increase the relative number of CpG islands in the pools of genomic fragments compared to whole-genome bisulfite sequencing. Nevertheless, this technology is rarely used due to the high cost compared with bisulfite methylation analysis with hybridization microarrays and significant residual amount of data represented by the sequences of genomic repeats that complicates the alignment and is not of particular interest for developing DNA methylation markers, which is often the main goal of biomedical research. We have developed an algorithm for estimating the likelihood that recognition sites of restriction endonucleases will be represented in CpG islands and present a method of reducing the effective size of the RRBS library without a significant loss of the CpG islands based on the use of the XmaI endonuclease for library preparation. In silico analysis demonstrates that the optimum range of the XmaI-RRBS fragment lengths is 110-200 base pairs. The sequencing of this library allows one to assess the methylation status of over 125000 CpG dinucleotides, of which over 90000 belong to CpG islands.
简化代表性亚硫酸氢盐测序(RRBS)方法已被开发出来,用于基于下一代方法对经亚硫酸氢钠处理的基因组文库进行测序,从而对DNA甲基化进行高通量分析。与全基因组测序不同,RRBS方法精心设计特定的内切酶来制备文库,以产生富含CpG的DNA片段池。基于使用MspI文库的原始RRBS技术,与全基因组亚硫酸氢盐测序相比,能够增加基因组片段池中CpG岛的相对数量。然而,由于与杂交微阵列亚硫酸氢盐甲基化分析相比成本高昂,且基因组重复序列产生的大量剩余数据会使比对变得复杂,而这些数据对于开发DNA甲基化标记(这通常是生物医学研究的主要目标)并非特别重要,因此该技术很少被使用。我们开发了一种算法,用于估计限制性内切酶识别位点在CpG岛中出现的可能性,并提出了一种基于使用XmaI内切酶制备文库来减小RRBS文库有效大小且不会显著损失CpG岛的方法。计算机模拟分析表明,XmaI-RRBS片段长度的最佳范围是110 - 200个碱基对。对该文库进行测序能够评估超过125000个CpG二核苷酸的甲基化状态,其中超过90000个属于CpG岛。