Cao Boyang, Luo Huijuan, Luo Tian, Li Nannan, Shao Kang, Wu Kui, Sahu Sunil Kumar, Li Fuqiang, Lin Cong
BGI-Shenzhen, Shenzhen, 518083, China.
Guangdong Provincial Key Laboratory of Human Disease Genomics, Shenzhen Key Laboratory of Genomics, BGI-Shenzhen, Shenzhen 518083, China.
Heliyon. 2023 May 22;9(6):e16571. doi: 10.1016/j.heliyon.2023.e16571. eCollection 2023 Jun.
Whole-genome bisulfite sequencing (WGBS) technology can provide comprehensive DNA methylation at a single-base resolution on a genome-wide scale, and is considered to be the gold standard for the detection of 5-methylcytosine (5 mC). However, the International Human Epigenome Consortium propose a full DNA methylome should have at least 30 fold redundant coverage of the reference genome from a single biological replicate. Therefore, it remains cost prohibitive for large-scale studies. To find a solution, the DNBSEQ-Tx sequencing was developed that can generate up to 6 Tb data in a single run for projects involving large-scale sequencing.
In this study, we provided two WGBS library construction methods DNB_PREBSseq and DNB_SPLATseq optimized for the DNBSEQ-Tx sequencer, and demonstrated the performance of these two methods on the DNBSEQ-Tx platform, using the DNA extracted from four different cell lines. We also compared the sequencing data from these two WGBS library construction methods with HeLa cell line data from ENCODE sequenced on Illumina HiSeq X Ten and WGBS data of two other cell lines sequenced on HiSeq2500. Various quality control (QC) analyses such as the base quality scores, methylation-bias (m-bias), and conversion efficiency indicated that the data sequenced on the DNBSEQ-Tx platform met the WGBS-required quality controls. Meanwhile, our data closely resembled the coverage shown by the data generated by the Illumina platform.
Our study showed that with our optimized methods, DNBSEQ-Tx could generate high-quality WGBS data with relatively good stability for large-scale WGBS sequencing applications. Thus, we conclude that DNBSEQ-Tx can be used for a wide range of WGBS research.
全基因组亚硫酸氢盐测序(WGBS)技术能够在全基因组范围内以单碱基分辨率提供全面的DNA甲基化信息,被认为是检测5-甲基胞嘧啶(5mC)的金标准。然而,国际人类表观基因组联盟提出,一个完整的DNA甲基化组应从单个生物学重复中对参考基因组具有至少30倍的冗余覆盖。因此,对于大规模研究而言,其成本仍然过高。为找到解决方案,开发了DNBSEQ-Tx测序技术,对于涉及大规模测序的项目,该技术单次运行可产生高达6Tb的数据。
在本研究中,我们提供了两种针对DNBSEQ-Tx测序仪优化的WGBS文库构建方法DNB_PREBSseq和DNB_SPLATseq,并使用从四种不同细胞系中提取的DNA,在DNBSEQ-Tx平台上展示了这两种方法的性能。我们还将这两种WGBS文库构建方法的测序数据与在Illumina HiSeq X Ten上测序的ENCODE项目中的HeLa细胞系数据以及在HiSeq2500上测序的另外两种细胞系的WGBS数据进行了比较。各种质量控制(QC)分析,如碱基质量分数、甲基化偏差(m-偏差)和转化率,表明在DNBSEQ-Tx平台上测序的数据符合WGBS所需的质量控制标准。同时,我们的数据与Illumina平台产生的数据所显示的覆盖度非常相似。
我们的研究表明,通过我们优化的方法,DNBSEQ-Tx能够为大规模WGBS测序应用生成具有相对良好稳定性的高质量WGBS数据。因此,我们得出结论,DNBSEQ-Tx可用于广泛的WGBS研究。