Xu Zongli, Taylor Jack A, Leung Yuet-Kin, Ho Shuk-Mei, Niu Liang
Epidemiology Branch.
Epigenetic and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, NIH, Research Triangle Park, NC 27709, USA.
Bioinformatics. 2016 Dec 1;32(23):3667-3669. doi: 10.1093/bioinformatics/btw527. Epub 2016 Aug 13.
5-Methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC) are important epigenetic regulators of gene expression. 5mC and 5hmC levels can be computationally inferred at single-base resolution using sequencing or array data from paired DNA samples that have undergone bisulfite and oxidative bisulfite conversion. Current estimation methods have been shown either to produce irregular estimates of 5hmC level or to be extremely computationally intensive.
We developed an efficient method, oxBS-MLE, based on binomial modeling of paired bisulfite and oxidative bisulfite data from sequencing or array analysis. Evaluation in several datasets showed that it outperformed alternative methods in estimation accuracy and computational speed.
oxBS-MLE is implemented in the Bioconductor package ENmix.
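As an illustration of what binomial modeling of paired data can look like, the sketch below assumes that the bisulfite (BS) signal reflects 5mC + 5hmC and the oxidative bisulfite (oxBS) signal reflects 5mC only, and computes the constrained maximum-likelihood estimate for a single site. The function name, argument names, and return layout are hypothetical; this is not the ENmix API and is not claimed to reproduce the authors' implementation.

```python
def oxbs_binomial_mle(meth_bs, total_bs, meth_ox, total_ox):
    """Constrained binomial MLE of 5mC and 5hmC for one CpG site (illustrative).

    Assumed model (not taken verbatim from the abstract):
        meth_bs ~ Binomial(total_bs, p_5mC + p_5hmC)   # BS sample
        meth_ox ~ Binomial(total_ox, p_5mC)            # oxBS sample
    with p_5mC >= 0, p_5hmC >= 0 and p_5mC + p_5hmC <= 1.
    Counts may be read counts (sequencing) or signal intensities (arrays).
    """
    if total_bs <= 0 or total_ox <= 0:
        raise ValueError("total counts must be positive")

    beta_bs = meth_bs / total_bs   # unconstrained estimate of 5mC + 5hmC
    beta_ox = meth_ox / total_ox   # unconstrained estimate of 5mC

    if beta_bs >= beta_ox:
        # The unconstrained optimum already satisfies p_5hmC >= 0.
        p_5mc = beta_ox
        p_5hmc = beta_bs - beta_ox
    else:
        # Naive subtraction would give a negative 5hmC level. The constrained
        # optimum lies on the boundary p_5hmC = 0, where both samples estimate
        # the same proportion, so the counts are pooled.
        p_5mc = (meth_bs + meth_ox) / (total_bs + total_ox)
        p_5hmc = 0.0

    return {"5mC": p_5mc, "5hmC": p_5hmc, "unmodified": 1.0 - p_5mc - p_5hmc}


if __name__ == "__main__":
    # BS: 80 of 100 methylated; oxBS: 50 of 100 methylated.
    print(oxbs_binomial_mle(80, 100, 50, 100))
    # oxBS fraction exceeds BS fraction, so 5hmC is set to 0 and counts pooled.
    print(oxbs_binomial_mle(40, 100, 50, 100))
```

Under these assumptions the estimate has a closed form, so no iterative optimization is needed per site, and the 5hmC estimate can never be negative.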
Contact: niulg@ucmail.uc.edu
Supplementary information: Supplementary data are available at Bioinformatics online.