He Yupeng, Gorkin David U, Dickel Diane E, Nery Joseph R, Castanon Rosa G, Lee Ah Young, Shen Yin, Visel Axel, Pennacchio Len A, Ren Bing, Ecker Joseph R
Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037.
Bioinformatics Program, University of California, San Diego, La Jolla, CA 92093.
Proc Natl Acad Sci U S A. 2017 Feb 28;114(9):E1633-E1640. doi: 10.1073/pnas.1618353114. Epub 2017 Feb 13.
Accurate enhancer identification is critical for understanding the spatiotemporal transcriptional regulation during development as well as the functional impact of disease-related noncoding genetic variants. Computational methods have been developed to predict the genomic locations of active enhancers based on histone modifications, but the accuracy and resolution of these methods remain limited. Here, we present an algorithm, regulatory element prediction based on tissue-specific local epigenetic marks (REPTILE), which integrates histone modification and whole-genome cytosine DNA methylation profiles to identify the precise location of enhancers. We tested the ability of REPTILE to identify enhancers previously validated in reporter assays. Compared with existing methods, REPTILE shows consistently superior performance across diverse cell and tissue types, and the enhancer locations are significantly more refined. We show that, by incorporating base-resolution methylation data, REPTILE greatly improves upon current methods for annotation of enhancers across a variety of cell and tissue types. REPTILE is available at https://github.com/yupenghe/REPTILE/.
准确识别增强子对于理解发育过程中的时空转录调控以及疾病相关非编码基因变异的功能影响至关重要。已开发出计算方法来基于组蛋白修饰预测活性增强子的基因组位置,但这些方法的准确性和分辨率仍然有限。在这里,我们提出了一种算法,即基于组织特异性局部表观遗传标记的调控元件预测(REPTILE),它整合了组蛋白修饰和全基因组胞嘧啶DNA甲基化图谱以识别增强子的精确位置。我们测试了REPTILE识别先前在报告基因检测中得到验证的增强子的能力。与现有方法相比,REPTILE在多种细胞和组织类型中均表现出持续卓越的性能,并且增强子位置得到了显著更精细的确定。我们表明,通过纳入碱基分辨率的甲基化数据,REPTILE在注释多种细胞和组织类型中的增强子方面大大改进了当前方法。可在https://github.com/yupenghe/REPTILE/获取REPTILE。