Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610054, China.
School of Informatics, Xiamen University, Xiamen 361005, China.
Bioinformatics. 2021 Dec 22;38(1):52-57. doi: 10.1093/bioinformatics/btab611.
N4-acetylcytidine (ac4C) is the only acetylation modification that has been characterized in eukaryotic RNA, and is correlated with various human diseases. Laboratory identification of ac4C is complicated by factors, such as sample hydrolysis and high cost. Unfortunately, existing computational methods to identify ac4C do not achieve satisfactory performance.
We developed a novel tool, DeepAc4C, which identifies ac4C using convolutional neural networks (CNNs) using hybrid features composed of physicochemical patterns and a distributed representation of nucleic acids. Our results show that the proposed model achieved better and more balanced performance than existing predictors. Furthermore, we evaluated the effect that specific features had on the model predictions and their interaction effects. Several interesting sequence motifs specific to ac4C were identified.
The webserver is freely accessible at https://ac4c.webmalab.cn/, the source code and datasets are accessible at Zenodo with URL https://doi.org/10.5281/zenodo.5138047 and Github with URL https://github.com/wangchao-malab/DeepAc4C.
Supplementary data are available at Bioinformatics online.
N4-乙酰胞苷(ac4C)是唯一在真核 RNA 中被描述的乙酰化修饰,与各种人类疾病相关。实验室中 ac4C 的鉴定受到多种因素的影响,例如样品水解和高成本。不幸的是,现有的用于鉴定 ac4C 的计算方法无法达到令人满意的性能。
我们开发了一种新工具 DeepAc4C,它使用卷积神经网络 (CNNs) 结合物理化学模式和核酸的分布式表示来识别 ac4C。我们的结果表明,与现有预测器相比,所提出的模型具有更好和更平衡的性能。此外,我们评估了特定特征对模型预测的影响及其交互作用。鉴定出了几个与 ac4C 特异性相关的有趣序列基序。
该网络服务器可免费访问,网址为 https://ac4c.webmalab.cn/,源代码和数据集可在 Zenodo 上访问,网址为 https://doi.org/10.5281/zenodo.5138047,也可在 Github 上访问,网址为 https://github.com/wangchao-malab/DeepAc4C。
补充数据可在生物信息学在线获得。