Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29 Jiangjun Road, Jiangning District, Nanjing 211106, China.
School of Biomedical Engineering and Informatics, Nanjing Medical University, No. 101 Longmian Avenue, Jiangning District, Nanjing 211166, China.
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae439.
The internal ribosome entry site (IRES) is a cis-regulatory element that can initiate translation in a cap-independent manner. It is often related to cellular processes and many diseases. Thus, identifying the IRES is important for understanding its mechanism and finding potential therapeutic strategies for relevant diseases since identifying IRES elements by experimental method is time-consuming and laborious. Many bioinformatics tools have been developed to predict IRES, but all these tools are based on structure similarity or machine learning algorithms. Here, we introduced a deep learning model named DeepIRES for precisely identifying IRES elements in messenger RNA (mRNA) sequences. DeepIRES is a hybrid model incorporating dilated 1D convolutional neural network blocks, bidirectional gated recurrent units, and self-attention module. Tenfold cross-validation results suggest that DeepIRES can capture deeper relationships between sequence features and prediction results than other baseline models. Further comparison on independent test sets illustrates that DeepIRES has superior and robust prediction capability than other existing methods. Moreover, DeepIRES achieves high accuracy in predicting experimental validated IRESs that are collected in recent studies. With the application of a deep learning interpretable analysis, we discover some potential consensus motifs that are related to IRES activities. In summary, DeepIRES is a reliable tool for IRES prediction and gives insights into the mechanism of IRES elements.
内部核糖体进入位点(IRES)是一种顺式调控元件,能够以非依赖 cap 的方式起始翻译。它通常与细胞过程和许多疾病有关。因此,鉴定 IRES 对于理解其机制和寻找相关疾病的潜在治疗策略非常重要,因为通过实验方法鉴定 IRES 元件既耗时又费力。已经开发了许多生物信息学工具来预测 IRES,但所有这些工具都是基于结构相似性或机器学习算法。在这里,我们引入了一种名为 DeepIRES 的深度学习模型,用于准确识别信使 RNA(mRNA)序列中的 IRES 元件。DeepIRES 是一种混合模型,结合了扩张的 1D 卷积神经网络块、双向门控循环单元和自注意力模块。十折交叉验证结果表明,DeepIRES 可以比其他基线模型更好地捕捉序列特征和预测结果之间的更深层次关系。在独立测试集上的进一步比较表明,DeepIRES 具有优于其他现有方法的卓越和稳健的预测能力。此外,DeepIRES 在预测最近研究中收集的实验验证的 IRES 方面具有很高的准确性。通过应用深度学习可解释性分析,我们发现了一些与 IRES 活性相关的潜在共识基序。总之,DeepIRES 是一种可靠的 IRES 预测工具,并深入了解了 IRES 元件的机制。