Suppr超能文献

一种基于XGBoost和SHAP的用于识别N-甲基鸟苷位点的可解释预测模型。

An Interpretable Prediction Model for Identifying N-Methylguanosine Sites Based on XGBoost and SHAP.

作者信息

Bi Yue, Xiang Dongxu, Ge Zongyuan, Li Fuyi, Jia Cangzhi, Song Jiangning

机构信息

School of Science, Dalian Maritime University, Dalian 116026, China.

Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.

出版信息

Mol Ther Nucleic Acids. 2020 Aug 25;22:362-372. doi: 10.1016/j.omtn.2020.08.022. eCollection 2020 Dec 4.

Abstract

Recent studies have increasingly shown that the chemical modification of mRNA plays an important role in the regulation of gene expression. N-methylguanosine (m7G) is a type of positively-charged mRNA modification that plays an essential role for efficient gene expression and cell viability. However, the research on m7G has received little attention to date. Bioinformatics tools can be applied as auxiliary methods to identify m7G sites in transcriptomes. In this study, we develop a novel interpretable machine learning-based approach termed XG-m7G for the differentiation of m7G sites using the XGBoost algorithm and six different types of sequence-encoding schemes. Both 10-fold and jackknife cross-validation tests indicate that XG-m7G outperforms iRNA-m7G. Moreover, using the powerful SHAP algorithm, this new framework also provides desirable interpretations of the model performance and highlights the most important features for identifying m7G sites. XG-m7G is anticipated to serve as a useful tool and guide for researchers in their future studies of mRNA modification sites.

摘要

最近的研究越来越多地表明,mRNA的化学修饰在基因表达调控中起着重要作用。N-甲基鸟苷(m7G)是一种带正电荷的mRNA修饰,对有效的基因表达和细胞活力起着至关重要的作用。然而,迄今为止,对m7G的研究很少受到关注。生物信息学工具可以作为辅助方法来识别转录组中的m7G位点。在本研究中,我们开发了一种基于可解释机器学习的新方法,称为XG-m7G,用于使用XGBoost算法和六种不同类型的序列编码方案来区分m7G位点。10倍交叉验证测试和留一法交叉验证测试均表明,XG-m7G优于iRNA-m7G。此外,使用强大的SHAP算法,这个新框架还对模型性能提供了理想的解释,并突出了识别m7G位点的最重要特征。预计XG-m7G将成为研究人员未来研究mRNA修饰位点的有用工具和指南。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d19/7533297/885ea610f549/fx1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验