Suppr超能文献

基于词嵌入的深度神经网络框架用于蛋白质戊二酰化位点预测

Deep Neural Network Framework Based on Word Embedding for Protein Glutarylation Sites Prediction.

作者信息

Liu Chuan-Ming, Ta Van-Dai, Le Nguyen Quoc Khanh, Tadesse Direselign Addis, Shi Chongyang

机构信息

Department of Computer Science and Information Engineering, National Taipei University of Technology (Taipei Tech), Taipei City 106, Taiwan.

Samsung Display Vietnam (SDV), Yen Phong Industrial Park, Bac Ninh 16000, Vietnam.

出版信息

Life (Basel). 2022 Aug 10;12(8):1213. doi: 10.3390/life12081213.

Abstract

In recent years, much research has found that dysregulation of glutarylation is associated with many human diseases, such as diabetes, cancer, and glutaric aciduria type I. Therefore, glutarylation identification and characterization are essential tasks for determining modification-specific proteomics. This study aims to propose a novel deep neural network framework based on word embedding techniques for glutarylation sites prediction. Multiple deep neural network models are implemented to evaluate the performance of glutarylation sites prediction. Furthermore, an extensive experimental comparison of word embedding techniques is conducted to utilize the most efficient method for improving protein sequence data representation. The results suggest that the proposed deep neural networks not only improve protein sequence representation but also work effectively in glutarylation sites prediction by obtaining a higher accuracy and confidence rate compared to the previous work. Moreover, embedding techniques were proven to be more productive than the pre-trained word embedding techniques for glutarylation sequence representation. Our proposed method has significantly outperformed all traditional performance metrics compared to the advanced integrated vector support, with accuracy, specificity, sensitivity, and correlation coefficient of 0.79, 0.89, 0.59, and 0.51, respectively. It shows the potential to detect new glutarylation sites and uncover the relationships between glutarylation and well-known lysine modification.

摘要

近年来,许多研究发现戊二酰化失调与许多人类疾病相关,如糖尿病、癌症和I型戊二酸尿症。因此,戊二酰化的鉴定和表征是确定修饰特异性蛋白质组学的重要任务。本研究旨在提出一种基于词嵌入技术的新型深度神经网络框架用于戊二酰化位点预测。实现了多个深度神经网络模型来评估戊二酰化位点预测的性能。此外,对词嵌入技术进行了广泛的实验比较,以利用最有效的方法来改进蛋白质序列数据表示。结果表明,所提出的深度神经网络不仅改善了蛋白质序列表示,而且在戊二酰化位点预测中有效地发挥作用,与先前的工作相比获得了更高的准确率和置信率。此外,对于戊二酰化序列表示,嵌入技术被证明比预训练词嵌入技术更有效。与先进的集成向量支持相比,我们提出的方法在所有传统性能指标上均显著优于其,准确率、特异性、敏感性和相关系数分别为0.79、0.89、0.59和0.51。它显示了检测新戊二酰化位点以及揭示戊二酰化与著名赖氨酸修饰之间关系的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23fc/9410500/ee6a3aacbc34/life-12-01213-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验