• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepO-GlcNAc:一种利用深度学习结合注意力机制预测蛋白质O-连接N-乙酰葡糖胺化位点的网络服务器。

DeepO-GlcNAc: a web server for prediction of protein O-GlcNAcylation sites using deep learning combined with attention mechanism.

作者信息

Zhang Liyuan, Deng Tingzhi, Pan Shuijing, Zhang Minghui, Zhang Yusen, Yang Chunhua, Yang Xiaoyong, Tian Geng, Mi Jia

机构信息

Shandong Technology Innovation Center of Molecular Targeting and Intelligent Diagnosis and Treatment, Binzhou Medical University, Yantai, Shandong, China.

National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, Fujian, China.

出版信息

Front Cell Dev Biol. 2024 Oct 10;12:1456728. doi: 10.3389/fcell.2024.1456728. eCollection 2024.

DOI:10.3389/fcell.2024.1456728
PMID:39450274
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11500328/
Abstract

INTRODUCTION

Protein O-GlcNAcylation is a dynamic post-translational modification involved in major cellular processes and associated with many human diseases. Bioinformatic prediction of O-GlcNAc sites before experimental validation is a challenge task in O-GlcNAc research. Recent advancements in deep learning algorithms and the availability of O-GlcNAc proteomics data present an opportunity to improve O-GlcNAc site prediction.

OBJECTIVES

This study aims to develop a deep learning-based tool to improve O-GlcNAcylation site prediction.

METHODS

We construct an annotated unbalanced O-GlcNAcylation data set and propose a new deep learning framework, DeepO-GlcNAc, using Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNN) combined with attention mechanism.

RESULTS

The ablation study confirms that the additional model components in DeepO-GlcNAc, such as attention mechanisms and LSTM, contribute positively to improving prediction performance. Our model demonstrates strong robustness across five cross-species datasets, excluding humans. We also compare our model with three external predictors using an independent dataset. Our results demonstrated that DeepO-GlcNAc outperforms the external predictors, achieving an accuracy of 92%, an average precision of 72%, a MCC of 0.60, and an AUC of 92% in ROC analysis. Moreover, we have implemented DeepO-GlcNAc as a web server to facilitate further investigation and usage by the scientific community.

CONCLUSION

Our work demonstrates the feasibility of utilizing deep learning for O-GlcNAc site prediction and provides a novel tool for O-GlcNAc investigation.

摘要

引言

蛋白质O-连接的N-乙酰葡糖胺化是一种动态的翻译后修饰,参与主要细胞过程并与多种人类疾病相关。在实验验证之前对O-连接的N-乙酰葡糖胺位点进行生物信息学预测是O-连接的N-乙酰葡糖胺研究中的一项具有挑战性的任务。深度学习算法的最新进展以及O-连接的N-乙酰葡糖胺蛋白质组学数据的可用性为改进O-连接的N-乙酰葡糖胺位点预测提供了机会。

目的

本研究旨在开发一种基于深度学习的工具,以改进O-连接的N-乙酰葡糖胺化位点预测。

方法

我们构建了一个带注释的不平衡O-连接的N-乙酰葡糖胺化数据集,并提出了一种新的深度学习框架DeepO-GlcNAc,它使用长短期记忆(LSTM)、卷积神经网络(CNN)并结合注意力机制。

结果

消融研究证实,DeepO-GlcNAc中的附加模型组件,如注意力机制和LSTM,对提高预测性能有积极贡献。我们的模型在五个跨物种数据集(不包括人类)上表现出很强的稳健性。我们还使用独立数据集将我们的模型与三个外部预测器进行了比较。我们的结果表明,DeepO-GlcNAc优于外部预测器,在ROC分析中准确率达到92%,平均精度为72%,MCC为0.60,AUC为92%。此外,我们已将DeepO-GlcNAc实现为一个网络服务器,以方便科学界进一步研究和使用。

结论

我们的工作证明了利用深度学习进行O-连接的N-乙酰葡糖胺位点预测的可行性,并为O-连接的N-乙酰葡糖胺研究提供了一种新工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/f585617ff74f/fcell-12-1456728-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/557b74051dd6/fcell-12-1456728-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/ce87d2055975/fcell-12-1456728-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/6d9ada33a437/fcell-12-1456728-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/27bc69250cbf/fcell-12-1456728-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/25a9e2313872/fcell-12-1456728-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/f585617ff74f/fcell-12-1456728-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/557b74051dd6/fcell-12-1456728-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/ce87d2055975/fcell-12-1456728-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/6d9ada33a437/fcell-12-1456728-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/27bc69250cbf/fcell-12-1456728-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/25a9e2313872/fcell-12-1456728-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/98d0/11500328/f585617ff74f/fcell-12-1456728-g006.jpg

相似文献

1
DeepO-GlcNAc: a web server for prediction of protein O-GlcNAcylation sites using deep learning combined with attention mechanism.DeepO-GlcNAc:一种利用深度学习结合注意力机制预测蛋白质O-连接N-乙酰葡糖胺化位点的网络服务器。
Front Cell Dev Biol. 2024 Oct 10;12:1456728. doi: 10.3389/fcell.2024.1456728. eCollection 2024.
2
O-GlcNAcPRED-DL: Prediction of Protein O-GlcNAcylation Sites Based on an Ensemble Model of Deep Learning.O-GlcNAcPRED-DL:基于深度学习集成模型的蛋白质 O-GlcNAc 化位点预测。
J Proteome Res. 2024 Jan 5;23(1):95-106. doi: 10.1021/acs.jproteome.3c00458. Epub 2023 Dec 6.
3
Integrating Embeddings from Multiple Protein Language Models to Improve Protein -GlcNAc Site Prediction.整合来自多个蛋白质语言模型的嵌入以提高蛋白质-GlcNAc 位点预测。
Int J Mol Sci. 2023 Nov 6;24(21):16000. doi: 10.3390/ijms242116000.
4
Validation of the reliability of computational O-GlcNAc prediction.计算性O-连接N-乙酰葡糖胺预测可靠性的验证。
Biochim Biophys Acta. 2014 Feb;1844(2):416-21. doi: 10.1016/j.bbapap.2013.12.002. Epub 2013 Dec 9.
5
A two-layered machine learning method to identify protein O-GlcNAcylation sites with O-GlcNAc transferase substrate motifs.一种用于识别具有O-连接N-乙酰葡糖胺转移酶底物基序的蛋白质O-连接N-乙酰葡糖胺化位点的两层机器学习方法。
BMC Bioinformatics. 2015;16 Suppl 18(Suppl 18):S10. doi: 10.1186/1471-2105-16-S18-S10. Epub 2015 Dec 9.
6
-GlcNAcylation Prediction: An Unattained Objective.- N-乙酰葡糖胺化预测:一个未实现的目标。
Adv Appl Bioinform Chem. 2021 Jun 8;14:87-102. doi: 10.2147/AABC.S294867. eCollection 2021.
7
O-GlcNAcylation site mapping by (azide-alkyne) click chemistry and mass spectrometry following intensive fractionation of skeletal muscle cells proteins.通过(叠氮-炔)点击化学和质谱法对骨骼肌细胞蛋白进行强化分级后进行 O-GlcNAc 酰化位点作图。
J Proteomics. 2018 Aug 30;186:83-97. doi: 10.1016/j.jprot.2018.07.005. Epub 2018 Jul 26.
8
Characterization and identification of protein O-GlcNAcylation sites with substrate specificity.具有底物特异性的蛋白质O-连接N-乙酰葡糖胺化位点的表征与鉴定。
BMC Bioinformatics. 2014;15 Suppl 16(Suppl 16):S1. doi: 10.1186/1471-2105-15-S16-S1. Epub 2014 Dec 8.
9
O-GlcNAcAtlas: A database of experimentally identified O-GlcNAc sites and proteins.O-GlcNAcAtlas:一个实验鉴定的 O-GlcNAc 位点和蛋白质数据库。
Glycobiology. 2021 Aug 7;31(7):719-723. doi: 10.1093/glycob/cwab003.
10
Feedback Regulation of -GlcNAc Transferase through Translation Control to Maintain Intracellular -GlcNAc Homeostasis.通过翻译控制反馈调节 N-乙酰氨基葡萄糖转移酶以维持细胞内 N-乙酰氨基葡萄糖稳态。
Int J Mol Sci. 2021 Mar 27;22(7):3463. doi: 10.3390/ijms22073463.

引用本文的文献

1
Application of Pseudoinfectious Viruses in Transient Gene Expression in Mammalian Cells: Combining Efficient Expression with Regulatory Compliance.伪感染性病毒在哺乳动物细胞瞬时基因表达中的应用:将高效表达与法规合规相结合。
Biomolecules. 2025 Feb 13;15(2):274. doi: 10.3390/biom15020274.

本文引用的文献

1
O-GlcNAcPRED-DL: Prediction of Protein O-GlcNAcylation Sites Based on an Ensemble Model of Deep Learning.O-GlcNAcPRED-DL:基于深度学习集成模型的蛋白质 O-GlcNAc 化位点预测。
J Proteome Res. 2024 Jan 5;23(1):95-106. doi: 10.1021/acs.jproteome.3c00458. Epub 2023 Dec 6.
2
Deciphering protein post-translational modifications using chemical biology tools.使用化学生物学工具解析蛋白质翻译后修饰
Nat Rev Chem. 2020 Dec;4(12):674-695. doi: 10.1038/s41570-020-00223-8. Epub 2020 Oct 6.
3
DLF-Sul: a multi-module deep learning framework for prediction of S-sulfinylation sites in proteins.
DLF-Sul:一个用于蛋白质 S-亚磺酰化位点预测的多模块深度学习框架。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac323.
4
Mini-review: Recent advances in post-translational modification site prediction based on deep learning.小型综述:基于深度学习的翻译后修饰位点预测的最新进展
Comput Struct Biotechnol J. 2022 Jun 30;20:3522-3532. doi: 10.1016/j.csbj.2022.06.045. eCollection 2022.
5
Adapt-Kcr: a novel deep learning framework for accurate prediction of lysine crotonylation sites based on learning embedding features and attention architecture.Adapt-Kcr:一种基于学习嵌入特征和注意力架构的新型深度学习框架,用于准确预测赖氨酸巴豆酰化位点。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac037.
6
GPS-Uber: a hybrid-learning framework for prediction of general and E3-specific lysine ubiquitination sites.GPS-Uber:一种用于预测泛素化和 E3 特异性赖氨酸泛素化位点的混合学习框架。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab574.
7
Computational identification of 4-carboxyglutamate sites to supplement physiological studies using deep learning.利用深度学习计算鉴定 4-羧基谷氨酸位点,以补充生理研究。
Sci Rep. 2022 Jan 7;12(1):128. doi: 10.1038/s41598-021-03895-4.
8
O-glycosylation site prediction for by combining properties and sequence features with support vector machine.通过结合性质、序列特征与支持向量机对O-糖基化位点进行预测。 (原句by前少了个词,表述不太完整准确,按现有内容尽量通顺翻译)
J Bioinform Comput Biol. 2022 Feb;20(1):2150029. doi: 10.1142/S0219720021500293. Epub 2021 Nov 19.
9
dbPTM in 2022: an updated database for exploring regulatory networks and functional associations of protein post-translational modifications.dbPTM 在 2022 年:一个更新的数据库,用于探索蛋白质翻译后修饰的调控网络和功能关联。
Nucleic Acids Res. 2022 Jan 7;50(D1):D471-D479. doi: 10.1093/nar/gkab1017.
10
iSUMOK-PseAAC: prediction of lysine sumoylation sites using statistical moments and Chou's PseAAC.iSUMOK-PseAAC:利用统计矩和周氏伪氨基酸组成预测赖氨酸的类泛素化位点
PeerJ. 2021 Aug 4;9:e11581. doi: 10.7717/peerj.11581. eCollection 2021.