• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习模型的N-糖基化位点识别分析研究

An analytical study on the identification of N-linked glycosylation sites using machine learning model.

作者信息

Akmal Muhammad Aizaz, Hassan Muhammad Awais, Muhammad Shoaib, Khurshid Khaldoon S, Mohamed Abdullah

机构信息

Department of Computer Science, University of Engineering and Technology, KSK, Lahore, Punjab, Pakistan.

Department of Computer Science, University of Engineering and Technology, Lahore, Punjab, Pakistan.

出版信息

PeerJ Comput Sci. 2022 Sep 21;8:e1069. doi: 10.7717/peerj-cs.1069. eCollection 2022.

DOI:10.7717/peerj-cs.1069
PMID:36262138
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9575850/
Abstract

N-linked is the most common type of glycosylation which plays a significant role in identifying various diseases such as type I diabetes and cancer and helps in drug development. Most of the proteins cannot perform their biological and psychological functionalities without undergoing such modification. Therefore, it is essential to identify such sites by computational techniques because of experimental limitations. This study aims to analyze and synthesize the progress to discover N-linked places using machine learning methods. It also explores the performance of currently available tools to predict such sites. Almost seventy research articles published in recognized journals of the N-linked glycosylation field have shortlisted after the rigorous filtering process. The findings of the studies have been reported based on multiple aspects: publication channel, feature set construction method, training algorithm, and performance evaluation. Moreover, a literature survey has developed a taxonomy of N-linked sequence identification. Our study focuses on the performance evaluation criteria, and the importance of N-linked glycosylation motivates us to discover resources that use computational methods instead of the experimental method due to its limitations.

摘要

N-糖基化是最常见的糖基化类型,在识别多种疾病(如I型糖尿病和癌症)中发挥着重要作用,并有助于药物开发。大多数蛋白质在不进行这种修饰的情况下无法发挥其生物学和生理功能。因此,由于实验限制,通过计算技术识别此类位点至关重要。本研究旨在分析和综合利用机器学习方法发现N-糖基化位点的进展。它还探索了当前可用工具预测此类位点的性能。经过严格筛选过程,在N-糖基化领域的知名期刊上发表的近70篇研究文章被列入候选名单。研究结果已基于多个方面进行报告:出版渠道、特征集构建方法、训练算法和性能评估。此外,一项文献调查制定了N-糖基化序列识别的分类法。我们的研究侧重于性能评估标准,N-糖基化的重要性促使我们发现由于实验方法的局限性而使用计算方法的资源。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/8ceb2b77cb84/peerj-cs-08-1069-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/7363ea4c51af/peerj-cs-08-1069-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/b76e0f562a21/peerj-cs-08-1069-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/54bac6228693/peerj-cs-08-1069-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/2738a06a15f2/peerj-cs-08-1069-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/fd57f385c661/peerj-cs-08-1069-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/9ef7f4ac7a0c/peerj-cs-08-1069-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/95f427c0f5ec/peerj-cs-08-1069-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/8ceb2b77cb84/peerj-cs-08-1069-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/7363ea4c51af/peerj-cs-08-1069-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/b76e0f562a21/peerj-cs-08-1069-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/54bac6228693/peerj-cs-08-1069-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/2738a06a15f2/peerj-cs-08-1069-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/fd57f385c661/peerj-cs-08-1069-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/9ef7f4ac7a0c/peerj-cs-08-1069-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/95f427c0f5ec/peerj-cs-08-1069-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa78/9575850/8ceb2b77cb84/peerj-cs-08-1069-g008.jpg

相似文献

1
An analytical study on the identification of N-linked glycosylation sites using machine learning model.基于机器学习模型的N-糖基化位点识别分析研究
PeerJ Comput Sci. 2022 Sep 21;8:e1069. doi: 10.7717/peerj-cs.1069. eCollection 2022.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
GlycoMine: a machine learning-based approach for predicting N-, C- and O-linked glycosylation in the human proteome.糖基分析软件(GlycoMine):一种基于机器学习的方法,用于预测人类蛋白质组中的 N-、C-和 O-糖基化。
Bioinformatics. 2015 May 1;31(9):1411-9. doi: 10.1093/bioinformatics/btu852. Epub 2015 Jan 6.
4
Computational Prediction of N- and O-Linked Glycosylation Sites for Human and Mouse Proteins.计算预测人类和小鼠蛋白质的 N-和 O-连接糖基化位点。
Methods Mol Biol. 2022;2499:177-186. doi: 10.1007/978-1-0716-2317-6_9.
5
EMNGly: predicting N-linked glycosylation sites using the language models for feature extraction.EMNGly:使用特征提取的语言模型预测 N-连接糖基化位点。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad650.
6
Incorporating a transfer learning technique with amino acid embeddings to efficiently predict N-linked glycosylation sites in ion channels.将迁移学习技术与氨基酸嵌入相结合,以有效预测离子通道中的 N-连接糖基化位点。
Comput Biol Med. 2021 Mar;130:104212. doi: 10.1016/j.compbiomed.2021.104212. Epub 2021 Jan 7.
7
Glycosylation site prediction using ensembles of Support Vector Machine classifiers.使用支持向量机分类器集成进行糖基化位点预测。
BMC Bioinformatics. 2007 Nov 9;8:438. doi: 10.1186/1471-2105-8-438.
8
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
9
Prediction of N-linked glycosylation sites using position relative features and statistical moments.利用位置相关特征和统计矩预测N-糖基化位点
PLoS One. 2017 Aug 10;12(8):e0181966. doi: 10.1371/journal.pone.0181966. eCollection 2017.
10
HOTGpred: Enhancing human O-linked threonine glycosylation prediction using integrated pretrained protein language model-based features and multi-stage feature selection approach.HOTGpred:利用集成的预训练蛋白质语言模型为基础的特征和多阶段特征选择方法增强人类 O-连接 threonine 糖基化预测。
Comput Biol Med. 2024 Sep;179:108859. doi: 10.1016/j.compbiomed.2024.108859. Epub 2024 Jul 18.

本文引用的文献

1
Growing Glycans in Rosetta: Accurate de novo glycan modeling, density fitting, and rational sequon design.在 Rosetta 中生成聚糖:准确从头聚糖建模、密度拟合和合理的序列设计。
PLoS Comput Biol. 2024 Jun 24;20(6):e1011895. doi: 10.1371/journal.pcbi.1011895. eCollection 2024 Jun.
2
GlyNet: a multi-task neural network for predicting protein-glycan interactions.GlyNet:一种用于预测蛋白质-聚糖相互作用的多任务神经网络。
Chem Sci. 2022 May 16;13(22):6669-6686. doi: 10.1039/d1sc05681f. eCollection 2022 Jun 7.
3
DNAPred_Prot: Identification of DNA-Binding Proteins Using Composition- and Position-Based Features.
DNAPred_Prot:利用基于组成和位置的特征识别DNA结合蛋白。
Appl Bionics Biomech. 2022 Apr 13;2022:5483115. doi: 10.1155/2022/5483115. eCollection 2022.
4
A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities.一种有用的凝集素结合指南:57 种独特凝集素特异性的机器学习定向注释。
ACS Chem Biol. 2022 Nov 18;17(11):2993-3012. doi: 10.1021/acschembio.1c00689. Epub 2022 Jan 27.
5
LectinOracle: A Generalizable Deep Learning Model for Lectin-Glycan Binding Prediction.LectinOracle:一种可推广的用于凝集素-聚糖结合预测的深度学习模型。
Adv Sci (Weinh). 2022 Jan;9(1):e2103807. doi: 10.1002/advs.202103807. Epub 2021 Dec 4.
6
Active site variants in STT3A cause a dominant type I congenital disorder of glycosylation with neuromusculoskeletal findings.STT3A 中的活性位点变体导致具有神经肌肉骨骼表现的显性 I 型先天性糖基化障碍。
Am J Hum Genet. 2021 Nov 4;108(11):2130-2144. doi: 10.1016/j.ajhg.2021.09.012. Epub 2021 Oct 14.
7
iGluK-Deep: computational identification of lysine glutarylation sites using deep neural networks with general pseudo amino acid compositions.iGluK-Deep:利用具有通用伪氨基酸组成的深度神经网络对赖氨酸戊二酰化位点进行计算识别。
J Biomol Struct Dyn. 2022;40(22):11691-11704. doi: 10.1080/07391102.2021.1962738. Epub 2021 Aug 16.
8
Alterations in protein expression and site-specific N-glycosylation of prostate cancer tissues.前列腺癌组织中蛋白质表达和糖基化位点的改变。
Sci Rep. 2021 Aug 5;11(1):15886. doi: 10.1038/s41598-021-95417-5.
9
How can artificial intelligence be used for peptidomics?人工智能如何用于肽组学?
Expert Rev Proteomics. 2021 Jul;18(7):527-556. doi: 10.1080/14789450.2021.1962303. Epub 2021 Aug 23.
10
Glycan-mediated molecular interactions in bacterial pathogenesis.聚糖介导的细菌发病机制中的分子相互作用。
Trends Microbiol. 2022 Mar;30(3):254-267. doi: 10.1016/j.tim.2021.06.011. Epub 2021 Jul 14.