• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种从 PubMed 中提取蛋白质翻译后修饰的文本挖掘和机器学习协议:特别关注糖基化、乙酰化、甲基化、羟化和泛素化。

A Text Mining and Machine Learning Protocol for Extracting Posttranslational Modifications of Proteins from PubMed: A Special Focus on Glycosylation, Acetylation, Methylation, Hydroxylation, and Ubiquitination.

机构信息

Department of Management Studies, Coimbatore Institute of Engineering and Technology, Coimbatore, Tamilnadu, India.

Department of Pharmaceutical Analysis, PSG College of Pharmacy, Coimbatore, Tamilnadu, India.

出版信息

Methods Mol Biol. 2022;2496:179-202. doi: 10.1007/978-1-0716-2305-3_10.

DOI:10.1007/978-1-0716-2305-3_10
PMID:35713865
Abstract

Posttranslational modifications (PTMs) of proteins impart a significant role in human cellular functions ranging from localization to signal transduction. Hundreds of PTMs act in a human cell. Among them, only the selected PTMs are well established and documented. PubMed includes thousands of papers on the selected PTMs, and it is a challenge for the biomedical researchers to assimilate useful information manually. Alternatively, text mining approaches and machine learning algorithm automatically extract the relevant information from PubMed. Protein phosphorylation is a well-established PTM and several research works are under way. Many existing systems are there for protein phosphorylation information extraction. A recent approach uses a hybrid approach using text mining and machine learning to extract protein phosphorylation information from PubMed. Some of the other common PTMs that exhibit similar features in terms of entities that are involved in PTM process, that is, the substrate, the enzymes, and the amino acid residues, are glycosylation, acetylation, methylation, hydroxylation, and ubiquitination. This has motivated us to repurpose and extend the text mining protocol and machine learning information extraction methodology developed for protein phosphorylation to these PTMs. In this chapter, the chemistry behind each of the PTMs is briefly outlined and the text mining protocol and machine learning algorithm adaption is explained for the same.

摘要

蛋白质的翻译后修饰(PTMs)在人类细胞功能中起着重要作用,从定位到信号转导。数百种 PTM 在人类细胞中起作用。其中,只有选定的 PTM 得到了很好的确立和记录。PubMed 包含数千篇关于选定 PTM 的论文,生物医学研究人员手动吸收有用信息是一项挑战。或者,文本挖掘方法和机器学习算法可以自动从 PubMed 中提取相关信息。蛋白质磷酸化是一种成熟的 PTM,目前有许多研究工作正在进行。有许多现有的系统用于提取蛋白质磷酸化信息。最近的一种方法使用混合方法,结合文本挖掘和机器学习,从 PubMed 中提取蛋白质磷酸化信息。其他一些常见的 PTM 也具有类似的特征,涉及 PTM 过程中的实体,即底物、酶和氨基酸残基,如糖基化、乙酰化、甲基化、羟化和泛素化。这促使我们重新利用和扩展为蛋白质磷酸化开发的文本挖掘协议和机器学习信息提取方法。在本章中,简要概述了每种 PTM 的化学原理,并解释了相同的文本挖掘协议和机器学习算法适应。

相似文献

1
A Text Mining and Machine Learning Protocol for Extracting Posttranslational Modifications of Proteins from PubMed: A Special Focus on Glycosylation, Acetylation, Methylation, Hydroxylation, and Ubiquitination.一种从 PubMed 中提取蛋白质翻译后修饰的文本挖掘和机器学习协议:特别关注糖基化、乙酰化、甲基化、羟化和泛素化。
Methods Mol Biol. 2022;2496:179-202. doi: 10.1007/978-1-0716-2305-3_10.
2
Identification, Quantification, and Site Localization of Protein Posttranslational Modifications via Mass Spectrometry-Based Proteomics.通过基于质谱的蛋白质组学对蛋白质翻译后修饰进行鉴定、定量及位点定位
Adv Exp Med Biol. 2016;919:345-382. doi: 10.1007/978-3-319-41448-5_17.
3
Text Mining and Machine Learning Protocol for Extracting Human-Related Protein Phosphorylation Information from PubMed.从 PubMed 中提取与人相关的蛋白质磷酸化信息的文本挖掘和机器学习协议。
Methods Mol Biol. 2022;2496:159-177. doi: 10.1007/978-1-0716-2305-3_9.
4
MPTM: A tool for mining protein post-translational modifications from literature.MPTM:一种从文献中挖掘蛋白质翻译后修饰的工具。
J Bioinform Comput Biol. 2017 Oct;15(5):1740005. doi: 10.1142/S0219720017400054. Epub 2017 Sep 11.
5
Research progress in protein posttranslational modification site prediction.蛋白质翻译后修饰位点预测的研究进展
Brief Funct Genomics. 2018 Jul 22;18(4):220-229. doi: 10.1093/bfgp/ely039.
6
Assays for Posttranslational Modifications of Intermediate Filament Proteins.中间丝蛋白翻译后修饰的检测方法。
Methods Enzymol. 2016;568:113-38. doi: 10.1016/bs.mie.2015.09.005. Epub 2015 Nov 6.
7
Protein Modification and Autophagy Activation.蛋白质修饰与自噬激活。
Adv Exp Med Biol. 2019;1206:237-259. doi: 10.1007/978-981-15-0602-4_12.
8
iPTMnet: an integrated resource for protein post-translational modification network discovery.iPTMnet:一个用于蛋白质翻译后修饰网络发现的综合资源。
Nucleic Acids Res. 2018 Jan 4;46(D1):D542-D550. doi: 10.1093/nar/gkx1104.
9
Unconventional posttranslational modification in innate immunity.先天免疫中的非传统翻译后修饰。
Cell Mol Life Sci. 2024 Jul 6;81(1):290. doi: 10.1007/s00018-024-05319-8.
10
Current Technologies Unraveling the Significance of Post-Translational Modifications (PTMs) as Crucial Players in Neurodegeneration.当前技术揭示了翻译后修饰(PTMs)作为神经退行性变关键因素的重要性。
Biomolecules. 2024 Jan 16;14(1):118. doi: 10.3390/biom14010118.

引用本文的文献

1
GlycoSiteMiner: an ML/AI-assisted literature mining-based pipeline for extracting glycosylation sites from PubMed abstracts.糖基位点挖掘工具(GlycoSiteMiner):一种基于机器学习/人工智能辅助文献挖掘的流程,用于从PubMed摘要中提取糖基化位点。
Glycobiology. 2025 Jun 2;35(7). doi: 10.1093/glycob/cwaf030.

本文引用的文献

1
Evaluation of post-translational modifications in histone proteins: A review on histone modification defects in developmental and neurological disorders.组蛋白翻译后修饰的评估:组蛋白修饰缺陷在发育和神经紊乱中的研究进展。
J Biosci. 2020;45.
2
Global view of human protein glycosylation pathways and functions.人类蛋白糖基化途径和功能的全球视图。
Nat Rev Mol Cell Biol. 2020 Dec;21(12):729-749. doi: 10.1038/s41580-020-00294-x. Epub 2020 Oct 21.
3
Protein Ubiquitination Research in Oncology.肿瘤学中的蛋白质泛素化研究
Klin Onkol. 2019 Fall;32(Supplementum 3):56-64. doi: 10.14735/amko20193S.
4
Protein glycosylation.蛋白质糖基化。
Curr Biol. 2019 Apr 1;29(7):R229-R231. doi: 10.1016/j.cub.2019.01.003.
5
iPTMnet: an integrated resource for protein post-translational modification network discovery.iPTMnet:一个用于蛋白质翻译后修饰网络发现的综合资源。
Nucleic Acids Res. 2018 Jan 4;46(D1):D542-D550. doi: 10.1093/nar/gkx1104.
6
Protein Posttranslational Modifications: Roles in Aging and Age-Related Disease.蛋白质翻译后修饰:在衰老和衰老相关疾病中的作用。
Oxid Med Cell Longev. 2017;2017:5716409. doi: 10.1155/2017/5716409. Epub 2017 Aug 15.
7
Metrics for the Human Proteome Project 2016: Progress on Identifying and Characterizing the Human Proteome, Including Post-Translational Modifications.2016年人类蛋白质组计划指标:人类蛋白质组鉴定与表征进展,包括翻译后修饰
J Proteome Res. 2016 Nov 4;15(11):3951-3960. doi: 10.1021/acs.jproteome.6b00511. Epub 2016 Sep 20.
8
RLIMS-P 2.0: A Generalizable Rule-Based Information Extraction System for Literature Mining of Protein Phosphorylation Information.RLIMS-P 2.0:一种用于蛋白质磷酸化信息文献挖掘的可通用的基于规则的信息提取系统。
IEEE/ACM Trans Comput Biol Bioinform. 2015 Jan-Feb;12(1):17-29. doi: 10.1109/TCBB.2014.2372765.
9
A hybrid named entity tagger for tagging human proteins/genes.一种用于标记人类蛋白质/基因的混合命名实体标记器。
Int J Data Min Bioinform. 2014;10(3):315-28. doi: 10.1504/ijdmb.2014.064545.
10
Protein post-translational modifications and regulation of pluripotency in human stem cells.蛋白质翻译后修饰与人类干细胞多能性的调控
Cell Res. 2014 Feb;24(2):143-60. doi: 10.1038/cr.2013.151. Epub 2013 Nov 12.