Literature mining and database annotation of protein phosphorylation using a rule-based system.

Author information

Hu Z Z, Narayanaswamy M, Ravikumar K E, Vijay-Shanker K, Wu C H

Affiliation

Department of Biochemistry and Molecular Biology, Georgetown University Medical Center, Washington, DC 20057, USA.

Publication information

Bioinformatics. 2005 Jun 1;21(11):2759-65. doi: 10.1093/bioinformatics/bti390. Epub 2005 Apr 6.

DOI:10.1093/bioinformatics/bti390

PMID:15814565

Abstract

MOTIVATION

A large volume of experimental data on protein phosphorylation is buried in the fast-growing PubMed literature. While of great value, such information is limited in databases owing to the laborious process of literature-based curation. Computational literature mining holds promise to facilitate database curation.

RESULTS

A rule-based system, RLIMS-P (Rule-based LIterature Mining System for Protein Phosphorylation), was used to extract protein phosphorylation information from MEDLINE abstracts. An annotation-tagged literature corpus developed at PIR was used to evaluate the system for finding phosphorylation papers and extracting phosphorylation objects (kinases, substrates and sites) from abstracts. RLIMS-P achieved a precision and recall of 91.4 and 96.4% for paper retrieval, and of 97.9 and 88.0% for extraction of substrates and sites. Coupling the high recall for paper retrieval and high precision for information extraction, RLIMS-P facilitates literature mining and database annotation of protein phosphorylation.
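The abstract reports precision and recall separately for each task. As a rough illustration of how the two figures combine, the sketch below computes the F1 score (the harmonic mean of precision and recall) from the reported percentages. The F1 values are derived here and are not stated in the abstract; the function name `f1_score` and the dictionary layout are assumptions made for this example.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall.

    Both arguments must use the same scale (both fractions or both
    percentages); the result is on that same scale.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall (in percent) as reported in the abstract.
reported = {
    "paper retrieval": (91.4, 96.4),
    "substrate and site extraction": (97.9, 88.0),
}

# Derived F1 per task (not a figure given in the abstract).
f1_by_task = {task: f1_score(p, r) for task, (p, r) in reported.items()}

for task, f1 in f1_by_task.items():
    print(f"{task}: F1 = {f1:.1f}%")
```

Under these inputs the derived F1 is about 93.8% for paper retrieval and about 92.7% for substrate and site extraction, consistent with the abstract's framing of high recall for retrieval paired with high precision for extraction.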

Similar articles

Extracting human protein interactions from MEDLINE using a full-sentence parser.

Bioinformatics. 2004 Mar 22;20(5):604-11. doi: 10.1093/bioinformatics/btg452. Epub 2004 Jan 22.

Protein annotation by EBIMed.

Nat Biotechnol. 2006 Aug;24(8):902-3. doi: 10.1038/nbt0806-902.

Recognizing names in biomedical texts: a machine learning approach.

Bioinformatics. 2004 May 1;20(7):1178-90. doi: 10.1093/bioinformatics/bth060. Epub 2004 Feb 10.

Corpus annotation for mining biomedical events from literature.

BMC Bioinformatics. 2008 Jan 8;9:10. doi: 10.1186/1471-2105-9-10.

Discovering patterns to extract protein-protein interactions from the literature: Part II.

Bioinformatics. 2005 Aug 1;21(15):3294-300. doi: 10.1093/bioinformatics/bti493. Epub 2005 May 12.

Bioinformatics. 2006 Sep 15;22(18):2298-304. doi: 10.1093/bioinformatics/btl388. Epub 2006 Aug 22.

A multi-level text mining method to extract biological relationships.

Proc IEEE Comput Soc Bioinform Conf. 2002;1:97-108.

Automatic recognition of topic-classified relations between prostate cancer and genes using MEDLINE abstracts.

BMC Bioinformatics. 2006 Nov 24;7 Suppl 3(Suppl 3):S4. doi: 10.1186/1471-2105-7-S3-S4.

Automatic extraction of gene/protein biological functions from biomedical text.

Bioinformatics. 2005 Apr 1;21(7):1227-36. doi: 10.1093/bioinformatics/bti084. Epub 2004 Oct 27.

Cited by

Integrating Multi-Omics Data to Construct Reliable Interconnected Models of Signaling, Gene Regulatory, and Metabolic Pathways.

Methods Mol Biol. 2023;2634:139-151. doi: 10.1007/978-1-0716-3008-2_6.

Text Mining and Machine Learning Protocol for Extracting Human-Related Protein Phosphorylation Information from PubMed.

Methods Mol Biol. 2022;2496:159-177. doi: 10.1007/978-1-0716-2305-3_9.

Humans and machines in biomedical knowledge curation: hypertrophic cardiomyopathy molecular mechanisms' representation.

BioData Min. 2021 Oct 2;14(1):45. doi: 10.1186/s13040-021-00279-2.

Utilizing image and caption information for biomedical document classification.

Bioinformatics. 2021 Jul 12;37(Suppl_1):i468-i476. doi: 10.1093/bioinformatics/btab331.

ANDDigest: a new web-based module of ANDSystem for the search of knowledge in the scientific literature.

BMC Bioinformatics. 2020 Sep 14;21(Suppl 11):228. doi: 10.1186/s12859-020-03557-8.

In silico insights on diverse interacting partners and phosphorylation sites of respiratory burst oxidase homolog (Rbohs) gene families from Arabidopsis and rice.

BMC Plant Biol. 2018 Aug 10;18(1):161. doi: 10.1186/s12870-018-1378-2.

Biomolecular Relationships Discovered from Biological Labyrinth and Lost in Ocean of Literature: Community Efforts Can Rescue Until Automated Artificial Intelligence Takes Over.

Front Genet. 2016 Mar 31;7:46. doi: 10.3389/fgene.2016.00046. eCollection 2016.

RLIMS-P 2.0: A Generalizable Rule-Based Information Extraction System for Literature Mining of Protein Phosphorylation Information.

IEEE/ACM Trans Comput Biol Bioinform. 2015 Jan-Feb;12(1):17-29. doi: 10.1109/TCBB.2014.2372765.

PALM-IST: Pathway Assembly from Literature Mining--an Information Search Tool.

Sci Rep. 2015 May 19;5:10021. doi: 10.1038/srep10021.

A generalizable NLP framework for fast development of pattern-based biomedical relation extraction systems.

BMC Bioinformatics. 2014 Aug 23;15(1):285. doi: 10.1186/1471-2105-15-285.
