• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从生物医学文本中提取蛋白质相互作用关系的概念识别

Concept recognition for extracting protein interaction relations from biomedical text.

作者信息

Baumgartner William A, Lu Zhiyong, Johnson Helen L, Caporaso J Gregory, Paquette Jesse, Lindemann Anna, White Elizabeth K, Medvedeva Olga, Cohen K Bretonnel, Hunter Lawrence

机构信息

Center for Computational Pharmacology, University of Colorado School of Medicine, Aurora, Colorado 80045, USA.

出版信息

Genome Biol. 2008;9 Suppl 2(Suppl 2):S9. doi: 10.1186/gb-2008-9-s2-s9. Epub 2008 Sep 1.

DOI:10.1186/gb-2008-9-s2-s9
PMID:18834500
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2559993/
Abstract

BACKGROUND

Reliable information extraction applications have been a long sought goal of the biomedical text mining community, a goal that if reached would provide valuable tools to benchside biologists in their increasingly difficult task of assimilating the knowledge contained in the biomedical literature. We present an integrated approach to concept recognition in biomedical text. Concept recognition provides key information that has been largely missing from previous biomedical information extraction efforts, namely direct links to well defined knowledge resources that explicitly cement the concept's semantics. The BioCreative II tasks discussed in this special issue have provided a unique opportunity to demonstrate the effectiveness of concept recognition in the field of biomedical language processing.

RESULTS

Through the modular construction of a protein interaction relation extraction system, we present several use cases of concept recognition in biomedical text, and relate these use cases to potential uses by the benchside biologist.

CONCLUSION

Current information extraction technologies are approaching performance standards at which concept recognition can begin to deliver high quality data to the benchside biologist. Our system is available as part of the BioCreative Meta-Server project and on the internet http://bionlp.sourceforge.net.

摘要

背景

可靠的信息提取应用一直是生物医学文本挖掘领域长期追求的目标,这一目标若能实现,将为实验台旁的生物学家提供有价值的工具,帮助他们完成吸收生物医学文献中知识这一日益艰巨的任务。我们提出了一种用于生物医学文本中概念识别的综合方法。概念识别提供了关键信息,而这些信息在以往的生物医学信息提取工作中大多缺失,即与明确巩固概念语义的定义良好的知识资源的直接链接。本期特刊中讨论的生物创意II任务提供了一个独特的机会,来证明概念识别在生物医学语言处理领域的有效性。

结果

通过蛋白质相互作用关系提取系统的模块化构建,我们展示了生物医学文本中概念识别的几个用例,并将这些用例与实验台旁生物学家的潜在用途联系起来。

结论

当前的信息提取技术正在接近性能标准,在这个标准下概念识别能够开始为实验台旁的生物学家提供高质量的数据。我们的系统作为生物创意元服务器项目的一部分可在互联网http://bionlp.sourceforge.net上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de4b/2559993/f90cf38f2855/gb-2008-9-s2-s9-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de4b/2559993/f90cf38f2855/gb-2008-9-s2-s9-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de4b/2559993/f90cf38f2855/gb-2008-9-s2-s9-1.jpg

相似文献

1
Concept recognition for extracting protein interaction relations from biomedical text.从生物医学文本中提取蛋白质相互作用关系的概念识别
Genome Biol. 2008;9 Suppl 2(Suppl 2):S9. doi: 10.1186/gb-2008-9-s2-s9. Epub 2008 Sep 1.
2
Evaluation of text-mining systems for biology: overview of the Second BioCreative community challenge.生物学文本挖掘系统评估:第二届生物创意社区挑战赛概述
Genome Biol. 2008;9 Suppl 2(Suppl 2):S1. doi: 10.1186/gb-2008-9-s2-s1. Epub 2008 Sep 1.
3
Linking genes to literature: text mining, information extraction, and retrieval applications for biology.将基因与文献相联系:生物学的文本挖掘、信息提取及检索应用
Genome Biol. 2008;9 Suppl 2(Suppl 2):S8. doi: 10.1186/gb-2008-9-s2-s8. Epub 2008 Sep 1.
4
Status of text-mining techniques applied to biomedical text.应用于生物医学文本的文本挖掘技术现状。
Drug Discov Today. 2006 Apr;11(7-8):315-25. doi: 10.1016/j.drudis.2006.02.011.
5
An evaluation of GO annotation retrieval for BioCreAtIvE and GOA.对生物创意(BioCreAtIvE)和基因本体注释(GOA)的基因本体(GO)注释检索的评估。
BMC Bioinformatics. 2005;6 Suppl 1(Suppl 1):S17. doi: 10.1186/1471-2105-6-S1-S17. Epub 2005 May 24.
6
Extracting protein interactions from text with the unified AkaneRE event extraction system.利用统一的 AkaneRE 事件抽取系统从文本中抽取蛋白质相互作用。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Jul-Sep;7(3):442-53. doi: 10.1109/TCBB.2010.46.
7
Gene mention normalization and interaction extraction with context models and sentence motifs.基于上下文模型和句子模式的基因提及规范化与相互作用提取
Genome Biol. 2008;9 Suppl 2(Suppl 2):S14. doi: 10.1186/gb-2008-9-s2-s14. Epub 2008 Sep 1.
8
OpenDMAP: an open source, ontology-driven concept analysis engine, with applications to capturing knowledge regarding protein transport, protein interactions and cell-type-specific gene expression.OpenDMAP:一个开源的、由本体驱动的概念分析引擎,应用于捕获有关蛋白质转运、蛋白质相互作用和细胞类型特异性基因表达的知识。
BMC Bioinformatics. 2008 Jan 31;9:78. doi: 10.1186/1471-2105-9-78.
9
Empirical investigations into full-text protein interaction Article Categorization Task (ACT) in the BioCreative II.5 Challenge.生物创意 II.5 挑战赛中全文蛋白质相互作用文章分类任务(ACT)的实证研究。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Jul-Sep;7(3):421-7. doi: 10.1109/TCBB.2010.49.
10
Automatic extraction of acronym-meaning pairs from MEDLINE databases.从医学文献数据库中自动提取首字母缩略词及其含义对。
Stud Health Technol Inform. 2001;84(Pt 1):371-5.

引用本文的文献

1
Assessment of software testing and quality assurance in natural language processing applications and a linguistically inspired approach to improving it.自然语言处理应用中的软件测试与质量保证评估以及一种受语言启发的改进方法。
Trust Eternal Syst Via Evol Softw Data Knowl (2012). 2013;379:77-90. doi: 10.1007/978-3-642-45260-4_6.
2
PhenoTagger: a hybrid method for phenotype concept recognition using human phenotype ontology.PhenoTagger:一种使用人类表型本体进行表型概念识别的混合方法。
Bioinformatics. 2021 Jul 27;37(13):1884-1890. doi: 10.1093/bioinformatics/btab019.
3
Mapping Phenotypic Information in Heterogeneous Textual Sources to a Domain-Specific Terminological Resource.

本文引用的文献

1
MINT and IntAct contribute to the Second BioCreative challenge: serving the text-mining community with high quality molecular interaction data.MINT和IntAct助力第二届生物创意挑战赛:为文本挖掘社区提供高质量分子相互作用数据。
Genome Biol. 2008;9 Suppl 2(Suppl 2):S5. doi: 10.1186/gb-2008-9-s2-s5. Epub 2008 Sep 1.
2
Overview of the protein-protein interaction annotation extraction task of BioCreative II.生物创意II蛋白质-蛋白质相互作用注释提取任务概述。
Genome Biol. 2008;9 Suppl 2(Suppl 2):S4. doi: 10.1186/gb-2008-9-s2-s4. Epub 2008 Sep 1.
3
Overview of BioCreative II gene normalization.
将异构文本源中的表型信息映射到特定领域的术语资源。
PLoS One. 2016 Sep 19;11(9):e0162287. doi: 10.1371/journal.pone.0162287. eCollection 2016.
4
Methodological Issues in Predicting Pediatric Epilepsy Surgery Candidates Through Natural Language Processing and Machine Learning.通过自然语言处理和机器学习预测小儿癫痫手术候选者的方法学问题
Biomed Inform Insights. 2016 May 22;8:11-8. doi: 10.4137/BII.S38308. eCollection 2016.
5
GNormPlus: An Integrative Approach for Tagging Genes, Gene Families, and Protein Domains.GNormPlus:一种用于标记基因、基因家族和蛋白质结构域的综合方法。
Biomed Res Int. 2015;2015:918710. doi: 10.1155/2015/918710. Epub 2015 Aug 25.
6
Automatically Detecting Failures in Natural Language Processing Tools for Online Community Text.自动检测在线社区文本自然语言处理工具中的故障。
J Med Internet Res. 2015 Aug 31;17(8):e212. doi: 10.2196/jmir.4612.
7
SimConcept: A Hybrid Approach for Simplifying Composite Named Entities in Biomedicine.SimConcept:一种简化生物医学中复合命名实体的混合方法。
ACM BCB. 2014;2014:138-146. doi: 10.1145/2649387.2649420.
8
BC4GO: a full-text corpus for the BioCreative IV GO task.BC4GO:用于生物创意IV基因本体任务的全文语料库。
Database (Oxford). 2014 Jul 28;2014. doi: 10.1093/database/bau074. Print 2014.
9
Chapter 16: text mining for translational bioinformatics.第十六章:转化生物信息学中的文本挖掘。
PLoS Comput Biol. 2013 Apr;9(4):e1003044. doi: 10.1371/journal.pcbi.1003044. Epub 2013 Apr 25.
10
Using rule-based natural language processing to improve disease normalization in biomedical text.基于规则的自然语言处理在生物医学文本疾病标准化中的应用。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):876-81. doi: 10.1136/amiajnl-2012-001173. Epub 2012 Oct 6.
生物创意II基因标准化概述。
Genome Biol. 2008;9 Suppl 2(Suppl 2):S3. doi: 10.1186/gb-2008-9-s2-s3. Epub 2008 Sep 1.
4
Overview of BioCreative II gene mention recognition.生物创意II基因提及识别概述。
Genome Biol. 2008;9 Suppl 2(Suppl 2):S2. doi: 10.1186/gb-2008-9-s2-s2. Epub 2008 Sep 1.
5
OpenDMAP: an open source, ontology-driven concept analysis engine, with applications to capturing knowledge regarding protein transport, protein interactions and cell-type-specific gene expression.OpenDMAP:一个开源的、由本体驱动的概念分析引擎,应用于捕获有关蛋白质转运、蛋白质相互作用和细胞类型特异性基因表达的知识。
BMC Bioinformatics. 2008 Jan 31;9:78. doi: 10.1186/1471-2105-9-78.
6
Corpus refactoring: a feasibility study.语料库重构:一项可行性研究。
J Biomed Discov Collab. 2007 Sep 13;2:4. doi: 10.1186/1747-5333-2-4.
7
Entrez Gene: gene-centered information at NCBI.Entrez基因:美国国立医学图书馆国家生物技术信息中心的基因中心信息。
Nucleic Acids Res. 2007 Jan;35(Database issue):D26-31. doi: 10.1093/nar/gkl993. Epub 2006 Dec 5.
8
Finding GeneRIFs via gene ontology annotations.通过基因本体注释查找基因相关功能信息(GeneRIFs)
Pac Symp Biocomput. 2006:52-63.
9
Enhancing access to the Bibliome: the TREC 2004 Genomics Track.增强对生物信息库的访问:2004年TREC基因组学专题。
J Biomed Discov Collab. 2006 Mar 13;1:3. doi: 10.1186/1747-5333-1-3.
10
BioCreAtIvE task1A: entity identification with a stochastic tagger.生物创意任务1A:使用随机标记器进行实体识别。
BMC Bioinformatics. 2005;6 Suppl 1(Suppl 1):S4. doi: 10.1186/1471-2105-6-S1-S4. Epub 2005 May 24.