• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EDGAR:从生物医学文献中提取药物、基因及关系。

EDGAR: extraction of drugs, genes and relations from the biomedical literature.

作者信息

Rindflesch T C, Tanabe L, Weinstein J N, Hunter L

机构信息

Lister Hill Center, National Library of Medicine, Bethesda, MD 20894, USA.

出版信息

Pac Symp Biocomput. 2000:517-28. doi: 10.1142/9789814447331_0049.

DOI:10.1142/9789814447331_0049
PMID:10902199
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2709525/
Abstract

EDGAR (Extraction of Drugs, Genes and Relations) is a natural language processing system that extracts information about drugs and genes relevant to cancer from the biomedical literature. This automatically extracted information has remarkable potential to facilitate computational analysis in the molecular biology of cancer, and the technology is straightforwardly generalizable to many areas of biomedicine. This paper reports on the mechanisms for automatically generating such assertions and on a simple application, conceptual clustering of documents. The system uses a stochastic part of speech tagger, generates an underspecified syntactic parse and then uses semantic and pragmatic information to construct its assertions. The system builds on two important existing resources: the MEDLINE database of biomedical citations and abstracts and the Unified Medical Language System, which provides syntactic and semantic information about the terms found in biomedical abstracts.

摘要

EDGAR(药物、基因及关系提取系统)是一个自然语言处理系统,它从生物医学文献中提取与癌症相关的药物和基因信息。这种自动提取的信息在促进癌症分子生物学的计算分析方面具有显著潜力,并且该技术可直接推广到生物医学的许多领域。本文报告了自动生成此类断言的机制以及一个简单应用,即文档的概念聚类。该系统使用随机词性标注器,生成未完全指定的句法剖析,然后利用语义和语用信息来构建其断言。该系统基于两个重要的现有资源:生物医学文献引用和摘要的MEDLINE数据库以及统一医学语言系统,后者提供有关生物医学摘要中术语的句法和语义信息。

相似文献

1
EDGAR: extraction of drugs, genes and relations from the biomedical literature.EDGAR:从生物医学文献中提取药物、基因及关系。
Pac Symp Biocomput. 2000:517-28. doi: 10.1142/9789814447331_0049.
2
The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.自然语言处理中领域知识与语言结构的相互作用:解读生物医学文本中的上位命题
J Biomed Inform. 2003 Dec;36(6):462-77. doi: 10.1016/j.jbi.2003.11.003.
3
Effective grading of termhood in biomedical literature.生物医学文献中足月状态的有效分级。
AMIA Annu Symp Proc. 2005;2005:809-13.
4
Terminology-driven mining of biomedical literature.基于术语驱动的生物医学文献挖掘
Bioinformatics. 2003 May 22;19(8):938-43. doi: 10.1093/bioinformatics/btg105.
5
Using SemRep to label semantic relations extracted from clinical text.使用SemRep标记从临床文本中提取的语义关系。
AMIA Annu Symp Proc. 2012;2012:587-95. Epub 2012 Nov 3.
6
Automatic extraction of candidate nomenclature terms using the doublet method.使用双重方法自动提取候选命名术语。
BMC Med Inform Decis Mak. 2005 Oct 18;5:35. doi: 10.1186/1472-6947-5-35.
7
Biomedical question answering using semantic relations.基于语义关系的生物医学问答
BMC Bioinformatics. 2015 Jan 16;16(1):6. doi: 10.1186/s12859-014-0365-3.
8
Comparison of character-level and part of speech features for name recognition in biomedical texts.生物医学文本中用于名称识别的字符级特征与词性特征比较。
J Biomed Inform. 2004 Dec;37(6):423-35. doi: 10.1016/j.jbi.2004.08.008.
9
Extraction of semantic biomedical relations from text using conditional random fields.使用条件随机场从文本中提取语义生物医学关系。
BMC Bioinformatics. 2008 Apr 23;9:207. doi: 10.1186/1471-2105-9-207.
10
Exploiting semantic patterns over biomedical knowledge graphs for predicting treatment and causative relations.利用生物医学知识图谱中的语义模式预测治疗和因果关系。
J Biomed Inform. 2018 Jun;82:189-199. doi: 10.1016/j.jbi.2018.05.003. Epub 2018 May 12.

引用本文的文献

1
Clinical concept annotation with contextual word embedding in active transfer learning environment.主动迁移学习环境下基于上下文词嵌入的临床概念标注
Digit Health. 2024 Dec 19;10:20552076241308987. doi: 10.1177/20552076241308987. eCollection 2024 Jan-Dec.
2
Overview of DrugProt task at BioCreative VII: data and methods for large-scale text mining and knowledge graph generation of heterogenous chemical-protein relations.DrugProt 任务概述在 BioCreative VII 上:大规模文本挖掘和异构化学-蛋白质关系知识图生成的数据和方法。
Database (Oxford). 2023 Nov 28;2023. doi: 10.1093/database/baad080.
3
How Do Your Biomedical Named Entity Recognition Models Generalize to Novel Entities?你的生物医学命名实体识别模型如何推广到新实体?
IEEE Access. 2022 Mar 8;10:31513-31523. doi: 10.1109/ACCESS.2022.3157854. eCollection 2022.
4
Models and Processes to Extract Drug-like Molecules From Natural Language Text.从自然语言文本中提取类药物分子的模型与流程。
Front Mol Biosci. 2021 Aug 30;8:636077. doi: 10.3389/fmolb.2021.636077. eCollection 2021.
5
Extracting Concepts for Precision Oncology from the Biomedical Literature.从生物医学文献中提取精准肿瘤学概念。
AMIA Jt Summits Transl Sci Proc. 2021 May 17;2021:276-285. eCollection 2021.
6
Biomedical document triage using a hierarchical attention-based capsule network.基于层次注意力胶囊网络的生物医学文档分类。
BMC Bioinformatics. 2020 Sep 17;21(Suppl 13):380. doi: 10.1186/s12859-020-03673-5.
7
Chinese Clinical Named Entity Recognition in Electronic Medical Records: Development of a Lattice Long Short-Term Memory Model With Contextualized Character Representations.电子病历中的中文临床命名实体识别:基于上下文特征表示的格长短期记忆模型的开发
JMIR Med Inform. 2020 Sep 4;8(9):e19848. doi: 10.2196/19848.
8
Clinical concept extraction: A methodology review.临床概念提取:方法学综述。
J Biomed Inform. 2020 Sep;109:103526. doi: 10.1016/j.jbi.2020.103526. Epub 2020 Aug 6.
9
AMELIE speeds Mendelian diagnosis by matching patient phenotype and genotype to primary literature.AMELIE 通过将患者的表型和基因型与原始文献相匹配,加速孟德尔遗传病的诊断。
Sci Transl Med. 2020 May 20;12(544). doi: 10.1126/scitranslmed.aau9113.
10
Recent advances in biomedical literature mining.生物医学文献挖掘的最新进展。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa057.

本文引用的文献

1
Developing NLP Tools for Genome Informatics: An Information Extraction Perspective.从信息提取角度开发用于基因组信息学的自然语言处理工具。
Genome Inform Ser Workshop Genome Inform. 1998;9:81-90.
2
Detecting Gene Symbols and Names in Biological Texts: A First Step toward Pertinent Information Extraction.检测生物文本中的基因符号和名称:迈向相关信息提取的第一步。
Genome Inform Ser Workshop Genome Inform. 1998;9:72-80.
3
Constructing biological knowledge bases by extracting information from text sources.通过从文本来源中提取信息来构建生物知识库。
Proc Int Conf Intell Syst Mol Biol. 1999:77-86.
4
Automatic extraction of biological information from scientific text: protein-protein interactions.从科学文本中自动提取生物信息:蛋白质-蛋白质相互作用
Proc Int Conf Intell Syst Mol Biol. 1999:60-7.
5
A gene expression database for the molecular pharmacology of cancer.一个用于癌症分子药理学的基因表达数据库。
Nat Genet. 2000 Mar;24(3):236-44. doi: 10.1038/73439.
6
MedMiner: an Internet text-mining tool for biomedical information, with application to gene expression profiling.MedMiner:一种用于生物医学信息的互联网文本挖掘工具,应用于基因表达谱分析。
Biotechniques. 1999 Dec;27(6):1210-4, 1216-7. doi: 10.2144/99276bc03.
7
Mining molecular binding terminology from biomedical text.从生物医学文本中挖掘分子结合术语。
Proc AMIA Symp. 1999:127-31.
8
An ontology for bioinformatics applications.一种用于生物信息学应用的本体。
Bioinformatics. 1999 Jun;15(6):510-20. doi: 10.1093/bioinformatics/15.6.510.
9
Automatic extraction of keywords from scientific text: application to the knowledge domain of protein families.从科学文本中自动提取关键词:在蛋白质家族知识领域的应用。
Bioinformatics. 1998;14(7):600-7. doi: 10.1093/bioinformatics/14.7.600.
10
Toward information extraction: identifying protein names from biological papers.迈向信息提取:从生物学论文中识别蛋白质名称。
Pac Symp Biocomput. 1998:707-18.