• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

欧盟不良反应数据库:标注药物、疾病、靶点及其相互关系。

The EU-ADR corpus: annotated drugs, diseases, targets, and their relationships.

机构信息

Dept. of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands.

出版信息

J Biomed Inform. 2012 Oct;45(5):879-84. doi: 10.1016/j.jbi.2012.04.004. Epub 2012 Apr 25.

DOI:10.1016/j.jbi.2012.04.004
PMID:22554700
Abstract

Corpora with specific entities and relationships annotated are essential to train and evaluate text-mining systems that are developed to extract specific structured information from a large corpus. In this paper we describe an approach where a named-entity recognition system produces a first annotation and annotators revise this annotation using a web-based interface. The agreement figures achieved show that the inter-annotator agreement is much better than the agreement with the system provided annotations. The corpus has been annotated for drugs, disorders, genes and their inter-relationships. For each of the drug-disorder, drug-target, and target-disorder relations three experts have annotated a set of 100 abstracts. These annotated relationships will be used to train and evaluate text-mining software to capture these relationships in texts.

摘要

具有特定实体和关系注释的语料库对于训练和评估从大型语料库中提取特定结构化信息的文本挖掘系统至关重要。在本文中,我们描述了一种方法,其中命名实体识别系统生成第一个注释,注释者使用基于 Web 的界面来修改此注释。所达到的一致性数据表明,注释者之间的一致性远高于与系统提供的注释的一致性。该语料库已针对药物、疾病、基因及其相互关系进行了注释。对于药物-疾病、药物-靶标和靶标-疾病关系中的每一种,三位专家都对一组 100 篇摘要进行了注释。这些已注释的关系将用于培训和评估文本挖掘软件,以捕获文本中的这些关系。

相似文献

1
The EU-ADR corpus: annotated drugs, diseases, targets, and their relationships.欧盟不良反应数据库:标注药物、疾病、靶点及其相互关系。
J Biomed Inform. 2012 Oct;45(5):879-84. doi: 10.1016/j.jbi.2012.04.004. Epub 2012 Apr 25.
2
On the creation of a clinical gold standard corpus in Spanish: Mining adverse drug reactions.关于创建西班牙语临床金标准语料库:挖掘药物不良反应
J Biomed Inform. 2015 Aug;56:318-32. doi: 10.1016/j.jbi.2015.06.016. Epub 2015 Jun 30.
3
Automatic recognition of disorders, findings, pharmaceuticals and body structures from clinical text: an annotation and machine learning study.从临床文本中自动识别疾病、检查结果、药物和身体结构:一项注释与机器学习研究。
J Biomed Inform. 2014 Jun;49:148-58. doi: 10.1016/j.jbi.2014.01.012. Epub 2014 Feb 4.
4
BioCreative V CDR task corpus: a resource for chemical disease relation extraction.生物创意V化学疾病关系提取任务语料库:化学疾病关系提取的资源。
Database (Oxford). 2016 May 9;2016. doi: 10.1093/database/baw068. Print 2016.
5
The EU-ADR Web Platform: delivering advanced pharmacovigilance tools.欧盟药物警戒电子报告平台:提供先进的药物警戒工具。
Pharmacoepidemiol Drug Saf. 2013 May;22(5):459-67. doi: 10.1002/pds.3375. Epub 2012 Dec 4.
6
Collaborative text-annotation resource for disease-centered relation extraction from biomedical text.面向生物医学文本中以疾病为中心的关系提取的协作文本标注资源。
J Biomed Inform. 2009 Oct;42(5):967-77. doi: 10.1016/j.jbi.2009.02.001. Epub 2009 Feb 14.
7
Feature selection techniques for maximum entropy based biomedical named entity recognition.基于最大熵的生物医学命名实体识别的特征选择技术。
J Biomed Inform. 2009 Oct;42(5):905-11. doi: 10.1016/j.jbi.2008.12.012. Epub 2009 Jan 23.
8
Portable automatic text classification for adverse drug reaction detection via multi-corpus training.通过多语料库训练实现用于药物不良反应检测的便携式自动文本分类
J Biomed Inform. 2015 Feb;53:196-207. doi: 10.1016/j.jbi.2014.11.002. Epub 2014 Nov 8.
9
The DDI corpus: an annotated corpus with pharmacological substances and drug-drug interactions.DDI 语料库:一个带有药理学物质和药物相互作用注释的语料库。
J Biomed Inform. 2013 Oct;46(5):914-20. doi: 10.1016/j.jbi.2013.07.011. Epub 2013 Jul 29.
10
Building a comprehensive syntactic and semantic corpus of Chinese clinical texts.构建中文临床文本的综合句法和语义语料库。
J Biomed Inform. 2017 May;69:203-217. doi: 10.1016/j.jbi.2017.04.006. Epub 2017 Apr 9.

引用本文的文献

1
Hybrid deep learning models for text-based identification of gene-disease associations.用于基于文本识别基因-疾病关联的混合深度学习模型。
Bioimpacts. 2025 Jun 28;15:31226. doi: 10.34172/bi.31226. eCollection 2025.
2
Dataset of miRNA-disease relations extracted from textual data using transformer-based neural networks.基于转换器的神经网络从文本数据中提取的 miRNA-疾病关系数据集。
Database (Oxford). 2024 Aug 5;2024. doi: 10.1093/database/baae066.
3
Large Language Models and Genomics for Summarizing the Role of microRNA in Regulating mRNA Expression.
用于总结微小RNA在调节mRNA表达中作用的大型语言模型与基因组学
Biomedicines. 2024 Jul 10;12(7):1535. doi: 10.3390/biomedicines12071535.
4
Transformers and large language models in healthcare: A review.医疗保健中的变压器和大型语言模型:综述。
Artif Intell Med. 2024 Aug;154:102900. doi: 10.1016/j.artmed.2024.102900. Epub 2024 Jun 5.
5
A Study of Biomedical Relation Extraction Using GPT Models.一项使用GPT模型进行生物医学关系提取的研究。
AMIA Jt Summits Transl Sci Proc. 2024 May 31;2024:391-400. eCollection 2024.
6
An in-depth evaluation of federated learning on biomedical natural language processing for information extraction.关于用于信息提取的生物医学自然语言处理中联邦学习的深入评估。
NPJ Digit Med. 2024 May 15;7(1):127. doi: 10.1038/s41746-024-01126-4.
7
Surveying biomedical relation extraction: a critical examination of current datasets and the proposal of a new resource.调查生物医学关系抽取:对当前数据集的批判性考察及新资源的提出。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae132.
8
MKG-GC: A multi-task learning-based knowledge graph construction framework with personalized application to gastric cancer.MKG-GC:一种基于多任务学习的知识图谱构建框架及其在胃癌中的个性化应用
Comput Struct Biotechnol J. 2024 Mar 27;23:1339-1347. doi: 10.1016/j.csbj.2024.03.021. eCollection 2024 Dec.
9
MetaTron: advancing biomedical annotation empowering relation annotation and collaboration.MetaTron:推进生物医学标注,赋能关系标注与协作。
BMC Bioinformatics. 2024 Mar 14;25(1):112. doi: 10.1186/s12859-024-05730-9.
10
BioEGRE: a linguistic topology enhanced method for biomedical relation extraction based on BioELECTRA and graph pointer neural network.BioEGRE:一种基于 BioELECTRA 和图指针神经网络的生物医学关系抽取的语言拓扑增强方法。
BMC Bioinformatics. 2023 Dec 19;24(1):486. doi: 10.1186/s12859-023-05601-9.