• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RysannMD:一款兼顾速度与准确性的生物医学语义注释工具。

RysannMD: A biomedical semantic annotator balancing speed and accuracy.

作者信息

Cuzzola John, Jovanović Jelena, Bagheri Ebrahim

机构信息

Laboratory for Systems, Software and Semantics (LS3), Ryerson University, Ontario, Canada(1).

Faculty of Organizational Sciences (FOS), University of Belgrade, Belgrade, Serbia(2).

出版信息

J Biomed Inform. 2017 Jul;71:91-109. doi: 10.1016/j.jbi.2017.05.016. Epub 2017 May 26.

DOI:10.1016/j.jbi.2017.05.016
PMID:28552401
Abstract

Recently, both researchers and practitioners have explored the possibility of semantically annotating large and continuously evolving collections of biomedical texts such as research papers, medical reports, and physician notes in order to enable their efficient and effective management and use in clinical practice or research laboratories. Such annotations can be automatically generated by biomedical semantic annotators - tools that are specifically designed for detecting and disambiguating biomedical concepts mentioned in text. The biomedical community has already presented several solid automated semantic annotators. However, the existing tools are either strong in their disambiguation capacity, i.e., the ability to identify the correct biomedical concept for a given piece of text among several candidate concepts, or they excel in their processing time, i.e., work very efficiently, but none of the semantic annotation tools reported in the literature has both of these qualities. In this paper, we present RysannMD (Ryerson Semantic Annotator for Medical Domain), a biomedical semantic annotation tool that strikes a balance between processing time and performance while disambiguating biomedical terms. In other words, RysannMD provides reasonable disambiguation performance when choosing the right sense for a biomedical term in a given context, and does that in a reasonable time. To examine how RysannMD stands with respect to the state of the art biomedical semantic annotators, we have conducted a series of experiments using standard benchmarking corpora, including both gold and silver standards, and four modern biomedical semantic annotators, namely cTAKES, MetaMap, NOBLE Coder, and Neji. The annotators were compared with respect to the quality of the produced annotations measured against gold and silver standards using precision, recall, and F measure and speed, i.e., processing time. In the experiments, RysannMD achieved the best median F measure across the benchmarking corpora, independent of the standard used (silver/gold), biomedical subdomain, and document size. In terms of the annotation speed, RysannMD scored the second best median processing time across all the experiments. The obtained results indicate that RysannMD offers the best performance among the examined semantic annotators when both quality of annotation and speed are considered simultaneously.

摘要

最近,研究人员和从业者都在探索对大量且不断发展的生物医学文本集合(如研究论文、医学报告和医生笔记)进行语义标注的可能性,以便在临床实践或研究实验室中对其进行高效管理和利用。此类标注可由生物医学语义标注工具自动生成,这些工具是专门设计用于检测和消除文本中提及的生物医学概念的歧义的。生物医学领域已经出现了几种可靠的自动语义标注工具。然而,现有工具要么在消除歧义能力方面表现出色,即能够在几个候选概念中为给定文本识别出正确的生物医学概念,要么在处理时间方面表现出色,即工作效率非常高,但文献中报道的语义标注工具都不具备这两种特性。在本文中,我们介绍了RysannMD(瑞尔森医学领域语义标注器),这是一种生物医学语义标注工具,在消除生物医学术语歧义的同时,在处理时间和性能之间取得了平衡。换句话说,RysannMD在给定上下文中为生物医学术语选择正确语义时提供了合理的消除歧义性能,并且能在合理的时间内完成。为了研究RysannMD相对于现有生物医学语义标注器的表现,我们使用标准基准语料库进行了一系列实验,包括金标准和银标准,以及四个现代生物医学语义标注器,即cTAKES、MetaMap、NOBLE Coder和Neji。这些标注器在根据金标准和银标准使用精确率、召回率和F值以及速度(即处理时间)来衡量所生成标注的质量方面进行了比较。在实验中,无论使用何种标准(银标准/金标准)、生物医学子领域和文档大小,RysannMD在基准语料库中均取得了最佳的中位数F值。在标注速度方面,RysannMD在所有实验中的中位数处理时间排名第二。所得结果表明,在同时考虑标注质量和速度时,RysannMD在所研究的语义标注器中表现最佳。

相似文献

1
RysannMD: A biomedical semantic annotator balancing speed and accuracy.RysannMD:一款兼顾速度与准确性的生物医学语义注释工具。
J Biomed Inform. 2017 Jul;71:91-109. doi: 10.1016/j.jbi.2017.05.016. Epub 2017 May 26.
2
Semantic annotation in biomedicine: the current landscape.生物医学中的语义标注:现状
J Biomed Semantics. 2017 Sep 22;8(1):44. doi: 10.1186/s13326-017-0153-x.
3
SIFR annotator: ontology-based semantic annotation of French biomedical text and clinical notes.SIFR 标注器:基于本体论的法语生物医学文本和临床笔记的语义标注。
BMC Bioinformatics. 2018 Nov 6;19(1):405. doi: 10.1186/s12859-018-2429-2.
4
Generation of silver standard concept annotations from biomedical texts with special relevance to phenotypes.从与表型特别相关的生物医学文本中生成银标准概念注释。
PLoS One. 2015 Jan 21;10(1):e0116040. doi: 10.1371/journal.pone.0116040. eCollection 2015.
5
A multilingual gold-standard corpus for biomedical concept recognition: the Mantra GSC.用于生物医学概念识别的多语言金标准语料库:Mantra GSC。
J Am Med Inform Assoc. 2015 Sep;22(5):948-56. doi: 10.1093/jamia/ocv037. Epub 2015 May 6.
6
NCBI disease corpus: a resource for disease name recognition and concept normalization.NCBI疾病语料库:一种用于疾病名称识别和概念规范化的资源。
J Biomed Inform. 2014 Feb;47:1-10. doi: 10.1016/j.jbi.2013.12.006. Epub 2014 Jan 3.
7
Assessment of disease named entity recognition on a corpus of annotated sentences.基于带注释句子语料库的疾病命名实体识别评估。
BMC Bioinformatics. 2008 Apr 11;9 Suppl 3(Suppl 3):S3. doi: 10.1186/1471-2105-9-S3-S3.
8
Use of "off-the-shelf" information extraction algorithms in clinical informatics: A feasibility study of MetaMap annotation of Italian medical notes.临床信息学中“现成可用”信息提取算法的应用:意大利医学记录的MetaMap注释可行性研究。
J Biomed Inform. 2016 Oct;63:22-32. doi: 10.1016/j.jbi.2016.07.017. Epub 2016 Jul 18.
9
Building a comprehensive syntactic and semantic corpus of Chinese clinical texts.构建中文临床文本的综合句法和语义语料库。
J Biomed Inform. 2017 May;69:203-217. doi: 10.1016/j.jbi.2017.04.006. Epub 2017 Apr 9.
10
Semantic biomedical resource discovery: a Natural Language Processing framework.语义生物医学资源发现:一种自然语言处理框架。
BMC Med Inform Decis Mak. 2015 Sep 30;15:77. doi: 10.1186/s12911-015-0200-4.

引用本文的文献

1
An overview of biomedical entity linking throughout the years.生物医学实体链接概述。
J Biomed Inform. 2023 Jan;137:104252. doi: 10.1016/j.jbi.2022.104252. Epub 2022 Dec 2.
2
Parallel sequence tagging for concept recognition.并行序列标注用于概念识别。
BMC Bioinformatics. 2022 Mar 24;22(Suppl 1):623. doi: 10.1186/s12859-021-04511-y.
3
Text mining to support abstract screening for knowledge syntheses: a semi-automated workflow.文本挖掘支持知识综合的摘要筛选:一种半自动化工作流程。
Syst Rev. 2021 May 26;10(1):156. doi: 10.1186/s13643-021-01700-x.
4
Transformation of Pathology Reports Into the Common Data Model With Oncology Module: Use Case for Colon Cancer.将病理学报告转化为带有肿瘤学模块的通用数据模型:结肠癌用例。
J Med Internet Res. 2020 Dec 9;22(12):e18526. doi: 10.2196/18526.
5
OGER++: hybrid multi-type entity recognition.OGER++:混合多类型实体识别
J Cheminform. 2019 Jan 21;11(1):7. doi: 10.1186/s13321-018-0326-3.
6
As Ontologies Reach Maturity, Artificial Intelligence Starts Being Fully Efficient: Findings from the Section on Knowledge Representation and Management for the Yearbook 2018.随着本体论走向成熟,人工智能开始充分发挥效力:《2018年年鉴》知识表示与管理板块的研究结果
Yearb Med Inform. 2018 Aug;27(1):140-145. doi: 10.1055/s-0038-1667078. Epub 2018 Aug 29.
7
UMLS to DBPedia link discovery through circular resolution.通过循环解析发现 UMLS 到 DBPedia 的链接。
J Am Med Inform Assoc. 2018 Jul 1;25(7):819-826. doi: 10.1093/jamia/ocy021.