• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

以现有本体作为参考标准,对用于实体发现的本体学习方法进行形成性评估。

Formative evaluation of ontology learning methods for entity discovery by using existing ontologies as reference standards.

作者信息

Liu K, Mitchell K J, Chapman W W, Savova G K, Sioutos N, Rubin D L, Crowley R S

机构信息

Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA.

出版信息

Methods Inf Med. 2013;52(4):308-16. doi: 10.3414/ME12-01-0029. Epub 2013 May 13.

DOI:10.3414/ME12-01-0029
PMID:23666409
Abstract

OBJECTIVE

Developing a two-step method for formative evaluation of statistical Ontology Learning (OL) algorithms that leverages existing biomedical ontologies as reference standards.

METHODS

In the first step optimum parameters are established. A 'gap list' of entities is generated by finding the set of entities present in a later version of the ontology that are not present in an earlier version of the ontology. A named entity recognition system is used to identify entities in a corpus of biomedical documents that are present in the 'gap list', generating a reference standard. The output of the algorithm (new entity candidates), produced by statistical methods, is subsequently compared against this reference standard. An OL method that performs perfectly will be able to learn all of the terms in this reference standard. Using evaluation metrics and precision-recall curves for different thresholds and parameters, we compute the optimum parameters for each method. In the second step, human judges with expertise in ontology development evaluate each candidate suggested by the algorithm configured with the optimum parameters previously established. These judgments are used to compute two performance metrics developed from our previous work: Entity Suggestion Rate (ESR) and Entity Acceptance Rate (EAR).

RESULTS

Using this method, we evaluated two statistical OL methods for OL in two medical domains. For the pathology domain, we obtained 49% ESR, 28% EAR with the Lin method and 52% ESR, 39% EAR with the Church method. For the radiology domain, we obtain 87% ESA, 9% EAR using Lin method and 96% ESR, 16% EAR using Church method.

CONCLUSION

This method is sufficiently general and flexible enough to permit comparison of any OL method for a specific corpus and ontology of interest.

摘要

目的

开发一种用于统计本体学习(OL)算法形成性评估的两步法,该方法利用现有的生物医学本体作为参考标准。

方法

第一步是确定最佳参数。通过找出本体的较新版本中存在而较早版本中不存在的实体集来生成实体的“差距列表”。使用命名实体识别系统来识别生物医学文档语料库中存在于“差距列表”中的实体,从而生成参考标准。随后将统计方法产生的算法输出(新实体候选)与该参考标准进行比较。表现完美的OL方法将能够学习该参考标准中的所有术语。使用针对不同阈值和参数的评估指标以及精确率-召回率曲线,我们为每种方法计算最佳参数。第二步,由本体开发方面的专业人员组成的人工评判小组对由配置了先前确定的最佳参数的算法所建议的每个候选进行评估。这些评判用于计算从我们之前的工作中得出的两个性能指标:实体建议率(ESR)和实体接受率(EAR)。

结果

使用这种方法,我们在两个医学领域评估了两种用于本体学习的统计OL方法。对于病理学领域,使用林氏方法我们得到了49%的ESR、28%的EAR,使用丘奇方法得到了52%的ESR、39%的EAR。对于放射学领域,使用林氏方法我们得到了87%的ESA、9%的EAR,使用丘奇方法得到了96%的ESR、16%的EAR。

结论

该方法具有足够的通用性和灵活性,能够对针对特定语料库和感兴趣的本体的任何OL方法进行比较。

相似文献

1
Formative evaluation of ontology learning methods for entity discovery by using existing ontologies as reference standards.以现有本体作为参考标准,对用于实体发现的本体学习方法进行形成性评估。
Methods Inf Med. 2013;52(4):308-16. doi: 10.3414/ME12-01-0029. Epub 2013 May 13.
2
A knowledge-driven approach to biomedical document conceptualization.基于知识的生物医学文献概念化方法。
Artif Intell Med. 2010 Jun;49(2):67-78. doi: 10.1016/j.artmed.2010.02.005. Epub 2010 Apr 3.
3
Assessment of disease named entity recognition on a corpus of annotated sentences.基于带注释句子语料库的疾病命名实体识别评估。
BMC Bioinformatics. 2008 Apr 11;9 Suppl 3(Suppl 3):S3. doi: 10.1186/1471-2105-9-S3-S3.
4
Entity recognition in the biomedical domain using a hybrid approach.使用混合方法进行生物医学领域的实体识别。
J Biomed Semantics. 2017 Nov 9;8(1):51. doi: 10.1186/s13326-017-0157-6.
5
Desiderata for domain reference ontologies in biomedicine.生物医学领域参考本体的必备条件。
J Biomed Inform. 2006 Jun;39(3):307-13. doi: 10.1016/j.jbi.2005.09.002. Epub 2005 Oct 17.
6
Natural language processing algorithms for mapping clinical text fragments onto ontology concepts: a systematic review and recommendations for future studies.自然语言处理算法在将临床文本片段映射到本体概念上的应用:系统评价及对未来研究的建议。
J Biomed Semantics. 2020 Nov 16;11(1):14. doi: 10.1186/s13326-020-00231-z.
7
FMA-RadLex: An application ontology of radiological anatomy derived from the foundational model of anatomy reference ontology.FMA-RadLex:一种源自解剖学参考本体基础模型的放射解剖学应用本体。
AMIA Annu Symp Proc. 2008 Nov 6;2008:465-9.
8
Building integrated ontological knowledge structures with efficient approximation algorithms.使用高效近似算法构建集成本体知识结构。
Biomed Res Int. 2015;2015:501528. doi: 10.1155/2015/501528. Epub 2015 Oct 13.
9
The MedDRA paradox.医学术语词典(MedDRA)悖论。
AMIA Annu Symp Proc. 2008 Nov 6;2008:470-4.
10
From lexical regularities to axiomatic patterns for the quality assurance of biomedical terminologies and ontologies.从词汇规律到公理模式,保障生物医学术语和本体的质量。
J Biomed Inform. 2018 Aug;84:59-74. doi: 10.1016/j.jbi.2018.06.008. Epub 2018 Jun 14.

引用本文的文献

1
Similarity-Based Recommendation of New Concepts to a Terminology.基于相似度向术语表推荐新概念
AMIA Annu Symp Proc. 2015 Nov 5;2015:386-95. eCollection 2015.
2
NOBLE - Flexible concept recognition for large-scale biomedical natural language processing.NOBLE——用于大规模生物医学自然语言处理的灵活概念识别
BMC Bioinformatics. 2016 Jan 14;17:32. doi: 10.1186/s12859-015-0871-y.
3
From bed to bench: bridging from informatics practice to theory: an exploratory analysis.从临床到实验室:架起信息学实践与理论的桥梁:一项探索性分析。
Appl Clin Inform. 2014 Oct 29;5(4):907-15. doi: 10.4338/ACI-2014-10-RA-0095. eCollection 2014.
4
Managing free text for secondary use of health data.管理健康数据二次使用的自由文本。
Yearb Med Inform. 2014 Aug 15;9(1):167-9. doi: 10.15265/IY-2014-0037.