Suppr超能文献

迈向开发语义关联参照标准的框架。

Towards a framework for developing semantic relatedness reference standards.

机构信息

College of Pharmacy, University of Minnesota, Twin Cities, Minneapolis, MN 55455, USA.

出版信息

J Biomed Inform. 2011 Apr;44(2):251-65. doi: 10.1016/j.jbi.2010.10.004. Epub 2010 Oct 31.

Abstract

Our objective is to develop a framework for creating reference standards for functional testing of computerized measures of semantic relatedness. Currently, research on computerized approaches to semantic relatedness between biomedical concepts relies on reference standards created for specific purposes using a variety of methods for their analysis. In most cases, these reference standards are not publicly available and the published information provided in manuscripts that evaluate computerized semantic relatedness measurement approaches is not sufficient to reproduce the results. Our proposed framework is based on the experiences of medical informatics and computational linguistics communities and addresses practical and theoretical issues with creating reference standards for semantic relatedness. We demonstrate the use of the framework on a pilot set of 101 medical term pairs rated for semantic relatedness by 13 medical coding experts. While the reliability of this particular reference standard is in the "moderate" range; we show that using clustering and factor analyses offers a data-driven approach to finding systematic differences among raters and identifying groups of potential outliers. We test two ontology-based measures of relatedness and provide both the reference standard containing individual ratings and the R program used to analyze the ratings as open-source. Currently, these resources are intended to be used to reproduce and compare results of studies involving computerized measures of semantic relatedness. Our framework may be extended to the development of reference standards in other research areas in medical informatics including automatic classification, information retrieval from medical records and vocabulary/ontology development.

摘要

我们的目标是为计算机化语义关联功能测试创建参考标准制定框架。目前,关于计算机化生物医学概念之间语义关联的研究依赖于为特定目的创建的参考标准,这些标准采用了各种方法进行分析。在大多数情况下,这些参考标准并未公开,并且评估计算机化语义关联测量方法的手稿中提供的已发表信息不足以重现结果。我们提出的框架基于医学信息学和计算语言学社区的经验,并解决了为语义关联创建参考标准的实际和理论问题。我们在一个由 13 名医学编码专家对 101 对医学术语对进行语义关联评分的试点集中展示了该框架的使用。虽然该特定参考标准的可靠性处于“中等”范围;但我们表明,使用聚类和因子分析可以提供一种数据驱动的方法来发现评分者之间的系统差异,并识别潜在异常值的群体。我们测试了两种基于本体的关联度量方法,并提供了包含个人评分的参考标准以及用于分析评分的 R 程序作为开源资源。目前,这些资源旨在用于重现和比较涉及计算机化语义关联测量的研究结果。我们的框架可以扩展到医学信息学中其他研究领域的参考标准制定,包括自动分类、从医疗记录中检索信息以及词汇/本体开发。

相似文献

1
Towards a framework for developing semantic relatedness reference standards.迈向开发语义关联参照标准的框架。
J Biomed Inform. 2011 Apr;44(2):251-65. doi: 10.1016/j.jbi.2010.10.004. Epub 2010 Oct 31.
2
Measures of semantic similarity and relatedness in the biomedical domain.生物医学领域中语义相似性和相关性的度量。
J Biomed Inform. 2007 Jun;40(3):288-99. doi: 10.1016/j.jbi.2006.06.004. Epub 2006 Jun 10.
3
Corpus domain effects on distributional semantic modeling of medical terms.语料库领域对医学术语分布语义建模的影响。
Bioinformatics. 2016 Dec 1;32(23):3635-3644. doi: 10.1093/bioinformatics/btw529. Epub 2016 Aug 16.
5
Anatomy of data integration.数据集成剖析。
J Biomed Inform. 2007 Jun;40(3):252-69. doi: 10.1016/j.jbi.2006.09.001. Epub 2006 Sep 24.

引用本文的文献

2
Quality of word and concept embeddings in targetted biomedical domains.靶向生物医学领域中词和概念嵌入的质量。
Heliyon. 2023 Jun 2;9(6):e16818. doi: 10.1016/j.heliyon.2023.e16818. eCollection 2023 Jun.
4
Improving medical term embeddings using UMLS Metathesaurus.利用 UMLS 语义学术语表改进医学术语嵌入。
BMC Med Inform Decis Mak. 2022 Apr 29;22(1):114. doi: 10.1186/s12911-022-01850-5.
10

本文引用的文献

4
Influence of the MedDRA hierarchy on pharmacovigilance data mining results.MedDRA 层次结构对药物警戒数据挖掘结果的影响。
Int J Med Inform. 2009 Dec;78(12):e97-e103. doi: 10.1016/j.ijmedinf.2009.01.001. Epub 2009 Feb 18.
5
Predicting judged similarity of natural categories from their neural representations.从自然类别的神经表征预测其被判断的相似性。
Neuropsychologia. 2009 Feb;47(3):859-68. doi: 10.1016/j.neuropsychologia.2008.12.029. Epub 2008 Dec 31.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验