Zhu Xinxin, Fan Jung-Wei, Baorto David M, Weng Chunhua, Cimino James J
Department of Biomedical Informatics, Columbia University, 622 West 168th Street, VC-5, New York, NY 10032, USA.
J Biomed Inform. 2009 Jun;42(3):413-25. doi: 10.1016/j.jbi.2009.03.003. Epub 2009 Mar 12.
Although controlled biomedical terminologies have been with us for centuries, it is only in the last couple of decades that close attention has been paid to their quality. The result of this attention has been the development of auditing methods that apply formal techniques to assess whether terminologies are complete and accurate. We performed an extensive literature review to identify published descriptions of these methods and created a framework for characterizing them. The framework considers manual, systematic, and heuristic methods that use knowledge (within or external to the terminology) to measure quality factors of different aspects of the terminology content (terms, semantic classification, and semantic relationships). The quality factors examined include concept orientation, consistency, non-redundancy, soundness, and comprehensive coverage. We reviewed 130 studies retrieved by keyword search of PubMed and present our assessment of how they fit into our framework. We also identify which terminologies have been audited with these methods and provide examples to illustrate each part of the framework.