School of Computer Science, University of Manchester, Manchester, UK.
J Am Med Inform Assoc. 2011 Jul-Aug;18(4):432-40. doi: 10.1136/amiajnl-2010-000045. Epub 2011 Apr 21.
(a) To determine the extent and range of errors and issues in the Systematised Nomenclature of Medicine-Clinical Terms (SNOMED CT) hierarchies as they affect two practical projects. (b) To determine the origin of issues raised and propose methods to address them.
The hierarchies for concepts in the Core Problem List Subset published by the Unified Medical Language System were examined for their appropriateness in two applications. Anomalies were traced to their source to determine whether they were simple local errors, systematic inferences propagated by SNOMED's classification process, or the result of problems with SNOMED's schemas. Conclusions were confirmed by showing that altering the root cause and reclassifying had the intended effects, and not others.
Major problems were encountered, involving concepts central to medicine including myocardial infarction, diabetes, and hypertension. Most of the issues raised were systematic. Some exposed fundamental errors in SNOMED's schemas, particularly with regards to anatomy. In many cases, the root cause could only be identified and corrected with the aid of a classifier.
This is a preliminary 'experiment of opportunity.' The results are not exhaustive; nor is consensus on all points definitive.
The SNOMED CT hierarchies cannot be relied upon in their present state in our applications. However, systematic quality assurance and correction are possible and practical but require sound techniques analogous to software engineering and combined lexical and semantic techniques. Until this is done, anyone using SNOMED codes should exercise caution. Errors in the hierarchies, or attempts to compensate for them, are likely to compromise interoperability and meaningful use.
(a) 确定医学系统命名法-临床术语(SNOMED CT)层次结构中错误和问题的程度与范围,因为它们影响两个实际项目。(b) 确定所提出问题的根源,并提出解决方法。
检查统一医学语言系统发布的核心问题列表子集中概念的层次结构在两个应用中的适用性。将异常追溯到其源头,以确定它们是简单的局部错误、由SNOMED分类过程传播的系统性推断,还是SNOMED模式问题的结果。通过表明改变根本原因并重新分类产生了预期效果而非其他效果来证实结论。
遇到了重大问题,涉及医学核心概念,包括心肌梗死、糖尿病和高血压。提出的大多数问题是系统性的。一些问题暴露了SNOMED模式中的根本性错误,特别是在解剖学方面。在许多情况下,只有借助分类器才能识别并纠正根本原因。
这是一次初步的“机会性实验”。结果并不详尽;也并非对所有问题都达成了最终共识。
SNOMED CT层次结构在我们目前的应用状态下不可靠。然而,系统性的质量保证和纠正可行且实际,但需要类似于软件工程的可靠技术以及词汇和语义相结合的技术。在此之前,任何使用SNOMED代码的人都应谨慎。层次结构中的错误或试图弥补这些错误的做法可能会损害互操作性和有意义的使用。