Suppr超能文献

使用 SNOMED CT 对临床数据集进行编码的方法。

A method for encoding clinical datasets with SNOMED CT.

机构信息

School of Health Information Science, University of Victoria, Human & Social Development Building A202, Victoria, BC V8P 5C2, Canada.

出版信息

BMC Med Inform Decis Mak. 2010 Sep 17;10:53. doi: 10.1186/1472-6947-10-53.

Abstract

BACKGROUND

Over the past decade there has been a growing body of literature on how the Systematised Nomenclature of Medicine Clinical Terms (SNOMED CT) can be implemented and used in different clinical settings. Yet, for those charged with incorporating SNOMED CT into their organisation's clinical applications and vocabulary systems, there are few detailed encoding instructions and examples available to show how this can be done and the issues involved. This paper describes a heuristic method that can be used to encode clinical terms in SNOMED CT and an illustration of how it was applied to encode an existing palliative care dataset.

METHODS

The encoding process involves: identifying input data items; cleaning the data items; encoding the cleaned data items; and exporting the encoded terms as output term sets. Four outputs are produced: the SNOMED CT reference set; interface terminology set; SNOMED CT extension set and unencodeable term set.

RESULTS

The original palliative care database contained 211 data elements, 145 coded values and 37,248 free text values. We were able to encode 84% of the terms, another ~8% require further encoding and verification while terms that had a frequency of fewer than five were not encoded (7%).

CONCLUSIONS

From the pilot, it would seem our SNOMED CT encoding method has the potential to become a general purpose terminology encoding approach that can be used in different clinical systems.

摘要

背景

在过去的十年中,关于如何在不同的临床环境中实施和使用系统命名法医学术语集(SNOMED CT)的文献越来越多。然而,对于那些负责将 SNOMED CT 纳入其组织的临床应用和词汇系统的人来说,几乎没有详细的编码说明和示例可以展示如何做到这一点以及涉及的问题。本文描述了一种可以用于对 SNOMED CT 中的临床术语进行编码的启发式方法,并说明了如何将其应用于对现有的姑息治疗数据集进行编码。

方法

编码过程包括:识别输入数据项;清理数据项;对清理后的数据项进行编码;并将编码后的术语作为输出术语集导出。生成四个输出:SNOMED CT 参考集;接口术语集;SNOMED CT 扩展集和无法编码的术语集。

结果

原始的姑息治疗数据库包含 211 个数据元素、145 个编码值和 37,248 个自由文本值。我们能够对约 84%的术语进行编码,另外约 8%的术语需要进一步编码和验证,而频率少于 5 的术语则未进行编码(约 7%)。

结论

从试点情况来看,我们的 SNOMED CT 编码方法似乎有可能成为一种通用的术语编码方法,可以在不同的临床系统中使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe54/2949694/564bf314855f/1472-6947-10-53-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验