Department of Neurology and Rehabilitation, University of Illinois at Chicago, 912 S. Wood Street (MC 796), Chicago, IL, 60612, USA.
BMC Med Inform Decis Mak. 2020 Mar 4;20(1):47. doi: 10.1186/s12911-020-1066-7.
The use of clinical data in electronic health records for machine-learning or data analytics depends on the conversion of free text into machine-readable codes. We have examined the feasibility of capturing the neurological examination as machine-readable codes based on UMLS Metathesaurus concepts.
We created a target ontology for capturing the neurological examination using 1100 concepts from the UMLS Metathesaurus. We created a dataset of 2386 test-phrases based on 419 published neurological cases. We then mapped the test-phrases to the target ontology.
We were able to map all of the 2386 test-phrases to 601 unique UMLS concepts. A neurological examination ontology with 1100 concepts has sufficient breadth and depth of coverage to encode all of the neurologic concepts derived from the 419 test cases. Using only pre-coordinated concepts, component ontologies of the UMLS, such as HPO, SNOMED CT, and OMIM, do not have adequate depth and breadth of coverage to encode the complexity of the neurological examination.
An ontology based on a subset of UMLS has sufficient breadth and depth of coverage to convert deficits from the neurological examination into machine-readable codes using pre-coordinated concepts. The use of a small subset of UMLS concepts for a neurological examination ontology offers the advantage of improved manageability as well as the opportunity to curate the hierarchy and subsumption relationships.
在电子健康记录中使用临床数据进行机器学习或数据分析取决于将自由文本转换为机器可读代码。我们研究了基于 UMLS Metathesaurus 概念捕获神经检查作为机器可读代码的可行性。
我们使用 UMLS Metathesaurus 中的 1100 个概念创建了一个用于捕获神经检查的目标本体。我们基于 419 个已发表的神经病例创建了包含 2386 个测试短语的数据集。然后,我们将测试短语映射到目标本体上。
我们能够将所有 2386 个测试短语映射到 601 个唯一的 UMLS 概念。具有 1100 个概念的神经检查本体具有足够的广度和深度,可以对从 419 个测试案例中得出的所有神经概念进行编码。仅使用预协调的概念,UMLS 的组件本体,如 HPO、SNOMED CT 和 OMIM,没有足够的深度和广度来对神经检查的复杂性进行编码。
基于 UMLS 子集的本体具有足够的广度和深度,可以使用预协调的概念将神经检查中的缺陷转换为机器可读代码。使用 UMLS 概念的一小部分子集来构建神经检查本体具有提高可管理性的优势,并且有机会对层次结构和子集关系进行编辑。