Geriatrics Research, Education, and Clinical Care, Tennessee Valley Healthcare System, Veteran's Administration, Nashville, TN 37212, USA.
Int J Med Inform. 2012 Mar;81(3):143-56. doi: 10.1016/j.ijmedinf.2011.11.005. Epub 2012 Jan 12.
The majority of clinical symptoms are stored as free text in the clinical record, and this information can inform clinical decision support and automated surveillance efforts if it can be accurately processed into computer interpretable data.
We developed rule-based algorithms and evaluated a natural language processing (NLP) system for infectious symptom detection using clinical narratives. Training (60) and testing (444) documents were randomly selected from VA emergency department, urgent care, and primary care records. Each document was processed with NLP and independently manually reviewed by two clinicians with adjudication by referee. Infectious symptom detection rules were developed in the training set using keywords and SNOMED-CT concepts, and subsequently evaluated using the testing set.
Overall symptom detection performance was measured with a precision of 0.91, a recall of 0.84, and an F measure of 0.87. Overall symptom detection with assertion performance was measured with a precision of 0.67, a recall of 0.62, and an F measure of 0.64. Among those instances in which the automated system matched the reference set determination for symptom, the system correctly detected 84.7% of positive assertions, 75.1% of negative assertions, and 0.7% of uncertain assertions.
This work demonstrates how processed text could enable detection of non-specific symptom clusters for use in automated surveillance activities.
大部分临床症状以自由文本形式存储在临床记录中,如果能够将其准确地转化为计算机可理解的数据,这些信息可以为临床决策支持和自动化监测工作提供信息。
我们开发了基于规则的算法,并评估了一种使用临床叙述进行传染病症状检测的自然语言处理(NLP)系统。从 VA 急诊部、紧急护理和初级保健记录中随机选择了 60 个训练(60)和 444 个测试文档。每个文档都经过 NLP 处理,并由两名临床医生独立进行人工审查,由裁判进行裁决。传染病症状检测规则是在训练集中使用关键字和 SNOMED-CT 概念开发的,然后在测试集中进行评估。
总体症状检测性能的精度为 0.91,召回率为 0.84,F1 分数为 0.87。带断言的总体症状检测精度为 0.67,召回率为 0.62,F1 分数为 0.64。在自动化系统与参考集确定的症状匹配的情况下,系统正确检测到 84.7%的阳性断言、75.1%的阴性断言和 0.7%的不确定断言。
这项工作展示了如何处理文本以用于自动化监测活动中检测非特异性症状群。