Department of Public Health Sciences, Medical University of South Carolina, Charleston, South Carolina, USA.
Biomedical Informatics Center, Medical University of South Carolina, Charleston, South Carolina, USA.
J Am Med Inform Assoc. 2020 Aug 1;27(8):1321-1325. doi: 10.1093/jamia/ocaa105.
In an effort to improve the efficiency of computer algorithms applied to screening for coronavirus disease 2019 (COVID-19) testing, we used natural language processing and artificial intelligence-based methods with unstructured patient data collected through telehealth visits.
After segmenting and parsing documents, we conducted analysis of overrepresented words in patient symptoms. We then developed a word embedding-based convolutional neural network for predicting COVID-19 test results based on patients' self-reported symptoms.
Text analytics revealed that concepts such as smell and taste were more prevalent than expected in patients testing positive. As a result, screening algorithms were adapted to include these symptoms. The deep learning model yielded an area under the receiver-operating characteristic curve of 0.729 for predicting positive results and was subsequently applied to prioritize testing appointment scheduling.
Informatics tools such as natural language processing and artificial intelligence methods can have significant clinical impacts when applied to data streams early in the development of clinical systems for outbreak response.
为提高计算机算法在新冠病毒 2019 疾病(COVID-19)检测筛查中的应用效率,我们利用自然语言处理和基于人工智能的方法,对远程医疗就诊中收集的非结构化患者数据进行分析。
在对文档进行分段和解析后,我们分析了患者症状中重复出现的词汇。然后,我们开发了一种基于词嵌入的卷积神经网络,根据患者自我报告的症状预测 COVID-19 检测结果。
文本分析显示,在检测呈阳性的患者中,嗅觉和味觉等概念比预期更为普遍。因此,筛选算法被调整以纳入这些症状。该深度学习模型预测阳性结果的受试者工作特征曲线下面积为 0.729,并随后应用于优化检测预约安排。
自然语言处理和人工智能等信息学工具在为应对疫情开发临床系统的早期阶段应用于数据流时,可能会产生重大的临床影响。