Wi Chung-Il, Sohn Sunghwan, Rolfes Mary C, Seabright Alicia, Ryu Euijung, Voge Gretchen, Bachman Kay A, Park Miguel A, Kita Hirohito, Croghan Ivana T, Liu Hongfang, Juhn Young J
1 Department of Pediatric and Adolescent Medicine.
2 Asthma Epidemiology Research Unit.
Am J Respir Crit Care Med. 2017 Aug 15;196(4):430-437. doi: 10.1164/rccm.201610-2006OC.
Difficulty of asthma ascertainment and its associated methodologic heterogeneity have created significant barriers to asthma care and research.
We evaluated the validity of an existing natural language processing (NLP) algorithm for asthma criteria to enable an automated chart review using electronic medical records (EMRs).
The study was designed as a retrospective birth cohort study using a random sample of 500 subjects from the 1997-2007 Mayo Birth Cohort who were born at Mayo Clinic and enrolled in primary pediatric care at Mayo Clinic Rochester. Performance of NLP-based asthma ascertainment using predetermined asthma criteria was assessed by determining both criterion validity (chart review of EMRs by abstractor as a gold standard) and construct validity (association with known risk factors for asthma, such as allergic rhinitis).
After excluding three subjects whose respiratory symptoms could be attributed to other conditions (e.g., tracheomalacia), among the remaining eligible 497 subjects, 51% were male, 77% white persons, and the median age at last follow-up date was 11.5 years. The asthma prevalence was 31% in the study cohort. Sensitivity, specificity, positive predictive value, and negative predictive value for NLP algorithm in predicting asthma status were 97%, 95%, 90%, and 98%, respectively. The risk factors for asthma (e.g., allergic rhinitis) that were identified either by NLP or the abstractor were the same.
Asthma ascertainment through NLP should be considered in the era of EMRs because it can enable large-scale clinical studies in a more time-efficient manner and improve the recognition and care of childhood asthma in practice.
哮喘确诊的困难及其相关的方法学异质性给哮喘护理和研究造成了重大障碍。
我们评估了一种现有的用于哮喘标准的自然语言处理(NLP)算法的有效性,以便能够使用电子病历(EMR)进行自动图表审查。
该研究设计为一项回顾性出生队列研究,从1997 - 2007年梅奥出生队列中随机抽取500名受试者,这些受试者在梅奥诊所出生,并在罗切斯特梅奥诊所登记接受初级儿科护理。使用预先确定的哮喘标准,通过确定标准效度(以摘要员对EMR的图表审查作为金标准)和结构效度(与已知的哮喘风险因素如过敏性鼻炎的关联)来评估基于NLP的哮喘确诊性能。
在排除三名呼吸症状可归因于其他疾病(如气管软化)的受试者后,在其余符合条件的497名受试者中,51%为男性,77%为白人,最后随访日期的中位年龄为11.5岁。研究队列中的哮喘患病率为31%。NLP算法预测哮喘状态的敏感性、特异性、阳性预测值和阴性预测值分别为97%、95%、90%和98%。通过NLP或摘要员确定的哮喘风险因素(如过敏性鼻炎)是相同的。
在电子病历时代,应考虑通过NLP进行哮喘确诊,因为它能够以更高效的方式开展大规模临床研究,并在实践中改善儿童哮喘的识别和护理。