Center for Oral and Systemic Health, Marshfield Clinic Research Institute, Marshfield, Wisconsin, United States.
Department of Critical Care Medicine, Marshfield Clinic Health System, Marshfield, Wisconsin, United States.
Methods Inf Med. 2022 May;61(1-02):29-37. doi: 10.1055/a-1801-2718. Epub 2022 Mar 17.
The International Classification of Disease (ICD) coding for pneumonia classification is based on causal organism or use of general pneumonia codes, creating challenges for epidemiological evaluations where pneumonia is standardly subtyped by settings, exposures, and time of emergence. Pneumonia subtype classification requires data available in electronic health records (EHRs), frequently in nonstructured formats including radiological interpretation or clinical notes that complicate electronic classification.
The current study undertook development of a rule-based pneumonia subtyping algorithm for stratifying pneumonia by the setting in which it emerged using information documented in the EHR.
Pneumonia subtype classification was developed by interrogating patient information within the EHR of a large private Health System. ICD coding was mined in the EHR applying requirements for "rule of two" pneumonia-related codes or one ICD code and radiologically confirmed pneumonia validated by natural language processing and/or documented antibiotic prescriptions. A rule-based algorithm flow chart was created to support subclassification based on features including symptomatic patient point of entry into the health care system timing of pneumonia emergence and identification of clinical, laboratory, or medication orders that informed definition of the pneumonia subclassification algorithm.
Data from 65,904 study-eligible patients with 91,998 episodes of pneumonia diagnoses documented by 380,509 encounters were analyzed, while 8,611 episodes were excluded following Natural Language Processing classification of pneumonia status as "negative" or "unknown." Subtyping of 83,387 episodes identified: community-acquired (54.5%), hospital-acquired (20%), aspiration-related (10.7%), health care-acquired (5%), and ventilator-associated (0.4%) cases, and 9.4% cases were not classifiable by the algorithm.
Study outcome indicated capacity to achieve electronic pneumonia subtype classification based on interrogation of big data available in the EHR. Examination of portability of the algorithm to achieve rule-based pneumonia classification in other health systems remains to be explored.
肺炎分类的国际疾病分类(ICD)编码基于病原体或使用通用肺炎代码,这给流行病学评估带来了挑战,因为肺炎通常按环境、暴露和出现时间进行亚型分类。肺炎亚型分类需要电子健康记录(EHR)中可用的数据,这些数据通常以非结构化格式存在,包括放射学解释或临床记录,这使得电子分类变得复杂。
本研究开发了一种基于规则的肺炎亚型分类算法,用于根据 EHR 中记录的信息,按出现的环境对肺炎进行分层。
通过查询大型私人医疗系统的 EHR 中的患者信息,开发肺炎亚型分类。在 EHR 中挖掘 ICD 编码,应用“两条规则”肺炎相关代码或一条 ICD 代码和放射学确认的肺炎的要求,通过自然语言处理和/或记录的抗生素处方进行验证。创建了一个基于规则的算法流程图,以支持基于特征的亚分类,这些特征包括有症状的患者进入医疗保健系统的时间点、肺炎出现的时间以及确定肺炎亚分类算法的临床、实验室或药物订单。
对 65904 名符合研究条件的患者的数据进行了分析,这些患者的 91998 例肺炎诊断记录在 380509 次就诊中,而 8611 例肺炎病例在通过自然语言处理对肺炎状态进行分类为“阴性”或“未知”后被排除在外。对 83387 例肺炎进行的亚分类确定了:社区获得性(54.5%)、医院获得性(20%)、吸入性(10.7%)、医疗保健获得性(5%)和呼吸机相关性(0.4%)病例,9.4%的病例无法通过算法进行分类。
研究结果表明,能够基于 EHR 中可用的大数据进行电子肺炎亚型分类。仍然需要探索该算法在其他医疗系统中实现基于规则的肺炎分类的可移植性。