Luo Gang, Stone Bryan L, Koebnick Corinna, He Shan, Au David H, Sheng Xiaoming, Murtaugh Maureen A, Sward Katherine A, Schatz Michael, Zeiger Robert S, Davidson Giana H, Nkoy Flory L
Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, United States.
Department of Pediatrics, University of Utah, Salt Lake City, UT, United States.
JMIR Res Protoc. 2019 Jun 6;8(6):e13783. doi: 10.2196/13783.
Both chronic obstructive pulmonary disease (COPD) and asthma incur heavy health care burdens. To support tailored preventive care for these 2 diseases, predictive modeling is widely used to give warnings and to identify patients for care management. However, 3 gaps exist in current modeling methods owing to rarely factoring in temporal aspects showing trends and early health change: (1) existing models seldom use temporal features and often give late warnings, making care reactive. A health risk is often found at a relatively late stage of declining health, when the risk of a poor outcome is high and resolving the issue is difficult and costly. A typical model predicts patient outcomes in the next 12 months. This often does not warn early enough. If a patient will actually be hospitalized for COPD next week, intervening now could be too late to avoid the hospitalization. If temporal features were used, this patient could potentially be identified a few weeks earlier to institute preventive therapy; (2) existing models often miss many temporal features with high predictive power and have low accuracy. This makes care management enroll many patients not needing it and overlook over half of the patients needing it the most; (3) existing models often give no information on why a patient is at high risk nor about possible interventions to mitigate risk, causing busy care managers to spend more time reviewing charts and to miss suited interventions. Typical automatic explanation methods cannot handle longitudinal attributes and fully address these issues.
To fill these gaps so that more COPD and asthma patients will receive more appropriate and timely care, we will develop comprehensible data-driven methods to provide accurate early warnings of poor outcomes and to suggest tailored interventions, making care more proactive, efficient, and effective.
By conducting a secondary data analysis and surveys, the study will: (1) use temporal features to provide accurate early warnings of poor outcomes and assess the potential impact on prediction accuracy, risk warning timeliness, and outcomes; (2) automatically identify actionable temporal risk factors for each patient at high risk for future hospital use and assess the impact on prediction accuracy and outcomes; and (3) assess the impact of actionable information on clinicians' acceptance of early warnings and on perceived care plan quality.
We are obtaining clinical and administrative datasets from 3 leading health care systems' enterprise data warehouses. We plan to start data analysis in 2020 and finish our study in 2025.
Techniques to be developed in this study can boost risk warning timeliness, model accuracy, and generalizability; improve patient finding for preventive care; help form tailored care plans; advance machine learning for many clinical applications; and be generalized for many other chronic diseases.
INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): PRR1-10.2196/13783.
慢性阻塞性肺疾病(COPD)和哮喘都会带来沉重的医疗负担。为了支持针对这两种疾病的个性化预防护理,预测模型被广泛用于发出警告并识别需要护理管理的患者。然而,由于目前的建模方法很少考虑显示趋势和早期健康变化的时间因素,存在3个差距:(1)现有模型很少使用时间特征,且往往发出较晚的警告,导致护理具有反应性。健康风险通常在健康状况下降的相对晚期才被发现,此时出现不良结果的风险很高,解决问题既困难又昂贵。一个典型的模型预测患者未来12个月的结果。这往往没有足够早地发出警告。如果一名患者下周实际上将因慢性阻塞性肺疾病住院,现在进行干预可能为时已晚,无法避免住院。如果使用时间特征,这名患者可能会提前几周被识别出来,从而开始预防性治疗;(2)现有模型往往遗漏了许多具有高预测力的时间特征,准确性较低。这使得护理管理纳入了许多不需要护理的患者,而忽视了一半以上最需要护理的患者;(3)现有模型通常不提供患者为何处于高风险的信息,也不提供减轻风险的可能干预措施的信息,导致忙碌的护理管理人员花费更多时间查看病历,错过合适的干预措施。典型的自动解释方法无法处理纵向属性,也无法完全解决这些问题。
为了填补这些差距,使更多慢性阻塞性肺疾病和哮喘患者能够获得更合适、更及时的护理,我们将开发可理解的数据驱动方法,以提供不良结果的准确早期预警,并建议个性化干预措施,使护理更加积极主动、高效且有效。
通过进行二次数据分析和调查,本研究将:(1)使用时间特征提供不良结果的准确早期预警,并评估对预测准确性、风险预警及时性和结果的潜在影响;(2)自动识别未来有高住院风险的每位患者的可采取行动的时间风险因素,并评估对预测准确性和结果的影响;(3)评估可采取行动的信息对临床医生接受早期预警以及对感知护理计划质量的影响。
我们正在从3个领先的医疗系统的企业数据仓库中获取临床和管理数据集。我们计划在2020年开始数据分析,并在2025年完成研究。
本研究中开发的技术可以提高风险预警及时性、模型准确性和可推广性;改善预防性护理的患者识别;帮助制定个性化护理计划;推动机器学习在许多临床应用中的发展;并推广到许多其他慢性疾病。
国际注册报告识别码(IRRID):PRR1-10.2196/13783