Institute for Health Informatics, University of Minnesota, 8-100 Phillips Wangensteen Building, 516 Delaware St. SE, Minneapolis, MN, 55455, USA.
OptumLabs Visiting Fellow, Cambridge, MA, USA.
J Med Syst. 2019 May 17;43(7):185. doi: 10.1007/s10916-019-1321-6.
Although machine learning models are increasingly being developed for clinical decision support for patients with type 2 diabetes, the adoption of these models into clinical practice remains limited. Currently, machine learning (ML) models are being constructed on local healthcare systems and are validated internally with no expectation that they would validate externally and thus, are rarely transferrable to a different healthcare system. In this work, we aim to demonstrate that (1) even a complex ML model built on a national cohort can be transferred to two local healthcare systems, (2) while a model constructed on a local healthcare system's cohort is difficult to transfer; (3) we examine the impact of training cohort size on the transferability; and (4) we discuss criteria for external validity. We built a model using our previously published Multi-Task Learning-based methodology on a national cohort extracted from OptumLabs® Data Warehouse and transferred the model to two local healthcare systems (i.e., University of Minnesota Medical Center and Mayo Clinic) for external evaluation. The model remained valid when applied to the local patient populations and performed as well as locally constructed models (concordance: .73-.92), demonstrating transferability. The performance of the locally constructed models reduced substantially when applied to each other's healthcare system (concordance: .62-.90). We believe that our modeling approach, in which a model is learned from a national cohort and is externally validated, produces a transferable model, allowing patients at smaller healthcare systems to benefit from precision medicine.
尽管机器学习模型越来越多地被开发用于 2 型糖尿病患者的临床决策支持,但这些模型在临床实践中的应用仍然有限。目前,机器学习 (ML) 模型正在本地医疗保健系统上构建,并在内部进行验证,没有期望它们能够在外部进行验证,因此很少能够转移到不同的医疗保健系统。在这项工作中,我们旨在证明:(1) 即使是基于国家队列构建的复杂 ML 模型也可以转移到两个本地医疗保健系统;(2) 虽然基于本地医疗保健系统队列构建的模型很难转移;(3) 我们研究了训练队列大小对可转移性的影响;(4) 我们讨论了外部有效性的标准。我们使用之前在 OptumLabs®Data Warehouse 中提取的国家队列上发表的基于多任务学习的方法构建了一个模型,并将该模型转移到两个本地医疗保健系统(即明尼苏达大学医学中心和梅奥诊所)进行外部评估。当应用于本地患者群体时,该模型仍然有效,并且表现与本地构建的模型一样好(一致性:.73-.92),证明了可转移性。当应用于彼此的医疗保健系统时,本地构建的模型的性能大大降低(一致性:.62-.90)。我们相信,我们的建模方法,即从国家队列中学习模型并进行外部验证,可以产生可转移的模型,使较小医疗保健系统的患者能够受益于精准医疗。