Liu Chang, Liu Zhangdaihong, Liu Jingjing, Cai Chenglai, Clifton David A, Wang Hui, Yang Yang
School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China.
Oxford Suzhou Center For Advanced Research, Suzhou, Jiangsu 215124, China.
J Am Med Inform Assoc. 2025 Jun 11. doi: 10.1093/jamia/ocaf070.
This study aims to tackle the critical challenge of adapting deep learning (DL) models for deployment in real-world healthcare settings, specifically focusing on catastrophic forgetting due to distribution shifts between hospital and non-hospital environments. Metabolic syndrome (MetS) is susceptible to misdiagnosis by DL models due to distribution shifts. This work demonstrates the potential of continual learning (CL) to enhance model performance in MetS identification across diverse settings.
We utilized 3 healthcare datasets from 2 distinct settings: Medical Information Mart for Intensive Care (MIMIC; from a hospital setting), National Health and Nutrition Examination Survey (NHANES), and a proprietary dataset (both from non-hospital settings). We proposed a robust MetS identification pipeline with CL strategies and evaluated its effectiveness in mitigating catastrophic forgetting while maintaining high predictive performance under distribution shift.
The CL method outperformed the control (sequential training without any strategies) method. The CL method reached a cumulative area under the ROC curve (AUROC) of 0.85 and area under the precision-recall curve of 0.65 on the combined test set. Moreover, training order proved critical: models trained from hospital to non-hospital settings achieved a 7.6% improvement in AUROC, increasing from 0.79 to 0.85, compared to the reverse order.
Our results demonstrate the potential of CL for applications across healthcare settings, particularly between hospital and non-hospital settings. We also discuss the impact of training order on the results.
The proposed CL model effectively mitigates catastrophic forgetting, enhancing the overall performance of DL models. Our results underscore the prospect of CL methods in developing medical DL models and maintaining scalability across diverse healthcare settings.
本研究旨在应对将深度学习(DL)模型应用于实际医疗环境时面临的关键挑战,特别关注医院和非医院环境之间分布变化导致的灾难性遗忘问题。由于分布变化,代谢综合征(MetS)容易被DL模型误诊。这项工作展示了持续学习(CL)在跨不同环境的MetS识别中提升模型性能的潜力。
我们使用了来自2个不同环境的3个医疗数据集:重症监护医学信息库(MIMIC;来自医院环境)、国家健康与营养检查调查(NHANES)以及一个专有数据集(均来自非医院环境)。我们提出了一种具有CL策略的强大的MetS识别流程,并评估了其在减轻灾难性遗忘同时在分布变化下保持高预测性能的有效性。
CL方法优于对照(无任何策略的顺序训练)方法。在组合测试集上,CL方法的ROC曲线下累积面积(AUROC)达到0.85,精确召回曲线下面积达到0.65。此外,训练顺序被证明至关重要:与相反顺序相比,从医院到非医院环境训练的模型在AUROC上提高了7.6%,从0.79提高到0.85。
我们的结果证明了CL在跨医疗环境应用中的潜力,特别是在医院和非医院环境之间。我们还讨论了训练顺序对结果的影响。
所提出的CL模型有效地减轻了灾难性遗忘,提高了DL模型的整体性能。我们的结果强调了CL方法在开发医学DL模型以及在不同医疗环境中保持可扩展性方面的前景。