Shi Zixin, Huang Linjun, Xu Xiaomei, Pu Kexue, Zhang Qingpeng, Wang Haolin
College of Medical Informatics, Chongqing Medical University, 1 Yixueyuan Road, Yuzhong District, Chongqing, 400016, China, 86 13500303273.
Department of Infectious Diseases, Chengdu Fifth People's Hospital, Chengdu, China.
JMIR Med Inform. 2025 Sep 10;13:e63581. doi: 10.2196/63581.
Cirrhosis is a leading cause of noncancer deaths in gastrointestinal diseases, resulting in high hospitalization and readmission rates. Early identification of high-risk patients is vital for proactive interventions and improving health care outcomes. However, the quality and integrity of real-world electronic health records (EHRs) limit their utility in developing risk assessment tools.
Despite the widespread application of classical and ensemble machine learning for EHR-based predictive tasks, the diversity of health conditions among patients and the inherent limitations of the data, such as incompleteness, sparsity, and temporal dynamics, have not been fully addressed. To tackle those challenges, we explored a framework that characterizes patient subgroups and adaptively selects optimal predictive models for each patient on the fly to enable individualized decision support.
The proposed framework uniquely addresses patient heterogeneity by aligning diverse subgroups with dynamically selected classifiers. First, patient subgroups are generated and characterized using rules indicating medical diagnosis patterns. Next, a meta-learning framework trains a meta-classifier for optimal dynamic model selection, which identifies suitable models for individual patients. Notably, we incorporated a tailored region of competence to refine model selection, specifically accounting for cirrhosis complications. This approach not only enhances predictive performance but also elucidates why individualized predictions are better supported by selected classifiers trained on specific data subsets.
The proposed framework was evaluated for predicting 14-day and 30-day readmission in patients with cirrhosis using multicenter data obtained from 6 hospitals. The final dataset comprised 3307 patients with at least 2 admission records, along with a range of factors including demographic information, complications, and laboratory test results. The proposed framework achieved an average AUC (area under the curve) improvement of 5% and 4% compared to the best baseline models, respectively.
By leveraging the expertise of the most competent classifiers for each patient subgroup, our approach enables interpretable training and dynamic selection of heterogeneous predictive models. This advancement not only improves prediction accuracy but also highlights its considerable potential for clinical applications, facilitating the alignment of diverse patient subgroups with tailored decision-support algorithms.
肝硬化是胃肠道疾病中非癌症死亡的主要原因,导致高住院率和再入院率。早期识别高危患者对于积极干预和改善医疗保健结果至关重要。然而,真实世界电子健康记录(EHR)的质量和完整性限制了其在开发风险评估工具中的效用。
尽管经典机器学习和集成机器学习在基于EHR的预测任务中得到了广泛应用,但患者健康状况的多样性以及数据的固有局限性,如不完整性、稀疏性和时间动态性,尚未得到充分解决。为应对这些挑战,我们探索了一个框架,该框架可对患者亚组进行特征描述,并为每个患者即时自适应选择最优预测模型,以实现个性化决策支持。
所提出的框架通过将不同亚组与动态选择的分类器对齐,独特地解决了患者异质性问题。首先,使用指示医学诊断模式的规则生成并表征患者亚组。接下来,一个元学习框架训练一个元分类器以进行最优动态模型选择,该元分类器为个体患者识别合适的模型。值得注意的是,我们纳入了一个定制的能力区域来优化模型选择,特别考虑了肝硬化并发症。这种方法不仅提高了预测性能,还阐明了为何在特定数据子集上训练的选定分类器能更好地支持个性化预测。
使用从6家医院获得的多中心数据,对所提出的框架进行了评估,以预测肝硬化患者的14天和30天再入院情况。最终数据集包括3307例至少有2次入院记录的患者,以及一系列因素,包括人口统计学信息、并发症和实验室检查结果。与最佳基线模型相比,所提出的框架分别实现了平均曲线下面积(AUC)提高5%和4%。
通过利用每个患者亚组中最胜任的分类器的专业知识,我们的方法能够进行可解释的训练和对异质预测模型进行动态选择。这一进展不仅提高了预测准确性,还凸显了其在临床应用中的巨大潜力,促进了不同患者亚组与定制决策支持算法的匹配。