Jing Li, Hua Peng, Shumei Zeng, Peng Qing, Wu Weizi, Lv Luofang, Yue Liqing, Jian Zhong Hu, Weihong Huang
Teaching and Research Section of Clinical Nursing, Xiangya Hospital Central South University, Changsha, Hunan, China.
Central South University Xiangya School of Nursing, Changsha, Hunan, China.
BMJ Open. 2025 Jul 5;15(7):e095460. doi: 10.1136/bmjopen-2024-095460.
To develop and validate an interpretable machine learning (ML)-based frailty risk prediction model that combines real-time health data with validated scale assessments for enhanced decision-making and targeted health management in integrated medical and older adult care institutions (IMOACIs) in central China.
Mixed-methods, cross-sectional study.
13 IMOACIs across seven cities in Hunan province, central China, from 8 to 16 July 2022.
Five healthcare experts and two data scientists participated in the requirements analysis stage. A total of 586 older adults were included in the assessment data collection stage, and 15 participants (10 healthcare professionals and five data scientists) were involved in the model evaluation stage.
A collaborative requirements analysis involving healthcare professionals and data scientists guided the design of an interpretable frailty risk prediction model. Five machine learning models were developed and evaluated: logistic regression, support vector machines (SVM), random forest, extreme gradient boosting (XGBoost) and a multimodel ensemble approach. Hyperparameter optimisation was performed using stratified fivefold cross-validation with grid search, incorporating class-weighted loss functions to address class imbalance and model-specific regularisation techniques to maximise performance while preventing overfitting. To enhance interpretability, the model incorporated Shapley Additive Explanations. The final model was integrated into a user-facing platform and validated using cross-sectional standardised assessment data collected from 13 IMOACIs. A mixed-methods evaluation approach combined quantitative performance metrics with qualitative user experience assessments.
The dataset (n=586) was randomly split into training (n=468) and validation (n=118) sets (4:1 ratio). Among models, XGBoost demonstrated superior performance, achieving an accuracy of 0.89 and an area under the receiver operating characteristic curve (AUC) of 0.89 on the training set. On the validation set, the XGBoost model achieved a precision of 0.76, recall of 0.72, F1 score of 0.74, accuracy of 0.83 and AUC of 0.80, outperforming other models. User experience surveys yielded high mean ratings for satisfaction (4.20/5), perceived accuracy (4.20/5), interpretability (4.30/5) and application value (4.10/5). Qualitative analysis of user feedback identified six key themes: practical and application value, performance and data analysis, interpretability and comprehensibility, impact and integration into practice, limitations and areas for improvement, and future development and innovation prospects, highlighting the model's strong potential for practical implementation.
This novel, interpretable ML-based frailty risk prediction model can enhance decision-making in the care of older adults by providing transparent predictions and identifying crucial factors associated with frailty. It establishes a foundation for targeted management and broader ML applications in healthcare systems, such as IMOACIs, particularly in developing regions.
开发并验证一种基于可解释机器学习(ML)的衰弱风险预测模型,该模型将实时健康数据与经过验证的量表评估相结合,以加强中国中部综合医疗和老年护理机构(IMOACI)中的决策制定和针对性健康管理。
混合方法横断面研究。
2022年7月8日至16日,中国中部湖南省七个城市的13家IMOACI。
五名医疗保健专家和两名数据科学家参与了需求分析阶段。共有586名老年人纳入评估数据收集阶段,15名参与者(10名医疗保健专业人员和五名数据科学家)参与了模型评估阶段。
涉及医疗保健专业人员和数据科学家的协作需求分析指导了可解释衰弱风险预测模型的设计。开发并评估了五个机器学习模型:逻辑回归、支持向量机(SVM)、随机森林、极端梯度提升(XGBoost)和多模型集成方法。使用分层五折交叉验证和网格搜索进行超参数优化,纳入类加权损失函数以解决类不平衡问题,并采用特定于模型的正则化技术以在防止过拟合的同时最大化性能。为提高可解释性,模型纳入了夏普利值加法解释(Shapley Additive Explanations)。最终模型被集成到一个面向用户的平台中,并使用从13家IMOACI收集的横断面标准化评估数据进行验证。混合方法评估方法将定量性能指标与定性用户体验评估相结合。
数据集(n = 586)被随机分为训练集(n = 468)和验证集(n = 118)(比例为4:1)。在各模型中,XGBoost表现出卓越性能,在训练集上的准确率达到0.89,受试者工作特征曲线下面积(AUC)为0.89。在验证集上,XGBoost模型的精确率为0.76,召回率为0.72,F1分数为0.74,准确率为0.83,AUC为0.80,优于其他模型。用户体验调查得出的满意度(4.20/5)、感知准确率(4.20/5)、可解释性(4.30/5)和应用价值(4.10/5)的平均评分较高。对用户反馈的定性分析确定了六个关键主题:实际和应用价值、性能和数据分析、可解释性和可理解性、影响和融入实践、局限性和改进领域以及未来发展和创新前景,突出了该模型在实际应用中的强大潜力。
这种基于ML的新型可解释衰弱风险预测模型可通过提供透明预测并识别与衰弱相关的关键因素来加强老年人护理中的决策制定。它为医疗保健系统(如IMOACI,特别是在发展中地区)的针对性管理和更广泛的ML应用奠定了基础。