Tarekegn Adane Nega, Michalak Krzysztof, Costa Giuseppe, Ricceri Fulvio, Giacobini Mario
Department of Information Science and Media Studies, University of Bergen, Bergen, Norway.
Faculty of Computing, Bahir Dar Institute of Technology, Bahir Dar University, Bahir Dar, Ethiopia.
J Healthc Inform Res. 2024 Oct 2;8(4):594-618. doi: 10.1007/s41666-024-00173-6. eCollection 2024 Dec.
Frailty syndrome is prevalent among the elderly, often linked to chronic diseases and resulting in various adverse health outcomes. Existing research has predominantly focused on predicting individual frailty-related outcomes. However, this paper takes a novel approach by framing frailty as a multi-label learning problem, aiming to predict multiple adverse outcomes simultaneously. In the context of multi-label classification, dealing with imbalanced label distribution poses inherent challenges to multi-label prediction. To address this issue, our study proposes a hybrid resampling approach tailored for handling imbalance problems in the multi-label scenario. The proposed resampling technique and prediction tasks were applied to a high-dimensional real-life medical dataset comprising individuals aged 65 years and above. Several multi-label algorithms were employed in the experiment, and their performance was evaluated using multi-label metrics. The results obtained through our proposed approach revealed that the best-performing prediction model achieved an average precision score of 83%. These findings underscore the effectiveness of our method in predicting multiple frailty outcomes from a complex and imbalanced multi-label dataset.
衰弱综合征在老年人中普遍存在,通常与慢性疾病相关,并导致各种不良健康后果。现有研究主要集中在预测个体与衰弱相关的结果。然而,本文采用了一种新颖的方法,将衰弱构建为一个多标签学习问题,旨在同时预测多个不良结果。在多标签分类的背景下,处理不平衡的标签分布对多标签预测构成了固有的挑战。为了解决这个问题,我们的研究提出了一种专门针对处理多标签场景中不平衡问题的混合重采样方法。所提出的重采样技术和预测任务应用于一个包含65岁及以上个体的高维真实医疗数据集。实验中采用了几种多标签算法,并使用多标签指标评估了它们的性能。通过我们提出的方法获得的结果表明,表现最佳的预测模型的平均精确率得分为83%。这些发现强调了我们的方法在从复杂且不平衡的多标签数据集中预测多个衰弱结果方面的有效性。