Milella Frida, Famiglini Lorenzo, Banfi Giuseppe, Cabitza Federico
IRCCS Istituto Ortopedico Galeazzi, Via Cristina Belgioioso 173, 20157 Milano, Italy.
DISCo, Dipartimento di Informatica, Sistemistica e Comunicazione, University of Milano-Bicocca, Viale Sarca 336, 20126 Milano, Italy.
J Pers Med. 2022 Oct 12;12(10):1706. doi: 10.3390/jpm12101706.
The rise of personalized medicine and its remarkable advancements have revealed new requirements for the availability of appropriate medical decision-making models. Computer science is an area that plays an essential role in the field of personalized medicine, where one of the goals is to provide algorithms and tools to extrapolate knowledge and improve the decision-support process. The minimum clinically important difference (MCID) is the smallest change in PROM scores that patients perceive as meaningful. Treatment that does not achieve the minimum level of improvement is considered inappropriate as well as a potential waste of resources. Using the MCID threshold to identify patients who fail to achieve the minimum change in PROM that results in a meaningful outcome may aid in pre-surgical shared decision-making. The decision tree algorithm is a method for extracting valuable information and providing further meaningful information to the domain expert that supports the decision-making. In the present study, different tools based on machine learning were developed. On the one hand, we compared three XGBoost models to predict the non-achievement of the MCID at six months post-operation in the SF-12 physical score. The prediction score threshold was set to 0.75 to provide three decision-making areas on the basis of the high confidence (HC) intervals; the minority class was re-balanced by weighting the positive class to penalize the loss function (XGBoost cost-sensitive), oversampling the minority class (XGBoost with SMOTE), and re-sampling the negative class (XGBoost with undersampling). On the other hand, we modeled the data through a decision tree (assessment tree), based on different complexity levels, to identify the hidden pattern and to provide a new way to understand possible relationships between the gathered features and the several outcomes. The results showed that all the proposed models were effective as binary classifiers, as they showed moderate predictive performance both regarding the minority or positive class (i.e., our targeted patients, those who will not benefit from surgery) and the negative class. The decision tree visualization can be exploited during the patient assessment status to better understand if those patients will benefit or not from the medical intervention. Both of these tools can come in handy for increasing knowledge about the patient's psychophysical state and for creating an increasingly specialized assessment of the individual patient.
个性化医疗的兴起及其显著进展揭示了对合适的医疗决策模型可用性的新要求。计算机科学是在个性化医疗领域发挥重要作用的一个领域,其目标之一是提供算法和工具来推断知识并改善决策支持过程。最小临床重要差异(MCID)是患者认为有意义的患者报告结局测量(PROM)分数的最小变化。未达到最小改善水平的治疗被认为是不合适的,也是对资源的潜在浪费。使用MCID阈值来识别未能在PROM中实现导致有意义结果的最小变化的患者,可能有助于术前共同决策。决策树算法是一种提取有价值信息并向支持决策的领域专家提供进一步有意义信息的方法。在本研究中,开发了基于机器学习的不同工具。一方面,我们比较了三个XGBoost模型,以预测SF-12身体评分术后六个月未达到MCID的情况。预测分数阈值设置为0.75,以便在高置信区间的基础上提供三个决策区域;通过对正类加权以惩罚损失函数(XGBoost成本敏感)、对少数类进行过采样(带SMOTE的XGBoost)以及对负类进行重采样(带欠采样的XGBoost)来重新平衡少数类。另一方面,我们基于不同的复杂度水平通过决策树(评估树)对数据进行建模,以识别隐藏模式,并提供一种新方法来理解收集到的特征与多个结果之间的可能关系。结果表明,所有提出的模型作为二元分类器都是有效的,因为它们在少数类或正类(即我们的目标患者,那些不会从手术中受益的患者)和负类方面都表现出适度的预测性能。在患者评估状态期间,可以利用决策树可视化来更好地了解这些患者是否会从医疗干预中受益。这两种工具都有助于增加对患者心理生理状态的了解,并对个体患者进行越来越专业化的评估。