Grim Stephanie, Kotz Alexander, Kotz Glenn, Halliwell Cat, Thomas John Fred, Kessler Rodger
University of Colorado Anschutz Medical Campus, 13001 East 17th Place, Aurora, CO, 80045, USA.
Mid-Valley Family Practice, Basalt, CO, USA.
Sci Rep. 2024 Dec 3;14(1):30077. doi: 10.1038/s41598-024-80064-3.
Health-related quality of life (HRQol) is a crucial dimension of care outcomes. Many HRQoL measures exist, but methodological and implementation challenges impede primary care (PC) use. We aim to develop and evaluate a novel machine learning (ML) algorithm that predicts binary risk levels among PC patients by combining validated elements from existing measures with demographic data from patient electronic health records (eHR) to increase predictive accuracy while reducing prospectively-collected data required to generate valid risk estimates. Self-report questions from previously validated QoL surveys were collected from PC patients and combined with their demographic and social determinant (SD) data to form a 53-question item bank from which ML chose the most predictive elements. For algorithm development, 375 observations were allocated to training (n = 301, 80%) or test partitions (n = 74, 20%). Questions that asked participants to rate how happy or satisfied they have been with their lives and how easy or hard their emotional health makes work/school showed a good ability to classify participants' mental QoL (98% max balanced accuracy). Questions that asked participants to rate how easy or hard it is to do activities such as walking or climbing stairs and how much pain limits their everyday activities showed ability to classify physical QoL (94% max balanced accuracy). No demographic or SD factors were significantly predictive. Supervised machine learning can inform QoL measurements to reduce data collection, simplify scoring, and allow for meaningful use by clinicians. Results from the current study show that a reduced 4-question model may predict QoL almost as well as a full-length 40-question measure.
健康相关生活质量(HRQol)是护理结果的一个关键维度。存在许多HRQol测量方法,但方法学和实施方面的挑战阻碍了其在初级保健(PC)中的应用。我们旨在开发和评估一种新型机器学习(ML)算法,该算法通过将现有测量方法中的经过验证的要素与患者电子健康记录(eHR)中的人口统计学数据相结合,预测PC患者的二元风险水平,以提高预测准确性,同时减少生成有效风险估计所需的前瞻性收集数据。从PC患者中收集先前经过验证的生活质量调查中的自我报告问题,并将其与他们的人口统计学和社会决定因素(SD)数据相结合,形成一个包含53个问题的题库,ML从中选择最具预测性的要素。对于算法开发,375个观察值被分配到训练组(n = 301,80%)或测试组(n = 74,20%)。那些要求参与者对自己生活的幸福程度或满意度以及情绪健康对工作/学习的难易程度进行评分的问题,显示出对参与者心理生活质量的良好分类能力(最大平衡准确率为98%)。那些要求参与者对诸如步行或爬楼梯等活动的难易程度以及疼痛对日常活动的限制程度进行评分的问题,则显示出对身体生活质量进行分类的能力(最大平衡准确率为94%)。没有人口统计学或SD因素具有显著预测性。监督式机器学习可为生活质量测量提供信息,以减少数据收集、简化评分,并使临床医生能够进行有意义的应用。当前研究结果表明,一个简化的4个问题模型预测生活质量的效果几乎与一个完整的40个问题测量方法相同。