Suppr超能文献

预测建模健康素养中的有效社会人口变量:一种机器学习方法。

Prediction of effective sociodemographic variables in modeling health literacy: A machine learning approach.

机构信息

Malatya Turgut Ozal University, Medicine Faculty, Biostatistics, Malatya, Turkey.

Malatya Turgut Ozal University, Medicine Faculty, Public Health, Malatya, Turkey.

出版信息

Int J Med Inform. 2023 Oct;178:105167. doi: 10.1016/j.ijmedinf.2023.105167. Epub 2023 Aug 1.

Abstract

INTRODUCTION

Health literacy is becoming a more important concept for the effective use of health systems day by day. The main purpose of the study is to determine the importance levels of the variables by using Machine Learning methods in order to determine the main factors affecting health literacy, and to find the most important variables for health literacy.

MATERIAL AND METHODS

1001 participants with a mean age of 18.05 ± 0.81 standard deviations were included in the study. The European Health Literacy Scale was used to determine the health literacy level of the participants. The scale cut-off point is 25, and 516 (51.5%) of the participants have low health literacy and 485 (48.5%) have a high level of health literacy. In the study, XGBoost, random forest, logistic regression models from machine learning methods were used and indexes were calculated.

RESULTS

When the results of XGBoost, random forest, logistic regression models were evaluated, it was found that the model with the best performance was XGBoost. Sensitivity, specificity, F1-score, AUROC and Brier score values for the XGBoost models were obtained as 0.979, 0.965, 0.973, 0.983, 0.054 respectively.

CONCLUSION

It was found that HL levels differed significantly in the variables of gender, age, class, family education, place of residence, economic situation, and covering health expenses (p < 0.05). According to the XGBoost model, it was found that the variable with the highest level of importance was reading the newspaper, while the variable with the lowest level of importance was the educational status of the mother. With the help of the established model, the basic variables that will affect the HL level were determined. The designed model will constitute the basic step of an supporting design system to improve physician-patient communication.

摘要

简介

健康素养对于有效利用医疗系统变得越来越重要。本研究的主要目的是使用机器学习方法确定变量的重要性水平,以确定影响健康素养的主要因素,并找到对健康素养最重要的变量。

材料与方法

本研究纳入了 1001 名年龄在 18.05 ± 0.81 岁的参与者。使用欧洲健康素养量表来确定参与者的健康素养水平。该量表的截断值为 25,其中 516 名(51.5%)参与者的健康素养较低,485 名(48.5%)参与者的健康素养较高。在研究中,使用了机器学习方法中的 XGBoost、随机森林和逻辑回归模型,并计算了相关指标。

结果

当评估 XGBoost、随机森林和逻辑回归模型的结果时,发现性能最佳的模型是 XGBoost。XGBoost 模型的敏感性、特异性、F1 评分、AUROC 和 Brier 评分值分别为 0.979、0.965、0.973、0.983 和 0.054。

结论

研究发现,健康素养水平在性别、年龄、班级、受教育程度、居住地、经济状况和医疗费用支付等变量方面存在显著差异(p<0.05)。根据 XGBoost 模型,发现最重要的变量是阅读报纸,而最重要的变量是母亲的受教育程度。通过建立的模型,确定了将影响 HL 水平的基本变量。设计的模型将构成改善医患沟通的支持设计系统的基本步骤。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验