Suppr超能文献

在数据丰富的环境中,使用H2O自动机器学习算法识别患者中基于网络的医疗记录未使用情况的预测因素:混合方法研究。

Using the H2O Automatic Machine Learning Algorithms to Identify Predictors of Web-Based Medical Record Nonuse Among Patients in a Data-Rich Environment: Mixed Methods Study.

作者信息

Chen Yang, Liu Xuejiao, Gao Lei, Zhu Miao, Shia Ben-Chang, Chen Mingchih, Ye Linglong, Qin Lei

机构信息

School of Statistics, University of International Business and Economics, Beijing, China.

School of Law, University of International Business and Economics, Beijing, China.

出版信息

JMIR Med Inform. 2023 Jun 19;11:e41576. doi: 10.2196/41576.

Abstract

BACKGROUND

With the advent of electronic storage of medical records and the internet, patients can access web-based medical records. This has facilitated doctor-patient communication and built trust between them. However, many patients avoid using web-based medical records despite their greater availability and readability.

OBJECTIVE

On the basis of demographic and individual behavioral characteristics, this study explores the predictors of web-based medical record nonuse among patients.

METHODS

Data were collected from the National Cancer Institute 2019 to 2020 Health Information National Trends Survey. First, based on the data-rich environment, the chi-square test (categorical variables) and 2-tailed t tests (continuous variables) were performed on the response variables and the variables in the questionnaire. According to the test results, the variables were initially screened, and those that passed the test were selected for subsequent analysis. Second, participants were excluded from the study if any of the initially screened variables were missing. Third, the data obtained were modeled using 5 machine learning algorithms, namely, logistic regression, automatic generalized linear model, automatic random forest, automatic deep neural network, and automatic gradient boosting machine, to identify and investigate factors affecting web-based medical record nonuse. The aforementioned automatic machine learning algorithms were based on the R interface (R Foundation for Statistical Computing) of the H2O (H2O.ai) scalable machine learning platform. Finally, 5-fold cross-validation was adopted for 80% of the data set, which was used as the training data to determine hyperparameters of 5 algorithms, and 20% of the data set was used as the test data for model comparison.

RESULTS

Among the 9072 respondents, 5409 (59.62%) had no experience using web-based medical records. Using the 5 algorithms, 29 variables were identified as crucial predictors of nonuse of web-based medical records. These 29 variables comprised 6 (21%) sociodemographic variables (age, BMI, race, marital status, education, and income) and 23 (79%) variables related to individual lifestyles and behavioral habits (such as electronic and internet use, individuals' health status and their level of health concern, etc). H2O's automatic machine learning methods have a high model accuracy. On the basis of the performance of the validation data set, the optimal model was the automatic random forest with the highest area under the curve in the validation set (88.52%) and the test set (82.87%).

CONCLUSIONS

When monitoring web-based medical record use trends, research should focus on social factors such as age, education, BMI, and marital status, as well as personal lifestyle and behavioral habits, including smoking, use of electronic devices and the internet, patients' personal health status, and their level of health concern. The use of electronic medical records can be targeted to specific patient groups, allowing more people to benefit from their usefulness.

摘要

背景

随着电子病历存储和互联网的出现,患者可以访问基于网络的病历。这促进了医患沟通并在他们之间建立了信任。然而,尽管基于网络的病历更易获取且可读性更强,但许多患者仍避免使用。

目的

基于人口统计学和个体行为特征,本研究探讨患者中不使用基于网络病历的预测因素。

方法

数据收集自美国国家癌症研究所2019年至2020年健康信息国家趋势调查。首先,基于数据丰富的环境,对问卷中的响应变量和变量进行卡方检验(分类变量)和双尾t检验(连续变量)。根据检验结果,对变量进行初步筛选,通过检验的变量被选用于后续分析。其次,如果任何一个初步筛选的变量缺失,则将参与者排除在研究之外。第三,使用5种机器学习算法对获得的数据进行建模,即逻辑回归、自动广义线性模型、自动随机森林、自动深度神经网络和自动梯度提升机,以识别和研究影响不使用基于网络病历的因素。上述自动机器学习算法基于H2O(H2O.ai)可扩展机器学习平台的R接口(R统计计算基金会)。最后,对80%的数据集采用5折交叉验证,将其用作训练数据以确定5种算法的超参数,20%的数据集用作测试数据进行模型比较。

结果

在9072名受访者中,5409名(59.62%)没有使用过基于网络病历的经验。使用这5种算法,29个变量被确定为不使用基于网络病历的关键预测因素。这29个变量包括6个(21%)社会人口统计学变量(年龄、体重指数、种族、婚姻状况、教育程度和收入)和23个(79%)与个人生活方式和行为习惯相关的变量(如电子设备和互联网使用、个人健康状况及其健康关注度等)。H2O的自动机器学习方法具有较高的模型准确性。基于验证数据集的性能,最优模型是自动随机森林,其在验证集(88.52%)和测试集(82.87%)中的曲线下面积最高。

结论

在监测基于网络病历的使用趋势时,研究应关注年龄、教育程度、体重指数和婚姻状况等社会因素,以及个人生活方式和行为习惯,包括吸烟、电子设备和互联网使用、患者个人健康状况及其健康关注度。电子病历的使用可以针对特定患者群体,使更多人受益于其效用。

相似文献

3
Identifying the predictors of severe psychological distress by auto-machine learning methods.
Inform Med Unlocked. 2023;39:101258. doi: 10.1016/j.imu.2023.101258. Epub 2023 Apr 28.
4
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
9
Patterns of high-risk drinking among medical students: A web-based survey with machine learning.
Comput Biol Med. 2021 Sep;136:104747. doi: 10.1016/j.compbiomed.2021.104747. Epub 2021 Aug 16.

本文引用的文献

1
Wide range of applications for machine-learning prediction models in orthopedic surgical outcome: a systematic review.
Acta Orthop. 2021 Oct;92(5):526-531. doi: 10.1080/17453674.2021.1932928. Epub 2021 Jun 10.
3
Barriers to accessing online medical records in the United States.
Am J Manag Care. 2021 Jan;27(1):33-40. doi: 10.37765/ajmc.2021.88575.
5
7
Who Isn't Using Patient Portals And Why? Evidence And Implications From A National Sample Of US Adults.
Health Aff (Millwood). 2018 Dec;37(12):1948-1954. doi: 10.1377/hlthaff.2018.05117.
8
Cancer patients' attitudes and experiences of online access to their electronic medical records: A qualitative study.
Health Informatics J. 2018 Jun;24(2):115-124. doi: 10.1177/1460458216658778. Epub 2016 Jul 19.
9
Communication Barriers Perceived by Nurses and Patients.
Glob J Health Sci. 2015 Sep 28;8(6):65-74. doi: 10.5539/gjhs.v8n6p65.
10
Predictors and intensity of online access to electronic medical records among patients with cancer.
J Oncol Pract. 2014 Sep;10(5):e307-12. doi: 10.1200/JOP.2013.001347. Epub 2014 Jul 8.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验