在数据丰富的环境中，使用H2O自动机器学习算法识别患者中基于网络的医疗记录未使用情况的预测因素：混合方法研究。

Using the H2O Automatic Machine Learning Algorithms to Identify Predictors of Web-Based Medical Record Nonuse Among Patients in a Data-Rich Environment: Mixed Methods Study.

作者信息

Chen Yang, Liu Xuejiao, Gao Lei, Zhu Miao, Shia Ben-Chang, Chen Mingchih, Ye Linglong, Qin Lei

机构信息

School of Statistics, University of International Business and Economics, Beijing, China.

School of Law, University of International Business and Economics, Beijing, China.

出版信息

JMIR Med Inform. 2023 Jun 19;11:e41576. doi: 10.2196/41576.

DOI:10.2196/41576

PMID:37335616

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10337515/

Abstract

BACKGROUND

With the advent of electronic storage of medical records and the internet, patients can access web-based medical records. This has facilitated doctor-patient communication and built trust between them. However, many patients avoid using web-based medical records despite their greater availability and readability.

OBJECTIVE

On the basis of demographic and individual behavioral characteristics, this study explores the predictors of web-based medical record nonuse among patients.

METHODS

Data were collected from the National Cancer Institute 2019 to 2020 Health Information National Trends Survey. First, based on the data-rich environment, the chi-square test (categorical variables) and 2-tailed t tests (continuous variables) were performed on the response variables and the variables in the questionnaire. According to the test results, the variables were initially screened, and those that passed the test were selected for subsequent analysis. Second, participants were excluded from the study if any of the initially screened variables were missing. Third, the data obtained were modeled using 5 machine learning algorithms, namely, logistic regression, automatic generalized linear model, automatic random forest, automatic deep neural network, and automatic gradient boosting machine, to identify and investigate factors affecting web-based medical record nonuse. The aforementioned automatic machine learning algorithms were based on the R interface (R Foundation for Statistical Computing) of the H2O (H2O.ai) scalable machine learning platform. Finally, 5-fold cross-validation was adopted for 80% of the data set, which was used as the training data to determine hyperparameters of 5 algorithms, and 20% of the data set was used as the test data for model comparison.

RESULTS

Among the 9072 respondents, 5409 (59.62%) had no experience using web-based medical records. Using the 5 algorithms, 29 variables were identified as crucial predictors of nonuse of web-based medical records. These 29 variables comprised 6 (21%) sociodemographic variables (age, BMI, race, marital status, education, and income) and 23 (79%) variables related to individual lifestyles and behavioral habits (such as electronic and internet use, individuals' health status and their level of health concern, etc). H2O's automatic machine learning methods have a high model accuracy. On the basis of the performance of the validation data set, the optimal model was the automatic random forest with the highest area under the curve in the validation set (88.52%) and the test set (82.87%).

CONCLUSIONS

When monitoring web-based medical record use trends, research should focus on social factors such as age, education, BMI, and marital status, as well as personal lifestyle and behavioral habits, including smoking, use of electronic devices and the internet, patients' personal health status, and their level of health concern. The use of electronic medical records can be targeted to specific patient groups, allowing more people to benefit from their usefulness.

摘要

背景

随着电子病历存储和互联网的出现，患者可以访问基于网络的病历。这促进了医患沟通并在他们之间建立了信任。然而，尽管基于网络的病历更易获取且可读性更强，但许多患者仍避免使用。

目的

基于人口统计学和个体行为特征，本研究探讨患者中不使用基于网络病历的预测因素。

方法

数据收集自美国国家癌症研究所2019年至2020年健康信息国家趋势调查。首先，基于数据丰富的环境，对问卷中的响应变量和变量进行卡方检验（分类变量）和双尾t检验（连续变量）。根据检验结果，对变量进行初步筛选，通过检验的变量被选用于后续分析。其次，如果任何一个初步筛选的变量缺失，则将参与者排除在研究之外。第三，使用5种机器学习算法对获得的数据进行建模，即逻辑回归、自动广义线性模型、自动随机森林、自动深度神经网络和自动梯度提升机，以识别和研究影响不使用基于网络病历的因素。上述自动机器学习算法基于H2O（H2O.ai）可扩展机器学习平台的R接口（R统计计算基金会）。最后，对80%的数据集采用5折交叉验证，将其用作训练数据以确定5种算法的超参数，20%的数据集用作测试数据进行模型比较。

结果

在9072名受访者中，5409名（59.62%）没有使用过基于网络病历的经验。使用这5种算法，29个变量被确定为不使用基于网络病历的关键预测因素。这29个变量包括6个（21%）社会人口统计学变量（年龄、体重指数、种族、婚姻状况、教育程度和收入）和23个（79%）与个人生活方式和行为习惯相关的变量（如电子设备和互联网使用、个人健康状况及其健康关注度等）。H2O的自动机器学习方法具有较高的模型准确性。基于验证数据集的性能，最优模型是自动随机森林，其在验证集（88.52%）和测试集（82.87%）中的曲线下面积最高。

结论

在监测基于网络病历的使用趋势时，研究应关注年龄、教育程度、体重指数和婚姻状况等社会因素，以及个人生活方式和行为习惯，包括吸烟、电子设备和互联网使用、患者个人健康状况及其健康关注度。电子病历的使用可以针对特定患者群体，使更多人受益于其效用。

相似文献

Using the H2O Automatic Machine Learning Algorithms to Identify Predictors of Web-Based Medical Record Nonuse Among Patients in a Data-Rich Environment: Mixed Methods Study.在数据丰富的环境中，使用H2O自动机器学习算法识别患者中基于网络的医疗记录未使用情况的预测因素：混合方法研究。

JMIR Med Inform. 2023 Jun 19;11:e41576. doi: 10.2196/41576.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Identifying the predictors of severe psychological distress by auto-machine learning methods.通过自动机器学习方法识别严重心理困扰的预测因素。

Inform Med Unlocked. 2023;39:101258. doi: 10.1016/j.imu.2023.101258. Epub 2023 Apr 28.

Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者？

Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.

Machine Learning Electronic Health Record Identification of Patients with Rheumatoid Arthritis: Algorithm Pipeline Development and Validation Study.机器学习在类风湿性关节炎患者电子健康记录识别中的应用：算法流程开发与验证研究。

JMIR Med Inform. 2020 Nov 30;8(11):e23930. doi: 10.2196/23930.

Performance of a Machine Learning Algorithm Using Electronic Health Record Data to Predict Postoperative Complications and Report on a Mobile Platform.基于电子健康记录数据的机器学习算法预测术后并发症的性能及移动平台报告。

JAMA Netw Open. 2022 May 2;5(5):e2211973. doi: 10.1001/jamanetworkopen.2022.11973.

Machine learning approaches for prediction of early death among lung cancer patients with bone metastases using routine clinical characteristics: An analysis of 19,887 patients.利用常规临床特征预测肺癌伴骨转移患者早期死亡的机器学习方法：对 19887 例患者的分析。

Front Public Health. 2022 Oct 6;10:1019168. doi: 10.3389/fpubh.2022.1019168. eCollection 2022.

Biological signatures and prediction of an immunosuppressive status-persistent critical illness-among orthopedic trauma patients using machine learning techniques.利用机器学习技术对骨科创伤患者免疫抑制状态持续的危重症进行生物学特征分析和预测。

Front Immunol. 2022 Oct 17;13:979877. doi: 10.3389/fimmu.2022.979877. eCollection 2022.

Patterns of high-risk drinking among medical students: A web-based survey with machine learning.医学生高危饮酒模式：基于网络的机器学习调查。

Comput Biol Med. 2021 Sep;136:104747. doi: 10.1016/j.compbiomed.2021.104747. Epub 2021 Aug 16.

Prediction and Evaluation of Machine Learning Algorithm for Prediction of Blood Transfusion during Cesarean Section and Analysis of Risk Factors of Hypothermia during Anesthesia Recovery.机器学习算法预测剖宫产术中输血的预测及麻醉恢复期低体温风险因素分析。

Comput Math Methods Med. 2022 Apr 13;2022:8661324. doi: 10.1155/2022/8661324. eCollection 2022.

引用本文的文献

Diagnostic performance of machine learning in systemic infection following percutaneous nephrolithotomy and identification of associated risk factors.机器学习在经皮肾镜取石术后全身感染中的诊断性能及相关危险因素的识别

Heliyon. 2024 May 9;10(10):e30956. doi: 10.1016/j.heliyon.2024.e30956. eCollection 2024 May 30.

Automated machine learning for predicting liver metastasis in patients with gastrointestinal stromal tumor: a SEER-based analysis.基于 SEER 数据库的自动化机器学习预测胃肠道间质瘤患者肝转移的研究

Sci Rep. 2024 May 30;14(1):12415. doi: 10.1038/s41598-024-62311-9.

本文引用的文献

Wide range of applications for machine-learning prediction models in orthopedic surgical outcome: a systematic review.机器学习预测模型在骨科手术结果中的广泛应用：系统评价。

Acta Orthop. 2021 Oct;92(5):526-531. doi: 10.1080/17453674.2021.1932928. Epub 2021 Jun 10.

Online Medical Record Nonuse Among Patients: Data Analysis Study of the 2019 Health Information National Trends Survey.在线医疗记录不使用：2019 年健康信息国家趋势调查数据分析研究。

J Med Internet Res. 2021 Feb 22;23(2):e24767. doi: 10.2196/24767.

Barriers to accessing online medical records in the United States.美国获取在线医疗记录的障碍。

Am J Manag Care. 2021 Jan;27(1):33-40. doi: 10.37765/ajmc.2021.88575.

The Promise of Patient Portals for Individuals Living With Chronic Illness: Qualitative Study Identifying Pathways of Patient Engagement.患者门户为慢性病患者带来的前景：定性研究确定患者参与途径

J Med Internet Res. 2020 Jul 17;22(7):e17744. doi: 10.2196/17744.

Use of Patient Portals of Electronic Health Records Remains Low From 2014 to 2018: Results From a National Survey and Policy Implications.2014 年至 2018 年，电子健康记录患者门户的使用率仍然较低：来自全国性调查的结果及政策启示。

Am J Health Promot. 2020 Jul;34(6):677-680. doi: 10.1177/0890117119900591. Epub 2020 Feb 7.

Patients' perception of communication at the interface between primary and secondary care: a cross-sectional survey in 34 countries.患者对初级保健和二级保健之间沟通的感知：34 个国家的横断面调查。

BMC Health Serv Res. 2019 Dec 30;19(1):1018. doi: 10.1186/s12913-019-4848-9.

Who Isn't Using Patient Portals And Why? Evidence And Implications From A National Sample Of US Adults.谁没有使用患者门户，以及为什么？来自美国成年人全国样本的证据和影响。

Health Aff (Millwood). 2018 Dec;37(12):1948-1954. doi: 10.1377/hlthaff.2018.05117.

Cancer patients' attitudes and experiences of online access to their electronic medical records: A qualitative study.癌症患者对在线获取电子病历的态度和体验：一项定性研究。

Health Informatics J. 2018 Jun;24(2):115-124. doi: 10.1177/1460458216658778. Epub 2016 Jul 19.

Communication Barriers Perceived by Nurses and Patients.护士和患者感知到的沟通障碍。

Glob J Health Sci. 2015 Sep 28;8(6):65-74. doi: 10.5539/gjhs.v8n6p65.

Predictors and intensity of online access to electronic medical records among patients with cancer.癌症患者在线访问电子病历的预测因素及强度

J Oncol Pract. 2014 Sep;10(5):e307-12. doi: 10.1200/JOP.2013.001347. Epub 2014 Jul 8.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验