Song Wenzhu, Gong Hao, Wang Qili, Zhang Lijuan, Qiu Lixia, Hu Xueli, Han Huimin, Li Yaheng, Li Rongshan, Li Yafeng
School of Public Health, Shanxi Medical University, Taiyuan, China.
Department of Biochemistry and Molecular Biology, Basic Medical College, Shanxi Medical University, Taiyuan, China.
Front Cardiovasc Med. 2022 Aug 30;9:984883. doi: 10.3389/fcvm.2022.984883. eCollection 2022.
Multimorbidity (MMD) is a medical condition that is linked with high prevalence and closely related to many adverse health outcomes and expensive medical costs. The present study aimed to construct Bayesian networks (BNs) with Max-Min Hill-Climbing algorithm (MMHC) algorithm to explore the network relationship between MMD and its related factors. We also aimed to compare the performance of BNs with traditional multivariate logistic regression model.
The data was downloaded from the Online Open Database of CHARLS 2018, a population-based longitudinal survey. In this study, we included 10 variables from data on demographic background, health status and functioning, and lifestyle. Missing value imputation was first performed using Random Forest. Afterward, the variables were included into logistic regression model construction and BNs model construction. The structural learning of BNs was achieved using MMHC algorithm and the parameter learning was conducted using maximum likelihood estimation.
Among 19,752 individuals (9,313 men and 10,439 women) aged 64.73 ± 10.32 years, there are 9,129 ones without MMD (46.2%) and 10,623 ones with MMD (53.8%). Logistic regression model suggests that physical activity, sex, age, sleep duration, nap, smoking, and alcohol consumption are associated with MMD ( < 0.05). BNs, by establishing a complicated network relationship, reveals that age, sleep duration, and physical activity have a direct connection with MMD. It also shows that education levels are indirectly connected to MMD through sleep duration and residence is indirectly linked to MMD through sleep duration.
BNs could graphically reveal the complex network relationship between MMD and its related factors, outperforming traditional logistic regression model. Besides, BNs allows for risk reasoning for MMD through Bayesian reasoning, which is more consistent with clinical practice and thus holds some application prospects.
多病共存(MMD)是一种患病率高的医学状况,与许多不良健康结局及高昂的医疗费用密切相关。本研究旨在使用最大最小爬山算法(MMHC)构建贝叶斯网络(BNs),以探索MMD与其相关因素之间的网络关系。我们还旨在比较BNs与传统多变量逻辑回归模型的性能。
数据从基于人群的纵向调查CHARLS 2018在线开放数据库下载。在本研究中,我们从人口统计学背景、健康状况与功能以及生活方式数据中纳入了10个变量。首先使用随机森林进行缺失值插补。之后,将变量纳入逻辑回归模型构建和BNs模型构建。使用MMHC算法实现BNs的结构学习,并使用最大似然估计进行参数学习。
在19752名年龄为64.73±10.32岁的个体(9313名男性和10439名女性)中,有9129人无MMD(46.2%),10623人有MMD(53.8%)。逻辑回归模型表明,身体活动、性别、年龄、睡眠时间、午睡、吸烟和饮酒与MMD相关(<0.05)。BNs通过建立复杂的网络关系,揭示年龄、睡眠时间和身体活动与MMD有直接联系。它还表明教育水平通过睡眠时间与MMD间接相关,居住情况通过睡眠时间与MMD间接相关。
BNs能够以图形方式揭示MMD与其相关因素之间的复杂网络关系,优于传统逻辑回归模型。此外,BNs允许通过贝叶斯推理对MMD进行风险推理,这与临床实践更一致,因此具有一定的应用前景。