School of Public Health, Guangdong Medical University, Dongguan, 523808, China.
BMC Med Res Methodol. 2023 Oct 25;23(1):249. doi: 10.1186/s12874-023-02070-9.
To predict the influencing factors of neonatal pneumonia in pregnant women with diabetes mellitus using a Bayesian network model. By examining the intricate network connections between the numerous variables given by Bayesian networks (BN), this study aims to compare the prediction effect of the Bayesian network model and to analyze the influencing factors directly associated to neonatal pneumonia.
Through the structure learning algorithms of BN, Naive Bayesian (NB), Tree Augmented Naive Bayes (TAN), and k-Dependence Bayesian Classifier (KDB), complex networks connecting variables were presented and their predictive abilities were tested. The BN model and three machine learning models computed using the R bnlean package were also compared in the data set.
In constraint-based algorithms, three algorithms had different presentation DAGs. KDB had a better prediction effect than NB and TAN, and it achieved higher AUC compared with TAN. Among three machine learning modes, Support Vector Machine showed a accuracy rate of 91.04% and 67.88% of precision, which was lower than TAN (92.70%; 72.10%).
KDB was applicable, and it can detect the dependencies between variables, identify more potential associations and track changes between variables and outcome.
利用贝叶斯网络模型预测糖尿病孕妇新生儿肺炎的影响因素。通过检查贝叶斯网络(BN)给出的众多变量之间复杂的网络连接,本研究旨在比较贝叶斯网络模型的预测效果,并分析与新生儿肺炎直接相关的影响因素。
通过 BN 的结构学习算法,Naive Bayesian(NB)、Tree Augmented Naive Bayes(TAN)和 k-Dependence Bayesian Classifier(KDB),展示了变量之间的复杂网络连接,并测试了它们的预测能力。还在数据集上比较了 BN 模型和使用 R bnlean 包计算的三种机器学习模型。
在约束基算法中,三种算法呈现出不同的 DAG。KDB 的预测效果优于 NB 和 TAN,与 TAN 相比,它的 AUC 更高。在三种机器学习模式中,支持向量机的准确率为 91.04%,精度为 67.88%,低于 TAN(92.70%;72.10%)。
KDB 是适用的,它可以检测变量之间的依赖关系,识别更多潜在的关联,并跟踪变量和结果之间的变化。