Department of Public Health, Faculty of Medicine and Biomedical Sciences, University of Yaoundé I, Yaoundé, Cameroon.
Epidemiol Health. 2011;33:e2011006. doi: 10.4178/epih/e2011006. Epub 2011 Jun 17.
To propose an alternative procedure, based on a Bayesian network (BN), for estimation and prediction, and to discuss its usefulness for taking into account the hierarchical relationships among covariates.
The procedure is illustrated by modeling the risk of diarrhea infection for 2,740 children aged 0 to 59 months in Cameroon. We compare the procedure with a standard logistic regression and with a model based on multi-level logistic regression.
The standard logistic regression approach is inadequate, or at least incomplete, in that it does not attempt to account for potentially causal relationships between risk factors. The multi-level logistic regression does model the hierarchical structure, but does so in a piecewise manner; the resulting estimates and interpretations differ from those of the BN approach proposed here. An advantage of the BN approach is that it enables one to determine the probability that a risk factor (and/or the outcome) is in any specific state, given the states of the others. The currently available approaches can only predict the outcome (disease), given the states of the covariates.
A major advantage of BNs is that they can deal with more complex interrelationships between variables whereas competing approaches deal at best only with hierarchical ones. We propose that BN be considered as well as a worthwhile method for summarizing the data in epidemiological studies whose aim is understanding the determinants of diseases and quantifying their effects.
提出一种基于贝叶斯网络(BN)的替代方法,用于估计和预测,并讨论其在考虑协变量的层次关系方面的有用性。
通过对喀麦隆 2740 名 0 至 59 个月大的儿童腹泻感染风险进行建模,说明了该程序。我们将该程序与标准逻辑回归和基于多水平逻辑回归的模型进行了比较。
标准逻辑回归方法是不充分的,或者至少是不完整的,因为它没有尝试考虑危险因素之间可能存在的因果关系。多水平逻辑回归确实对层次结构进行了建模,但它是分段进行的;由此产生的估计值和解释与这里提出的 BN 方法不同。BN 方法的一个优点是,它可以确定在给定其他因素状态的情况下,一个风险因素(和/或结果)处于任何特定状态的概率。目前可用的方法只能在给定协变量状态的情况下预测结果(疾病)。
BN 的一个主要优点是,它们可以处理变量之间更复杂的相互关系,而竞争方法最多只能处理层次关系。我们建议将 BN 作为一种有价值的方法来总结流行病学研究的数据,这些研究的目的是了解疾病的决定因素并量化其影响。