MRC Tropical Epidemiology Group, London School of Hygiene & Tropical Medicine, London, UK.
Malar J. 2013 Oct 5;12:355. doi: 10.1186/1475-2875-12-355.
Malaria transmission is highly heterogeneous and analysis of incidence data must account for this for correct statistical inference. Less widely appreciated is the occurrence of a large number of zero counts (children without a malaria episode) in malaria cohort studies. Zero-inflated regression methods provide one means of addressing this issue, and also allow risk factors providing complete and partial protection to be disentangled.
Poisson, negative binomial (NB), zero-inflated Poisson (ZIP) and zero-inflated negative binomial (ZINB) regression models were fitted to data from two cohort studies of malaria in children in Ghana. Multivariate models were used to understand risk factors for elevated incidence of malaria and for remaining malaria-free, and to estimate the fraction of the population not at risk of malaria.
ZINB models, which account for both heterogeneity in individual risk and an unexposed sub-group within the population, provided the best fit to data in both cohorts. These approaches gave additional insight into the mechanism of factors influencing the incidence of malaria compared to simpler approaches, such as NB regression. For example, compared to urban areas, rural residence was found to both increase the incidence rate of malaria among exposed children, and increase the probability of being exposed. In Navrongo, 34% of urban residents were estimated to be at no risk, compared to 3% of rural residents. In Kintampo, 47% of urban residents and 13% of rural residents were estimated to be at no risk.
These results illustrate the utility of zero-inflated regression methods for analysis of malaria cohort data that include a large number of zero counts. Specifically, these results suggest that interventions that reach mainly urban residents will have limited overall impact, since some urban residents are essentially at no risk, even in areas of high endemicity, such as in Ghana.
疟疾传播具有高度异质性,分析发病率数据必须考虑到这一点,以进行正确的统计推断。但人们较少认识到的是,在疟疾队列研究中会出现大量零计数(没有疟疾发作的儿童)。零膨胀回归方法提供了一种解决此问题的方法,并且还可以区分提供完全和部分保护的风险因素。
泊松、负二项式(NB)、零膨胀泊松(ZIP)和零膨胀负二项式(ZINB)回归模型被应用于加纳两项儿童疟疾队列研究的数据中。使用多变量模型来了解疟疾发病率升高和保持无疟疾的风险因素,并估计没有疟疾风险的人群比例。
ZINB 模型同时考虑了个体风险的异质性和人群中未暴露的亚组,对两个队列的数据拟合得最好。与简单的方法(如 NB 回归)相比,这些方法为影响疟疾发病率的因素的机制提供了更多的见解。例如,与城市地区相比,农村居住不仅增加了暴露儿童中疟疾的发病率,而且增加了暴露的可能性。在纳夫龙戈,估计 34%的城市居民没有风险,而 3%的农村居民没有风险。在金塔波,估计 47%的城市居民和 13%的农村居民没有风险。
这些结果说明了零膨胀回归方法在分析包括大量零计数的疟疾队列数据中的应用。具体来说,这些结果表明,在高流行地区(如加纳),主要针对城市居民的干预措施的总体影响将有限,因为即使在这些地区,一些城市居民实际上没有风险。