Hajihosseini Morteza, Amini Payam, Saidi-Mehrabad Alireza, Dinu Irina
Stanford Department of Urology, Center for Academic Medicine, Palo Alto, CA 94304.
Department of Biostatistics, School of public Health, IRAN University of Medical Sciences, Tehran, Iran.
Comput Struct Biotechnol J. 2023 Feb 15;21:1621-1629. doi: 10.1016/j.csbj.2023.02.027. eCollection 2023.
The infants' gut microbiome is dynamic in nature. Literature has shown high inter-individual variability of gut microbial composition in the early years of infancy compared to adulthood. Although next-generation sequencing technologies are rapidly evolving, several statistical analysis aspects need to be addressed to capture the variability and dynamic nature of the infants' gut microbiome. In this study, we proposed a Bayesian Marginal Zero-inflated Negative Binomial (BAMZINB) model, addressing complexities associated with zero-inflation and multivariate structure of the infants' gut microbiome data. Here, we simulated 32 scenarios to compare the performance of BAMZINB with glmFit and BhGLM as the two other widely similar methods in the literature in handling zero-inflation, over-dispersion, and multivariate structure of the infants' gut microbiome. Then, we showed the performance of the BAMZINB approach on a real dataset using SKOT cohort (I and II) studies. Our simulation results showed that the BAMZINB model performed as well as those two methods in estimating the average abundance difference and had a better fit for almost all scenarios when the signal and sample size were large. Applying BAMZINB on SKOT cohorts showed remarkable changes in the average absolute abundance of specific bacteria from 9 to 18 months for infants of healthy and obese mothers. In conclusion, we recommend using the BAMZINB approach for infants' gut microbiome data taking zero-inflation and over-dispersion properties into account in multivariate analysis when comparing the average abundance difference.
婴儿的肠道微生物群本质上是动态的。文献表明,与成年人相比,婴儿早期肠道微生物组成存在高度的个体间差异。尽管下一代测序技术正在迅速发展,但在捕捉婴儿肠道微生物群的变异性和动态性质方面,仍有几个统计分析方面需要解决。在本研究中,我们提出了一种贝叶斯边际零膨胀负二项式(BAMZINB)模型,以解决与婴儿肠道微生物群数据的零膨胀和多变量结构相关的复杂性。在此,我们模拟了32种情况,以比较BAMZINB与文献中另外两种广泛相似的方法glmFit和BhGLM在处理婴儿肠道微生物群的零膨胀、过度离散和多变量结构方面的性能。然后,我们在使用SKOT队列(I和II)研究的真实数据集上展示了BAMZINB方法的性能。我们的模拟结果表明,BAMZINB模型在估计平均丰度差异方面与这两种方法表现相当,并且在信号和样本量较大时,几乎对所有情况都有更好的拟合。将BAMZINB应用于SKOT队列显示,健康和肥胖母亲的婴儿在9至18个月时,特定细菌的平均绝对丰度有显著变化。总之,我们建议在比较平均丰度差异时,在多变量分析中考虑零膨胀和过度离散特性,使用BAMZINB方法处理婴儿肠道微生物群数据。