Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA 30322, USA.
Department of Gynecology and Obstetrics, Emory University School of Medicine, Atlanta, GA 30322, USA.
Genes (Basel). 2023 Sep 8;14(9):1777. doi: 10.3390/genes14091777.
Microbiome data are subject to experimental bias that is caused by DNA extraction and PCR amplification, among other sources, but this important feature is often ignored when developing statistical methods for analyzing microbiome data. McLaren, Willis, and Callahan (2019) proposed a model for how such biases affect the observed taxonomic profiles; this model assumes the main effects of bias without taxon-taxon interactions. Our newly developed method for testing the differential abundance of taxa, LOCOM, is the first method to account for experimental bias and is robust to the main effect biases. However, there is also evidence for taxon-taxon interactions. In this report, we formulated a model for interaction biases and used simulations based on this model to evaluate the impact of interaction biases on the performance of LOCOM as well as other available compositional analysis methods. Our simulation results indicate that LOCOM remained robust to a reasonable range of interaction biases. The other methods tend to have an inflated FDR even when there were only main effect biases. LOCOM maintained the highest sensitivity even when the other methods could not control the FDR. We thus conclude that LOCOM outperforms the other methods for compositional analysis of microbiome data considered here.
微生物组数据受到实验偏差的影响,这些偏差源自 DNA 提取和 PCR 扩增等来源,但在开发用于分析微生物组数据的统计方法时,这一重要特征常常被忽视。McLaren、Willis 和 Callahan(2019)提出了一个模型,说明这种偏差如何影响观察到的分类群分布;该模型假设了偏差的主要效应,而没有考虑分类群之间的相互作用。我们新开发的用于测试分类群丰度差异的方法 LOCOM 是第一个考虑实验偏差的方法,并且对主要效应偏差具有稳健性。然而,也有证据表明存在分类群之间的相互作用。在本报告中,我们提出了一个用于交互偏差的模型,并基于该模型进行模拟,以评估交互偏差对 LOCOM 以及其他可用的组成分析方法性能的影响。我们的模拟结果表明,LOCOM 在相当大的交互偏差范围内仍然保持稳健。其他方法即使只有主要效应偏差,也往往会出现 FDR 膨胀。即使其他方法无法控制 FDR,LOCOM 仍保持着最高的灵敏度。因此,我们得出结论,LOCOM 在分析这里考虑的微生物组数据的组成方面优于其他方法。