Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA.
Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
BMC Bioinformatics. 2023 Jan 19;24(1):22. doi: 10.1186/s12859-023-05147-w.
Microbial communities are known to be closely related to many diseases, such as obesity and HIV, and it is of interest to identify differentially abundant microbial species between two or more environments. Since the abundances or counts of microbial species usually have different scales and suffer from zero-inflation or over-dispersion, normalization is a critical step before conducting differential abundance analysis. Several normalization approaches have been proposed, but it is difficult to optimize the characterization of the true relationship between taxa and interesting outcomes. RESULTS: To avoid the challenge of picking an optimal normalization and accommodate the advantages of several normalization strategies, we propose an omnibus approach. Our approach is based on a Cauchy combination test, which is flexible and powerful by aggregating individual p values. We also consider a truncated test statistic to prevent substantial power loss. We experiment with a basic linear regression model as well as recently proposed powerful association tests for microbiome data and compare the performance of the omnibus approach with individual normalization approaches. Experimental results show that, regardless of simulation settings, the new approach exhibits power that is close to the best normalization strategy, while controling the type I error well. CONCLUSIONS: The proposed omnibus test releases researchers from choosing among various normalization methods and it is an aggregated method that provides the powerful result to the underlying optimal normalization, which requires tedious trial and error. While the power may not exceed the best normalization, it is always much better than using a poor choice of normalization.
微生物群落与许多疾病密切相关,例如肥胖症和艾滋病,确定两种或更多环境之间差异丰富的微生物物种是很有意义的。由于微生物物种的丰度或计数通常具有不同的规模,并受到零膨胀或过分散的影响,因此在进行差异丰度分析之前,归一化是一个关键步骤。已经提出了几种归一化方法,但很难优化真实关系的特征分类与有趣的结果。结果:为了避免选择最佳归一化的挑战,并利用几种归一化策略的优势,我们提出了一种综合方法。我们的方法基于柯西组合检验,通过聚合个体 p 值,该方法具有灵活性和强大性。我们还考虑了截断的检验统计量,以防止实质性的功效损失。我们用基本线性回归模型以及最近提出的用于微生物组数据的强大关联检验进行实验,并将综合方法与单个归一化方法的性能进行比较。实验结果表明,无论模拟设置如何,新方法的功效都接近最佳归一化策略,同时很好地控制了第一类错误。结论:所提出的综合检验方法使研究人员无需在各种归一化方法之间进行选择,它是一种聚合方法,可以为基础的最优归一化提供强大的结果,这需要繁琐的反复试验。虽然功效可能不会超过最佳归一化,但它总是比使用较差的归一化要好得多。