Zheng Huiru, Wang Haiying, Dewhurst Richard J, Roehe Rainer
IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):858-867. doi: 10.1109/TCBB.2018.2879342. Epub 2018 Nov 2.
The importance of the composition and signature of rumen microbial communities has gained increasing attention. One of the key techniques was to infer co-abundance networks through correlation analysis based on relative abundances. While substantial insights and progress have been made, it has been found that due to the compositional nature of data, correlation analysis derived from relative abundance could produce misleading results and spurious associations. In this study, we proposed the use of a framework including a compendium of two correlation measures and three dissimilarity metrics in an attempt to mitigate the compositional effect in the inference of significant associations in the bovine rumen microbiome. We tested the framework on rumen microbiome data including both 16S rRNA and KEGG genes associated with methane production in cattle. Based on the identification of significant positive and negative associations supported by multiple metrics, two co-occurrence networks, e.g., co-presence and mutual-exclusion networks, were constructed. Significant modules associated with methane emissions were identified. In comparison to previous studies, our analysis demonstrates that deriving microbial associations based on the correlations between relative abundances may not only lead to missing information but also produce spurious associations. To bridge together different co-presence and mutual-exclusion relations, a multiplex network model has been proposed for integrative analysis of co-occurrence networks which has great potential to support the prediction of animal phytotypes and to provide additional insights into biological mechanisms of the microbiome associated with the traits.
瘤胃微生物群落的组成和特征的重要性日益受到关注。关键技术之一是基于相对丰度通过相关性分析推断共丰度网络。虽然已经取得了大量的见解和进展,但人们发现,由于数据的组成性质,基于相对丰度的相关性分析可能会产生误导性结果和虚假关联。在本研究中,我们提出使用一个框架,该框架包括两种相关性度量和三种相异度指标的汇总,以试图减轻在推断牛瘤胃微生物组中显著关联时的组成效应。我们在瘤胃微生物组数据上测试了该框架,这些数据包括与牛甲烷产生相关的16S rRNA和KEGG基因。基于多种指标支持的显著正相关和负相关的识别,构建了两个共现网络,即共存网络和互斥网络。识别出了与甲烷排放相关的显著模块。与先前的研究相比,我们的分析表明,基于相对丰度之间的相关性推导微生物关联不仅可能导致信息缺失,还可能产生虚假关联。为了将不同的共存和互斥关系联系起来,提出了一种多重网络模型用于共现网络的综合分析,该模型在支持动物植物型预测和提供与性状相关的微生物组生物学机制的更多见解方面具有巨大潜力。