Department of Bioinformatics and Systems Biology, University of California San Diego, La Jolla, CA, 92093-083, USA.
Halıcıoğlu Data Science Institute, University of California San Diego, La Jolla, CA, 92093-083, USA.
BMC Bioinformatics. 2021 Feb 5;22(1):49. doi: 10.1186/s12859-020-03941-4.
Microbiomes consist of bacteria, viruses, and other microorganisms, and are responsible for many different functions in both organisms and the environment. Past analyses of microbiomes focused on using correlation to determine linear relationships between microbes and diseases. Weak correlations due to nonlinearity between microbe pairs may cause researchers to overlook critical components of the data. With the abundance of available microbiome, we need a method that comprehensively studies microbiomes and how they are related to each other.
We collected publicly available datasets from human, environment, and animal samples to determine both symmetric and asymmetric Boolean implication relationships between a pair of microbes. We then found relationships that are potentially invariants, meaning they will hold in any microbe community. In other words, if we determine there is a relationship between two microbes, we expect the relationship to hold in almost all contexts. We discovered that around 330,000 pairs of microbes universally exhibit the same relationship in almost all the datasets we studied, thus making them good candidates for invariants. Our results also confirm known biological properties and seem promising in terms of disease diagnosis.
Since the relationships are likely universal, we expect them to hold in clinical settings, as well as general populations. If these strong invariants are present in disease settings, it may provide insight into prognostic, predictive, or therapeutic properties of clinically relevant diseases. For example, our results indicate that there is a difference in the microbe distributions between patients who have or do not have IBD, eczema and psoriasis. These new analyses may improve disease diagnosis and drug development in terms of accuracy and efficiency.
微生物组由细菌、病毒和其他微生物组成,负责生物体和环境中的许多不同功能。过去对微生物组的分析侧重于使用相关性来确定微生物与疾病之间的线性关系。由于微生物对之间的非线性关系,弱相关性可能导致研究人员忽略数据的关键组成部分。随着微生物组的大量可用,我们需要一种方法来全面研究微生物组以及它们之间的相互关系。
我们收集了来自人类、环境和动物样本的公开可用数据集,以确定一对微生物之间的对称和非对称布尔蕴涵关系。然后,我们找到了潜在的不变关系,这意味着它们将在任何微生物群落中成立。换句话说,如果我们确定两个微生物之间存在关系,我们期望这种关系在几乎所有情况下都成立。我们发现,大约 330000 对微生物在我们研究的几乎所有数据集中普遍表现出相同的关系,因此它们是不变关系的良好候选者。我们的结果还证实了已知的生物学特性,并且在疾病诊断方面似乎很有前途。
由于这些关系可能是普遍存在的,我们期望它们在临床环境以及一般人群中成立。如果这些强不变关系存在于疾病环境中,它可能为临床相关疾病的预后、预测或治疗特性提供深入了解。例如,我们的结果表明,患有或不患有 IBD、湿疹和银屑病的患者的微生物分布存在差异。这些新的分析可能会提高疾病诊断和药物开发的准确性和效率。