Jiang Yifan, Aton Matthew, Zhu Qiyun, Lu Yang Young
Cheriton School of Computer Science, University of Waterloo, Waterloo, Ontario, Canada.
School of Life Sciences, Arizona State University, Tempe, AZ, USA.
Microbiome. 2025 Mar 29;13(1):87. doi: 10.1186/s40168-025-02080-3.
The human microbiome, a complex ecosystem of microorganisms inhabiting the body, plays a critical role in human health. Investigating its association with host traits is essential for understanding its impact on various diseases. Although shotgun metagenomic sequencing technologies have produced vast amounts of microbiome data, analyzing such data is highly challenging due to its sparsity, noisiness, and high feature dimensionality. Here, we develop MIOSTONE, an accurate and interpretable neural network model for microbiome-disease association that simulates a real taxonomy by encoding the relationships among microbial features. The taxonomy-encoding architecture provides a natural bridge from variations in microbial taxa abundance to variations in traits, encompassing increasingly coarse scales from species to domains. MIOSTONE has the ability to determine whether taxa within the corresponding taxonomic group provide a better explanation in a data-driven manner. MIOSTONE serves as an effective predictive model, as it not only accurately predicts microbiome-trait associations across extensive simulated and real datasets but also offers interpretability for scientific discovery. Both attributes are crucial for facilitating in silico investigations into the biological mechanisms underlying such associations among microbial taxa. Video Abstract.
人类微生物组是居住在人体中的一个复杂微生物生态系统,对人类健康起着关键作用。研究其与宿主特征的关联对于理解其对各种疾病的影响至关重要。尽管鸟枪法宏基因组测序技术已产生了大量的微生物组数据,但由于其稀疏性、噪声和高特征维度,分析此类数据极具挑战性。在此,我们开发了MIOSTONE,这是一种用于微生物组与疾病关联的准确且可解释的神经网络模型,它通过编码微生物特征之间的关系来模拟真实的分类法。分类法编码架构提供了一座从微生物分类群丰度变化到特征变化的自然桥梁,涵盖了从物种到域的越来越粗略的尺度。MIOSTONE能够以数据驱动的方式确定相应分类组内的分类群是否能提供更好的解释。MIOSTONE作为一种有效的预测模型,不仅能在广泛的模拟和真实数据集上准确预测微生物组与特征的关联,还为科学发现提供可解释性。这两个属性对于促进对微生物分类群之间此类关联背后的生物学机制进行计算机模拟研究至关重要。视频摘要。