Sullivan Kyle A, Miller J Izaak, Townsend Alice, Morgan Mallory, Lane Matthew, Pavicic Mirko, Shah Manesh, Cashman Mikaela, Jacobson Daniel A
Computational and Predictive Biology, Oak Ridge National Laboratory, Oak Ridge, TN, USA.
Office of Innovative Technologies, University of Tennessee-Knoxville, Knoxville, TN.
bioRxiv. 2024 Jul 22:2024.07.17.603821. doi: 10.1101/2024.07.17.603821.
While the proliferation of data-driven omics technologies has continued to accelerate, methods of identifying relationships among large-scale changes from omics experiments have stagnated. It is therefore imperative to develop methods that can identify key mechanisms among one or more omics experiments in order to advance biological discovery. To solve this problem, here we describe the network-based algorithm MENTOR - Multiplex Embedding of Networks for Team-Based Omics Research. We demonstrate MENTOR's utility as a supervised learning approach to successfully partition a gene set containing multiple ontological functions into their respective functions. Subsequently, we used MENTOR as an unsupervised learning approach to identify important biological functions pertaining to the host genetic architectures in associated with microbial abundance of multiple taxa. Moreover, as open source software designed with scientific teams in mind, we demonstrate the ability to use the output of MENTOR to facilitate distributed interpretation of omics experiments.
尽管数据驱动的组学技术的扩散仍在加速,但从组学实验中识别大规模变化之间关系的方法却停滞不前。因此,开发能够识别一个或多个组学实验中的关键机制的方法对于推进生物学发现至关重要。为了解决这个问题,我们在此描述基于网络的算法MENTOR——用于基于团队的组学研究的网络多重嵌入。我们证明了MENTOR作为一种监督学习方法的效用,能够成功地将包含多种本体功能的基因集划分为各自的功能。随后,我们将MENTOR用作无监督学习方法,以识别与多个分类群的微生物丰度相关的宿主遗传结构的重要生物学功能。此外,作为一款为科学团队设计的开源软件,我们展示了利用MENTOR的输出促进组学实验分布式解释的能力。