Department of Genetics, Stanford School of Medicine, Stanford, CA 94305, USA.
Cardiovascular Institute, Stanford School of Medicine, Stanford, CA 94305, USA.
Biomolecules. 2020 Nov 27;10(12):1606. doi: 10.3390/biom10121606.
The number of researchers using multi-omics is growing. Though still expensive, every year it is cheaper to perform multi-omic studies, often exponentially so. In addition to its increasing accessibility, multi-omics reveals a view of systems biology to an unprecedented depth. Thus, multi-omics can be used to answer a broad range of biological questions in finer resolution than previous methods. We used six omic measurements-four nucleic acid (i.e., genomic, epigenomic, transcriptomics, and metagenomic) and two mass spectrometry (proteomics and metabolomics) based-to highlight an analysis workflow on this type of data, which is often vast. This workflow is not exhaustive of all the omic measurements or analysis methods, but it will provide an experienced or even a novice multi-omic researcher with the tools necessary to analyze their data. This review begins with analyzing a single ome and study design, and then synthesizes best practices in data integration techniques that include machine learning. Furthermore, we delineate methods to validate findings from multi-omic integration. Ultimately, multi-omic integration offers a window into the complexity of molecular interactions and a comprehensive view of systems biology.
使用多组学进行研究的人员正在增加。尽管多组学研究仍然昂贵,但每年的成本都在降低,通常呈指数级下降。除了越来越容易获得之外,多组学还以前所未有的深度揭示了系统生物学的全貌。因此,多组学可以用于以比以前的方法更高的分辨率回答更广泛的生物学问题。我们使用了六种组学测量方法(四种核酸(即基因组、表观基因组、转录组和宏基因组)和两种基于质谱的方法(蛋白质组学和代谢组学))来突出展示这种通常非常庞大的数据的分析工作流程。该工作流程并非涵盖所有组学测量或分析方法,但它将为有经验甚至是新手的多组学研究人员提供分析其数据所需的工具。本综述首先分析单个组学和研究设计,然后综合了包括机器学习在内的数据集成技术的最佳实践。此外,我们还描述了验证多组学整合结果的方法。最终,多组学整合提供了一个观察分子相互作用复杂性和系统生物学全貌的窗口。