McLuskey Karen, Wandy Joe, Vincent Isabel, van der Hooft Justin J J, Rogers Simon, Burgess Karl, Daly Rónán
Glasgow Polyomics, University of Glasgow, Glasgow G61 1QH, UK.
IBioIC, Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow G1 1XQ, UK.
Metabolites. 2021 Feb 11;11(2):103. doi: 10.3390/metabo11020103.
Related metabolites can be grouped into sets in many ways, e.g., by their participation in series of chemical reactions (forming metabolic pathways), or based on fragmentation spectral similarities or shared chemical substructures. Understanding how such metabolite sets change in relation to experimental factors can be incredibly useful in the interpretation and understanding of complex metabolomics data sets. However, many of the available tools that are used to perform this analysis are not entirely suitable for the analysis of untargeted metabolomics measurements. Here, we present PALS (Pathway Activity Level Scoring), a Python library, command line tool, and Web application that performs the ranking of significantly changing metabolite sets over different experimental conditions. The main algorithm in PALS is based on the pathway level analysis of gene expression (PLAGE) factorisation method and is denoted as mPLAGE (PLAGE for metabolomics). As an example of an application, PALS is used to analyse metabolites grouped as metabolic pathways and by shared tandem mass spectrometry fragmentation patterns. A comparison of mPLAGE with two other commonly used methods (overrepresentation analysis (ORA) and gene set enrichment analysis (GSEA)) is also given and reveals that mPLAGE is more robust to missing features and noisy data than the alternatives. As further examples, PALS is also applied to human African trypanosomiasis, Rhamnaceae, and American Gut Project data. In addition, normalisation can have a significant impact on pathway analysis results, and PALS offers a framework to further investigate this. PALS is freely available from our project Web site.
相关代谢物可以通过多种方式分组,例如,根据它们参与的一系列化学反应(形成代谢途径),或者基于碎片光谱相似性或共享的化学子结构。了解这些代谢物集如何随实验因素变化,对于解释和理解复杂的代谢组学数据集可能非常有用。然而,许多用于执行此分析的现有工具并不完全适用于非靶向代谢组学测量的分析。在这里,我们介绍了PALS(途径活性水平评分),它是一个Python库、命令行工具和Web应用程序,可对不同实验条件下显著变化的代谢物集进行排名。PALS中的主要算法基于基因表达的途径水平分析(PLAGE)分解方法,记为mPLAGE(代谢组学的PLAGE)。作为一个应用示例,PALS用于分析按代谢途径和共享串联质谱碎片模式分组的代谢物。还给出了mPLAGE与其他两种常用方法(过度表达分析(ORA)和基因集富集分析(GSEA))的比较,结果表明mPLAGE比其他方法对缺失特征和噪声数据更具鲁棒性。作为进一步的示例,PALS还应用于人类非洲锥虫病、鼠李科和美国肠道项目的数据。此外,归一化可能对途径分析结果产生重大影响,PALS提供了一个框架来进一步研究这一点。PALS可从我们的项目网站免费获取。