Suppr超能文献

一种基于改良代谢组和探针分子的重要通路相关代谢物高效筛选与注释新方法

[A novel method for efficient screening and annotation of important pathway-associated metabolites based on the modified metabolome and probe molecules].

作者信息

Li Zaifang, Zheng Fujian, Xia Yueyi, Zhang Xiuqiong, Wang Xinxin, Zhao Chunxia, Zhao Xinjie, Lu Xin, Xu Guowang

机构信息

CAS Key Laboratory of Separation Science for Analytical Chemistry, Liaoning Province Key Laboratory of Metabolomics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China.

University of Chinese Academy of Sciences, Beijing 100049, China.

出版信息

Se Pu. 2022 Sep;40(9):788-796. doi: 10.3724/SP.J.1123.2022.03025.

Abstract

Plants produce a wide variety of secondary metabolites in the process of evolution. Secondary metabolites have highly diverse structures due to the modification of the basic skeletons of metabolites. They are required for interaction with the environment and are produced in response to abiotic/biotic stress. Characterization of secondary metabolic pathways is significant to plant molecular breeding and natural product biosynthesis. The liquid chromatography-high resolution tandem mass spectrometry (LC-HRMS/MS) is one of the major techniques for untargeted metabolomics study. The LC-HRMS/MS method could detect tens of thousands of metabolic features and provide abundant structural information. It has been widely used in the discovery and characterization of the secondary metabolome. However, due to the largely diverse structure and limited records in the mass spectral library, the annotation of the secondary metabolome is very difficult. To address the analytical challenges associated with the vast structural diversity and the large numbers of secondary metabolites, particularly those previously unknown structural metabolites, a novel method for the efficient characterization of pathway-associated metabolites was developed. Modification reactions and MS/MS spectral information were collected using the metabolic pathways database and mass spectral library. Screening and annotation of metabolites involved in phenylpropanoid metabolism in maize leaves were used as an example. First, a database of modified groups was established via pathway-associated modifications from open access metabolic pathway database and literature. Here, pathway databases included the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Plant Metabolic Pathways (PlantCyc). A total of 61 modification types were enrolled, including 10 generic and 51 pathway-specific modifications. Modified metabolomes were filtered from untargeted LC-HRMS/MS metabolomics data. Next, MS/MS spectra of the pathway-associated compounds (probe molecules) were collected in the Global Natural Products Social Molecular Networking (GNPS) MS/MS spectral library. The MS/MS of compounds assigned to chemical classes of phenylpropanoids were kept. An MS/MS spectral database of the probe molecules was constructed. It included 2677 spectra of 1542 phenylpropanoid compounds in the positive mode and 814 spectra of 661 phenylpropanoid compounds in the negative mode. Then, an MS/MS molecular network was generated by modified metabolome and probe molecules. The clusters comprising both probe molecules and modified metabolites were kept. To explore more previously unknown structural metabolites, the clusters with one more pathway-specific modified metabolite were retained even though they didn't contain any probe molecule. A total of 392 and 417 phenylpropanoid pathway-related metabolic metabolites were obtained in positive and negative ion modes, respectively. The pathway-associated metabolites were annotated based on the propagation of the molecular network. For the metabolites within the co-cluster, annotations were performed using the probe molecules as the initial seed. The modification group's substructure information was used for network propagation annotation. For the clusters containing only pathway-specific modified metabolites, the annotation is similar to the above process if identified nodes were present within the cluster. Otherwise, de novo annotation was manually executed based on substructure information. Finally, 129 unique metabolites were annotated after integration and removal of redundancy. Ten annotated metabolites were validated using commercially available or synthesized reference compounds. The other annotation results were validated using predicted chemical classes, MS/MS, and predicted retention time. They are mainly involved in the downstream branch of phenylpropanoid pathways, including the flavonoid pathway (8 flavonoids, 19 flavonoid -glycosides, 32 flavonoid -glycosides), the hydroxycinnamic acid pathway (31 hydroxycinnamic acids and derivatives), and the lignan pathway (22 neo-lignans/lignan/lignan glycosides). All the annotated structures were searched against the PubChem and SciFinder databases. Among them, 26 metabolites were previously unreported in both the databases. In this study, the pathway-associated metabolites could be quickly discovered and annotated by the integration of probe molecules and modified metabolome. It provides a method for the in-depth study of the phenylpropanoid pathway.

摘要

植物在进化过程中产生了各种各样的次生代谢产物。由于代谢产物基本骨架的修饰,次生代谢产物具有高度多样的结构。它们是与环境相互作用所必需的,并在非生物/生物胁迫下产生。次生代谢途径的表征对植物分子育种和天然产物生物合成具有重要意义。液相色谱 - 高分辨率串联质谱(LC - HRMS/MS)是用于非靶向代谢组学研究的主要技术之一。LC - HRMS/MS方法可以检测数以万计的代谢特征,并提供丰富的结构信息。它已广泛应用于次生代谢组的发现和表征。然而,由于结构差异很大且质谱库中的记录有限,次生代谢组的注释非常困难。为了解决与巨大的结构多样性和大量次生代谢产物相关的分析挑战,特别是那些以前未知结构的代谢产物,开发了一种有效表征途径相关代谢产物的新方法。利用代谢途径数据库和质谱库收集修饰反应和MS/MS光谱信息。以玉米叶片中参与苯丙烷代谢的代谢产物的筛选和注释为例。首先,通过从开放获取的代谢途径数据库和文献中进行与途径相关的修饰,建立了一个修饰基团数据库。这里,途径数据库包括京都基因与基因组百科全书(KEGG)和植物代谢途径(PlantCyc)。总共纳入了61种修饰类型,包括10种通用修饰和51种途径特异性修饰。从非靶向LC - HRMS/MS代谢组学数据中筛选出修饰代谢组。接下来,在全球天然产物社会分子网络(GNPS)MS/MS光谱库中收集途径相关化合物(探针分子)的MS/MS光谱。保留分配到苯丙烷类化学类别的化合物的MS/MS。构建了探针分子的MS/MS光谱数据库。它包括正模式下来自1542种苯丙烷类化合物的2677个光谱和负模式下来自661种苯丙烷类化合物的814个光谱。然后,由修饰代谢组和探针分子生成MS/MS分子网络。保留包含探针分子和修饰代谢产物的簇。为了探索更多以前未知结构的代谢产物,即使不包含任何探针分子,也保留含有一种以上途径特异性修饰代谢产物的簇。在正离子模式和负离子模式下分别获得了392个和417个与苯丙烷途径相关的代谢代谢产物。基于分子网络的传播对途径相关代谢产物进行注释。对于共簇内的代谢产物,以探针分子作为初始种子进行注释。修饰基团的子结构信息用于网络传播注释。对于仅包含途径特异性修饰代谢产物的簇,如果簇内存在已识别的节点,则注释过程与上述过程类似。否则,根据子结构信息手动进行从头注释。最后,经过整合和去除冗余后,注释了129种独特的代谢产物。使用市售或合成的参考化合物对10种注释的代谢产物进行了验证。其他注释结果使用预测的化学类别、MS/MS和预测的保留时间进行验证。它们主要参与苯丙烷途径下游分支,包括类黄酮途径(8种类黄酮、19种黄酮糖苷、32种黄酮糖苷)、羟基肉桂酸途径(31种羟基肉桂酸及其衍生物)和木脂素途径(22种新木脂素/木脂素/木脂素糖苷)。所有注释的结构都在PubChem和SciFinder数据库中进行了搜索。其中,有26种代谢产物在这两个数据库中均未被报道。在本研究中,通过整合探针分子和修饰代谢组,可以快速发现和注释途径相关代谢产物。它为深入研究苯丙烷途径提供了一种方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa78/9520374/188790bd829f/cjc-40-09-788-img_1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验