The Compound Characteristics Comparison (CCC) approach: a tool for improving confidence in natural compound identification.

Authors

Narduzzi Luca, Stanstrup Jan, Mattivi Fulvio, Franceschi Pietro

Affiliations

a Research and Innovation Centre, Fondazione Edmund Mach (FEM), San Michele all'Adige, Italy.

b Department of Plant and Environmental Sciences, Faculty of Science, University of Copenhagen, Copenhagen, Denmark.

Publication information

Food Addit Contam Part A Chem Anal Control Expo Risk Assess. 2018 Nov;35(11):2145-2157. doi: 10.1080/19440049.2018.1523572. Epub 2018 Oct 23.

DOI:10.1080/19440049.2018.1523572
PMID:30352003
Abstract

Compound identification is the main hurdle in LC-HRMS-based metabolomics, given the high number of 'unknown' metabolites. In recent years, numerous in silico fragmentation simulators have been developed to simplify and improve mass spectral interpretation and compound annotation. Nevertheless, expert mass spectrometry users and chemists are still needed to select the correct entry from the numerous candidates proposed by automatic tools, especially in the plant kingdom, because of the huge structural diversity of natural compounds occurring in plants. In this work, we propose the use of a supervised machine learning approach to predict molecular substructures from isotopic patterns, training the model on a large database of grape metabolites. This approach, called 'Compounds Characteristics Comparison' (CCC), emulates the experience of a plant chemist who 'gains experience' from a (proof-of-principle) dataset of grape compounds. The results show that the CCC approach is able to predict most of the proposed substructures with good accuracy. In addition, after querying MS/MS spectra in MetFrag 2.2 and applying CCC predictions as scoring terms with real data, the CCC approach helped to rank the correct candidates higher, improving users' confidence in candidate selection. Our results demonstrate that the proposed approach can complement current identification strategies based on fragmentation simulators and formula calculators, assisting compound identification. The CCC algorithm is freely available as an R package (https://github.com/lucanard/CCC), which includes a seamless integration with MetFrag. The CCC package also permits uploading additional training data, which can be used to extend the proposed approach to other systems or biological matrices.
List of abbreviations: Acidic: acidic moiety; aliph: aliphatic chain; AUC: area under the ROC curve; bs: best glycosidic structure; CCC: Compounds' Characteristics Comparison; Cees: Carbons estimation errors; CO: Carbon to Oxygen ratio; Het: Heterocyclic moiety; IMD: Isotopic Mass Defect (and Pattern); LC-HRMS: Liquid Chromatography - High Resolution Mass Spectrometry; md: mass defect; MM: Monoisotopic Mass; MS: Mass Spectrometry; MSE: Mean Squared Error; nC: number of Carbons; NN: Nitrogen; pC: percentage of Carbon mass on the total mass; Pho: Phosphate; PLSr: Partial Least Square regression; ppm: parts per million; QSRR: Quantitative structure-retention relationship; RMD: Relative Mass Defect; ROC: Receiver Operating Characteristics; rRMD: residual Relative Mass Defect; RT: retention time; Sul: Sulphur; UPLC-ESI-Q-TOF-MS: Ultra Performance Liquid Chromatography - ElectroSpray Ionization - Quadrupole - Time of Flight - Mass Spectrometry; VAT: Vitis arizonica Texas.
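
Several of the abbreviations above (md, RMD, nC, pC) name quantities that can be derived from a monoisotopic mass and an isotopic pattern. The sketch below shows one common way to compute them, as an illustration only. It is written in Python rather than R, the function name `isotopic_features` is hypothetical, and the formulas are general mass spectrometry approximations; none of this is taken from the CCC package, whose actual feature definitions may differ.

```python
# Illustrative sketch only: general approximations, not the CCC package's code.

# Approximate natural abundance ratio of 13C to 12C (about 1.07 % 13C).
C13_TO_C12 = 0.0107 / 0.9893
# Mass of 12C in Da (exactly 12 by definition).
CARBON_MASS = 12.0


def isotopic_features(monoisotopic_mass, m1_to_m0_ratio):
    """Derive md, RMD, nC and pC for one detected feature.

    monoisotopic_mass: neutral monoisotopic mass (MM) in Da.
    m1_to_m0_ratio: intensity of the M+1 peak divided by the M peak.
    """
    # Assumption: nominal mass is approximated by rounding to the nearest integer.
    md = monoisotopic_mass - round(monoisotopic_mass)
    # Relative mass defect, expressed in ppm of the monoisotopic mass.
    rmd_ppm = md / monoisotopic_mass * 1e6
    # Assumption: the M+1 peak is attributed to 13C alone, ignoring the
    # smaller contributions of 2H, 15N, 17O and 33S.
    n_carbons = m1_to_m0_ratio / C13_TO_C12
    # Percentage of the total mass contributed by carbon.
    pc_percent = n_carbons * CARBON_MASS / monoisotopic_mass * 100
    return {
        "md": md,
        "rmd_ppm": rmd_ppm,
        "n_carbons": n_carbons,
        "pc_percent": pc_percent,
    }


if __name__ == "__main__":
    # Glucose (C6H12O6), monoisotopic mass 180.06339 Da; the M+1/M ratio of
    # 0.0649 is a made-up input close to what six carbons would produce.
    for name, value in isotopic_features(180.06339, 0.0649).items():
        print(f"{name}: {value:.4f}")
```

Features of this kind are what a supervised model could take as input when predicting substructures; the estimated carbon count is only as good as the measured isotope ratio, which is presumably why the abbreviation list includes a carbons estimation error (Cees).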

Similar articles

1
The Compound Characteristics Comparison (CCC) approach: a tool for improving confidence in natural compound identification.
Food Addit Contam Part A Chem Anal Control Expo Risk Assess. 2018 Nov;35(11):2145-2157. doi: 10.1080/19440049.2018.1523572. Epub 2018 Oct 23.
2
Compound annotation in liquid chromatography/high-resolution mass spectrometry based metabolomics: robust adduct ion determination as a prerequisite to structure prediction in electrospray ionization mass spectra.
Rapid Commun Mass Spectrom. 2017 Aug 15;31(15):1261-1266. doi: 10.1002/rcm.7905.
3
High-resolution liquid chromatography/electrospray ionization time-of-flight mass spectrometry combined with liquid chromatography/electrospray ionization tandem mass spectrometry to identify polyphenols from grape antioxidant dietary fiber.
Rapid Commun Mass Spectrom. 2008 Nov;22(22):3489-500. doi: 10.1002/rcm.3756.
4
Liquid chromatography-quadrupole time of flight tandem mass spectrometry-based targeted metabolomic study for varietal discrimination of grapes according to plant sterols content.
J Chromatogr A. 2016 Jul 8;1454:67-77. doi: 10.1016/j.chroma.2016.05.081. Epub 2016 May 24.
5
Implementation of a semi-automated strategy for the annotation of metabolomic fingerprints generated by liquid chromatography-high resolution mass spectrometry from biological samples.
Analyst. 2012 Nov 7;137(21):4958-67. doi: 10.1039/c2an35865d. Epub 2012 Sep 12.
6
A comprehensive high-resolution mass spectrometry approach for characterization of metabolites by combination of ambient ionization, chromatography and imaging methods.
Rapid Commun Mass Spectrom. 2014 Aug 30;28(16):1779-91. doi: 10.1002/rcm.6960.
7
Annotation of metabolites from gas chromatography/atmospheric pressure chemical ionization tandem mass spectrometry data using an in silico generated compound database and MetFrag.
Rapid Commun Mass Spectrom. 2015 Aug 30;29(16):1521-9. doi: 10.1002/rcm.7244.
8
Evaluation of an Artificial Neural Network Retention Index Model for Chemical Structure Identification in Nontargeted Metabolomics.
Anal Chem. 2018 Nov 6;90(21):12752-12760. doi: 10.1021/acs.analchem.8b03118. Epub 2018 Oct 24.
9
MetFusion: integration of compound identification strategies.
J Mass Spectrom. 2013 Mar;48(3):291-8. doi: 10.1002/jms.3123.
10
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification

Cited by

1
LC-MS untargeted approach showed that methyl jasmonate application on Vitis labrusca L. grapes increases phenolics at subtropical Brazilian regions.
Metabolomics. 2020 Jan 23;16(2):18. doi: 10.1007/s11306-020-1641-z.
2
The metaRbolomics Toolbox in Bioconductor and beyond.
Metabolites. 2019 Sep 23;9(10):200. doi: 10.3390/metabo9100200.