液相色谱-质谱联用数据中的高灵敏度和特异性特征检测：深度学习框架。

High sensitivity and specificity feature detection in liquid chromatography-mass spectrometry data: A deep learning framework.

机构信息

CAS Key Laboratory of Separation Sciences for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 106023, China.

出版信息

Talanta. 2021 Jan 15;222:121580. doi: 10.1016/j.talanta.2020.121580. Epub 2020 Aug 28.

DOI:10.1016/j.talanta.2020.121580

PMID:33167267

Abstract

Feature detection is a crucial pre-processing step for high-resolution liquid chromatography-mass spectrometry (LC-MS) data analysis. Typical practices based on thresholds or rigid mathematical assumptions can cause ineffective performance in detecting low abundance and non-ideal distributed compounds. We herein introduce a novel feature detection method based on deep learning named SeA-M2Net that considers feature detection as an image-based object detection task. By fully employing raw data directly, and integrating all related factors (e.g., LC elution, charge state, and isotope distribution) with two-dimensional pseudo color images to calculate the probability of the presence of the compound, low abundance compounds can be well preserved and observed. More importantly, SeA-M2Net, with deep multilevel and multiscale structures focuses on compound pattern detection in a learned method instead of assuming a mathematical parametric model. All parameters in SeA-M2Net are learned from data in the training procedure, thus allowing for maximum flexibility of pattern distribution deformation. The algorithm is tested on several LC-MS datasets of multiple biological samples obtained from different instruments with varied experimental settings. We demonstrate the superiority of the new approach in handling complex compound patterns (e.g., low abundance, overlapping regions, LC shifts, and missing values). Our experiments indicate that SeA-M2Net outperforms widely used detection methods in terms of detection accuracy.

摘要

特征检测是高分辨率液相色谱-质谱（LC-MS）数据分析的关键预处理步骤。基于阈值或严格数学假设的典型方法可能会导致在检测低丰度和非理想分布化合物时性能不佳。我们在此引入了一种基于深度学习的新型特征检测方法，称为 SeA-M2Net，它将特征检测视为基于图像的目标检测任务。通过直接充分利用原始数据，并将所有相关因素（例如 LC 洗脱、电荷状态和同位素分布）与二维伪彩色图像集成，以计算化合物存在的概率，可以很好地保留和观察低丰度化合物。更重要的是，SeA-M2Net 具有深层次的多级和多尺度结构，专注于以学习的方法检测化合物模式，而不是假设数学参数模型。SeA-M2Net 中的所有参数都是在训练过程中从数据中学习得到的，因此允许模式分布变形的最大灵活性。该算法在来自不同仪器的多个具有不同实验设置的生物样本的多个 LC-MS 数据集上进行了测试。我们证明了该新方法在处理复杂化合物模式（例如低丰度、重叠区域、LC 位移和缺失值）方面的优越性。我们的实验表明，SeA-M2Net 在检测准确性方面优于广泛使用的检测方法。

相似文献

High sensitivity and specificity feature detection in liquid chromatography-mass spectrometry data: A deep learning framework.液相色谱-质谱联用数据中的高灵敏度和特异性特征检测：深度学习框架。

Talanta. 2021 Jan 15;222:121580. doi: 10.1016/j.talanta.2020.121580. Epub 2020 Aug 28.

Deep Learning Based MS2 Feature Detection for Data-Independent Shotgun Proteomics.基于深度学习的数据非依赖鸟枪法蛋白质组学中的MS2特征检测

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2022 Dec;2022:2342-2348. doi: 10.1109/bibm55620.2022.9995258. Epub 2023 Jan 2.

Artificial Neural Network for Probabilistic Feature Recognition in Liquid Chromatography Coupled to High-Resolution Mass Spectrometry.人工神经网络在液相色谱与高分辨率质谱联用中用于概率特征识别。

Anal Chem. 2017 Jan 17;89(2):1212-1221. doi: 10.1021/acs.analchem.6b03678. Epub 2016 Dec 29.

Shape-based feature matching improves protein identification via LC-MS and tandem MS.基于形状的特征匹配通过液相色谱-质谱联用和串联质谱提高蛋白质鉴定水平。

J Comput Biol. 2011 Apr;18(4):547-57. doi: 10.1089/cmb.2010.0155. Epub 2011 Mar 21.

Graph-based peak alignment algorithms for multiple liquid chromatography-mass spectrometry datasets.基于图的多液相色谱-质谱数据集的峰对齐算法。

Bioinformatics. 2013 Oct 1;29(19):2469-76. doi: 10.1093/bioinformatics/btt435. Epub 2013 Jul 30.

An accurate-mass-based spectral-averaging isotope-pattern-filtering algorithm for extraction of drug metabolites possessing a distinct isotope pattern from LC-MS data.一种基于精确质量的光谱平均同位素模式过滤算法，用于从液相色谱-质谱数据中提取具有独特同位素模式的药物代谢物。

Anal Chem. 2009 Jul 15;81(14):5910-7. doi: 10.1021/ac900626d.

Role of liquid chromatography-high-resolution mass spectrometry (LC-HR/MS) in clinical toxicology.液相色谱-高分辨率质谱（LC-HR/MS）在临床毒理学中的作用。

Clin Toxicol (Phila). 2012 Sep;50(8):733-42. doi: 10.3109/15563650.2012.713108. Epub 2012 Aug 13.

Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学：基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍

Inversion of peak elution order prevents uniform time alignment of complex liquid-chromatography coupled to mass spectrometry datasets.峰洗脱顺序的倒置会妨碍复杂液相色谱与质谱数据集的均匀时间对齐。

J Chromatogr A. 2014 Dec 19;1373:61-72. doi: 10.1016/j.chroma.2014.10.101. Epub 2014 Nov 13.

LC-MSsim--a simulation software for liquid chromatography mass spectrometry data.LC-MSsim——一款用于液相色谱质谱数据的模拟软件。

BMC Bioinformatics. 2008 Oct 8;9:423. doi: 10.1186/1471-2105-9-423.

引用本文的文献

Autonomous CE Mass-Spectra Examination for the Ocean Worlds Life Surveyor.用于海洋世界生命探测仪的自主彗星质谱检查

Earth Space Sci. 2022 Oct;9(10):e2022EA002247. doi: 10.1029/2022EA002247. Epub 2022 Oct 18.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

液相色谱-质谱联用数据中的高灵敏度和特异性特征检测：深度学习框架。

High sensitivity and specificity feature detection in liquid chromatography-mass spectrometry data: A deep learning framework.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献