A Python-Based Pipeline for Preprocessing LC-MS Data for Untargeted Metabolomics Workflows.

Authors

Riquelme Gabriel, Zabalegui Nicolás, Marchi Pablo, Jones Christina M, Monge María Eugenia

Affiliations

Centro de Investigaciones en Bionanociencias (CIBION), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Godoy Cruz 2390, Ciudad de Buenos Aires C1425FQD, Argentina.

Departamento de Química Inorgánica Analítica y Química Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Buenos Aires C1428EGA, Argentina.

Publication information

Metabolites. 2020 Oct 16;10(10):416. doi: 10.3390/metabo10100416.

DOI: 10.3390/metabo10100416
PMID: 33081373
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7602939/

Abstract

Preprocessing data in a reproducible and robust way is one of the current challenges in untargeted metabolomics workflows. Data curation in liquid chromatography-mass spectrometry (LC-MS) involves the removal of biologically non-relevant features (retention time, m/z pairs) to retain only high-quality data for subsequent analysis and interpretation. The present work introduces TidyMS, a package for the Python programming language for preprocessing LC-MS data for quality control (QC) procedures in untargeted metabolomics workflows. It is a versatile strategy that can be customized or fit for purpose according to the specific metabolomics application. It allows performing quality control procedures to ensure accuracy and reliability in LC-MS measurements, and it allows preprocessing metabolomics data to obtain cleaned matrices for subsequent statistical analysis. The capabilities of the package are shown with pipelines for an LC-MS system suitability check, system conditioning, signal drift evaluation, and data curation. These applications were implemented to preprocess data corresponding to a new suite of candidate plasma reference materials developed by the National Institute of Standards and Technology (NIST; hypertriglyceridemic, diabetic, and African-American plasma pools) to be used in untargeted metabolomics studies in addition to NIST SRM 1950 Metabolites in Frozen Human Plasma. The package offers a rapid and reproducible workflow that can be used in an automated or semi-automated fashion, and it is an open and free tool available to all users.
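
The data curation step mentioned in the abstract, removing features that are unlikely to carry biological information, can be illustrated with a minimal sketch. This is not the TidyMS API: the function name `curate_features`, the class labels, and the thresholds (a QC coefficient of variation of at most 0.2 and a QC-to-blank intensity ratio of at least 3) are assumptions chosen for illustration only, and the sketch uses plain pandas on a samples-by-features intensity matrix.

```python
import pandas as pd


def curate_features(
    data: pd.DataFrame,
    sample_class: pd.Series,
    qc_label: str = "QC",
    blank_label: str = "blank",
    max_qc_cv: float = 0.2,        # hypothetical threshold
    min_blank_ratio: float = 3.0,  # hypothetical threshold
) -> pd.DataFrame:
    """Keep features that are reproducible in QC samples and clearly above blank level.

    `data` has one row per sample and one column per feature;
    `sample_class` gives the class of each sample and shares the index of `data`.
    """
    qc = data[sample_class == qc_label]
    blank = data[sample_class == blank_label]

    # Coefficient of variation of each feature across the QC injections.
    qc_cv = qc.std(ddof=1) / qc.mean()

    # Mean QC intensity relative to mean blank intensity
    # (a zero blank mean yields inf, so such features pass this filter).
    blank_ratio = qc.mean() / blank.mean()

    keep = (qc_cv <= max_qc_cv) & (blank_ratio >= min_blank_ratio)
    return data.loc[:, keep]


# Toy data matrix: three QC injections and two blanks.
data = pd.DataFrame(
    {
        "ft1": [100.0, 102.0, 98.0, 1.0, 2.0],    # stable in QC, low in blanks
        "ft2": [100.0, 10.0, 200.0, 1.0, 1.0],    # highly variable in QC
        "ft3": [50.0, 51.0, 49.0, 40.0, 45.0],    # also present in blanks
    },
    index=["qc1", "qc2", "qc3", "blank1", "blank2"],
)
sample_class = pd.Series(["QC", "QC", "QC", "blank", "blank"], index=data.index)

curated = curate_features(data, sample_class)
print(list(curated.columns))
```

In this toy matrix only `ft1` passes both filters: `ft2` fails the QC variability check and `ft3` fails the blank ratio check. Real workflows would choose thresholds and filters to suit the specific application, as the abstract notes.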


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41df/7602939/5f6ec1260ec9/metabolites-10-00416-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41df/7602939/7d3558646dbc/metabolites-10-00416-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41df/7602939/7629ac832d90/metabolites-10-00416-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41df/7602939/84c21c406ad3/metabolites-10-00416-g004.jpg
