Helmeczi Erick, Kroezen Zachary, Shanmuganathan Meera, Stanciu Ana Ruxandra, Martinez Vanessa, Kurysko Natasia, Normando Paula, Castro Inês Rugani Ribeiro de, Schincaglia Raquel Machado, Kac Gilberto, Britz-McKibbin Philip
Department of Chemistry and Chemical Biology, McMaster University, Hamilton, Ontario L8S 4M1, Canada.
Nutritional Epidemiology Observatory, Josué de Castro Nutrition Institute, Rio de Janeiro Federal University, Rio de Janeiro 21941-902, Brazil.
Anal Chem. 2025 Jan 14;97(1):175-184. doi: 10.1021/acs.analchem.4c03513. Epub 2024 Dec 27.
Mass spectrometry (MS)-based metabolomics often rely on separation techniques when analyzing complex biological specimens to improve method resolution, metabolome coverage, quantitative performance, and/or unknown identification. However, low sample throughput and complicated data preprocessing procedures remain major barriers to affordable metabolomic studies that are scalable to large populations. Herein, we introduce PeakMeister as a new software tool in the R statistical environment to enable standardized processing of serum metabolomic data acquired by multisegment injection-capillary electrophoresis-mass spectrometry (MSI-CE-MS), a high-throughput separation platform (<4 min/sample) which takes advantage of a serial injection format of 13 samples within a single analytical run. We performed a rigorous validation of PeakMeister by analyzing 47 cationic metabolites consistently measured in 5,000 serum and 420 quality control samples from the Brazilian National Survey on Child Nutrition (ENANI-2019) comprising a total of 224,983 metabolite peaks acquired in 40 days across three batches over an eight-month period. A migration time index using a panel of 11 internal standards was introduced to correct for large variations in migration times, which allowed for reliable peak annotation, peak integration, and sample position assignment for serum metabolites having two flanking internal standards or a single comigrating stable-isotope internal standard. PeakMeister accelerated data preprocessing times by 30-fold compared to manual processing of MSI-CE-MS data by an experienced analyst using vendor software, while also achieving excellent peak annotation fidelity (median accuracy >99.9%), acceptable intermediate precision (median CV = 16.0%), consistent metabolite peak integration (mean bias = -2.1%), and good mutual agreement when quantifying 16 plasma metabolites from NIST SRM-1950 (mean bias = -1.3%). Reference ranges are also reported for 40 serum metabolites in a national nutritional survey of Brazilian children under 5 years of age from the ENANI-2019 study. MSI-CE-MS in conjunction with PeakMeister allows for rapid and automated processing of large-scale metabolomic studies that tolerate nonlinear migration time shifts without complicated dynamic time warping or effective mobility scale transformations.
基于质谱(MS)的代谢组学在分析复杂生物样本时,通常依赖分离技术来提高方法分辨率、代谢组覆盖范围、定量性能和/或未知物鉴定能力。然而,低样本通量和复杂的数据预处理程序仍然是可扩展至大规模人群的经济实惠的代谢组学研究的主要障碍。在此,我们介绍PeakMeister,这是R统计环境中的一种新软件工具,用于对通过多段进样-毛细管电泳-质谱(MSI-CE-MS)获取的血清代谢组学数据进行标准化处理,MSI-CE-MS是一种高通量分离平台(<4分钟/样本),它利用单次分析运行中13个样本的序列进样格式。我们通过分析在巴西全国儿童营养调查(ENANI-2019)的5000份血清和420份质量控制样本中一致测量的47种阳离子代谢物,对PeakMeister进行了严格验证,这些样本在八个月内分三批共40天采集了总共224,983个代谢物峰。引入了一个使用11种内标物的迁移时间指数来校正迁移时间的大幅变化,这使得对于具有两个侧翼内标物或单个共迁移稳定同位素内标物的血清代谢物能够进行可靠的峰注释、峰积分和样本位置分配。与经验丰富的分析师使用供应商软件手动处理MSI-CE-MS数据相比,PeakMeister将数据预处理时间加快了30倍,同时还实现了出色的峰注释保真度(中位准确率>99.9%)、可接受的中间精密度(中位CV = 16.0%)、一致的代谢物峰积分(平均偏差 = -2.1%),并且在对NIST SRM-1950中的16种血浆代谢物进行定量时具有良好的相互一致性(平均偏差 = -1.3%)。还报告了来自ENANI-2019研究的巴西5岁以下儿童全国营养调查中40种血清代谢物的参考范围。MSI-CE-MS与PeakMeister相结合,能够对大规模代谢组学研究进行快速自动化处理,该研究能够容忍非线性迁移时间偏移,而无需复杂的动态时间规整或有效迁移率标度转换。