Hu Xutao, Liu Wen, Xing Xiaopeng
School of Chemical Science and Engineering, Tongji University, Shanghai, China.
Rapid Commun Mass Spectrom. 2025 Dec 15;39(23):e10121. doi: 10.1002/rcm.10121.
Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) is a highly efficient technique for microbial identification; however, the accuracy has always been a problem when identifying closely related microbial species. Improving spectral data identification algorithms is one of the key approaches to enhancing the discriminatory power and reliability of identification for the closely related species.
This study develops a dimensionality reduction method based on inter-spectral distance computation for the analysis of MALDI-TOF MS data. The method comprises four steps: average spectrum construction, peak matching, distance calculation, and spectral vectorization. We applied this method, along with the conventional principal component analysis (PCA) method, to a MALDI-TOF MS dataset of closely related microbial species. Binary classification experiments were conducted to compare the classification performance of the two methods, and multiclass classification experiments were conducted to evaluate the feasibility of the proposed approach for database construction.
A systematic evaluation of the newly proposed distance-based method was conducted using MALDI-TOF mass spectral data from five pairs of closely related microbial species. The results indicated that this method effectively extracted spectral features and enabled accurate classification. It outperformed the conventional PCA method, and even other more sophisticated methods like LDA and t-SNE, in terms of both clustering performance and identification accuracy.
The findings suggest that the newly proposed distance-based dimensionality reduction algorithm (DbDRA) largely enhances the reliability of identifying closely related microbial species, highlighting its potential applicability in microbial identification using MALDI-TOF mass spectroscopy.
基质辅助激光解吸/电离飞行时间质谱(MALDI-TOF MS)是一种高效的微生物鉴定技术;然而,在鉴定亲缘关系相近的微生物物种时,准确性一直是个问题。改进光谱数据识别算法是提高亲缘关系相近物种鉴定的区分能力和可靠性的关键方法之一。
本研究开发了一种基于光谱间距离计算的降维方法,用于分析MALDI-TOF MS数据。该方法包括四个步骤:平均光谱构建、峰匹配、距离计算和光谱矢量化。我们将此方法与传统的主成分分析(PCA)方法一起应用于亲缘关系相近的微生物物种的MALDI-TOF MS数据集。进行二元分类实验以比较两种方法的分类性能,并进行多类分类实验以评估所提出方法用于数据库构建的可行性。
使用来自五对亲缘关系相近的微生物物种的MALDI-TOF质谱数据,对新提出的基于距离的方法进行了系统评估。结果表明,该方法有效地提取了光谱特征并实现了准确分类。在聚类性能和识别准确性方面,它优于传统的PCA方法,甚至优于其他更复杂的方法,如LDA和t-SNE。
研究结果表明,新提出的基于距离的降维算法(DbDRA)在很大程度上提高了亲缘关系相近的微生物物种鉴定的可靠性,突出了其在使用MALDI-TOF质谱进行微生物鉴定中的潜在适用性。