Hilario Melanie, Kalousis Alexandros, Pellegrini Christian, Müller Markus
Artificial Intelligence Laboratory, Computer Science Department, University of Geneva, CH-1211 Geneva 4, Switzerland.
Mass Spectrom Rev. 2006 May-Jun;25(3):409-49. doi: 10.1002/mas.20072.
Among the many applications of mass spectrometry, biomarker pattern discovery from protein mass spectra has aroused considerable interest in the past few years. While research efforts have raised hopes of early and less invasive diagnosis, they have also brought to light the many issues to be tackled before mass-spectra-based proteomic patterns become routine clinical tools. Known issues cover the entire pipeline leading from sample collection through mass spectrometry analytics to biomarker pattern extraction, validation, and interpretation. This study focuses on the data-analytical phase, which takes as input mass spectra of biological specimens and discovers patterns of peak masses and intensities that discriminate between different pathological states. We survey current work and investigate computational issues concerning the different stages of the knowledge discovery process: exploratory analysis, quality control, and diverse transforms of mass spectra, followed by further dimensionality reduction, classification, and model evaluation. We conclude after a brief discussion of the critical biomedical task of analyzing discovered discriminatory patterns to identify their component proteins as well as interpret and validate their biological implications.
在质谱分析的众多应用中,从蛋白质质谱图中发现生物标志物模式在过去几年引起了相当大的关注。尽管研究工作带来了早期诊断和微创诊断的希望,但也揭示了在基于质谱的蛋白质组学模式成为常规临床工具之前需要解决的许多问题。已知问题涵盖了从样本采集到质谱分析,再到生物标志物模式提取、验证和解释的整个流程。本研究聚焦于数据分析阶段,该阶段将生物样本的质谱图作为输入,发现能够区分不同病理状态的峰质量和强度模式。我们综述了当前的工作,并研究了知识发现过程不同阶段的计算问题:探索性分析、质量控制、质谱的各种变换,随后进行进一步的降维、分类和模型评估。在简要讨论了分析发现的鉴别模式以识别其组成蛋白质并解释和验证其生物学意义这一关键生物医学任务后,我们得出了结论。