Intelligent Systems Program, Department of Computer Science, University of Pittsburgh, Pittsburgh, PA 15260, USA.
IEEE/ACM Trans Comput Biol Bioinform. 2010 Jan-Mar;7(1):126-37. doi: 10.1109/TCBB.2008.31.
Whole-sample mass spectrometry (MS) proteomics allows for a parallel measurement of hundreds of proteins present in a variety of biospecimens. Unfortunately, the association between MS signals and these proteins is not straightforward. The need to interpret mass spectra demands the development of methods for accurate labeling of ion species in such profiles. To aid this process, we have developed a new peak-labeling procedure for associating protein and peptide labels with peaks. This computational method builds upon characteristics of proteins expected to be in the sample, such as the amino sequence, mass weight, and expected concentration within the sample. A new probabilistic score that incorporates this information is proposed. We evaluate and demonstrate our method's ability to label peaks first on simulated MS spectra and then on MS spectra from human serum with a spiked-in calibration mixture.
全样本质谱(MS)蛋白质组学允许对各种生物标本中存在的数百种蛋白质进行平行测量。不幸的是,MS 信号与这些蛋白质之间的关联并不直接。需要解释质谱,因此需要开发用于准确标记此类谱中离子种类的方法。为了辅助这一过程,我们开发了一种新的峰标记程序,用于将蛋白质和肽标记与峰相关联。这种计算方法基于预期存在于样品中的蛋白质的特性,例如氨基酸序列、质量重量以及在样品中的预期浓度。提出了一种新的概率评分,其中包含此信息。我们首先在模拟 MS 光谱上评估和展示我们的方法标记峰的能力,然后在含有掺入校准混合物的人血清的 MS 光谱上进行评估和展示。