Goldberg David, Bern Marshall, North Simon J, Haslam Stuart M, Dell Anne
Palo Alto Research Center, 3333 Coyote Hill Rd, Palo Alto CA 94304, USA.
Bioinformatics. 2009 Feb 1;25(3):365-71. doi: 10.1093/bioinformatics/btn636. Epub 2008 Dec 10.
In the past few years, mass spectrometry (MS) has emerged as the premier tool for identification and quantification of biological molecules such as peptides and glycans. There are two basic strategies: single-MS, which uses a single round of mass analysis, and MS/MS (or higher order MS(n)), which adds one or more additional rounds of mass analysis, interspersed with fragmentation steps. Single-MS offers higher throughput, broader mass coverage and more direct quantitation, but generally much weaker identification. Single-MS, however, does work fairly well for the case of N-glycan identification, which are more constrained than other biological polymers. We previously demonstrated single-MS identification of N-glycans to the level of 'cartoons' (monosaccharide composition and topology) by a system that incorporates an expert's detailed knowledge of the biological sample. In this article, we explore the possibility of ab initio single-MS N-glycan identification, with the goal of extending single-MS, or primarily-single-MS, identification to non-expert users, novel conditions and unstudied tissues.
We propose and test three cartoon-assignment algorithms that make inferences informed by biological knowledge about glycan synthesis. To test the algorithms, we used 71 single-MS spectra from a variety of tissues and organisms, containing more than 2800 manually annotated peaks. The most successful of the algorithms computes the most richly connected subgraph within a 'cartoon graph'. This algorithm uniquely assigns the correct cartoon to more than half of the peaks in 41 out of the 71 spectra.
在过去几年中,质谱(MS)已成为鉴定和定量肽和聚糖等生物分子的首要工具。有两种基本策略:单重质谱(single-MS),它使用一轮质量分析;以及串联质谱(MS/MS,或更高阶的MS(n)),它增加一轮或多轮额外的质量分析,并穿插碎片化步骤。单重质谱具有更高的通量、更宽的质量覆盖范围和更直接的定量,但通常鉴定能力较弱。然而,单重质谱对于N-聚糖的鉴定效果相当不错,N-聚糖比其他生物聚合物的结构限制更多。我们之前通过一个纳入了专家对生物样品详细知识的系统,展示了单重质谱对N-聚糖的鉴定可达到“卡通图”(单糖组成和拓扑结构)水平。在本文中,我们探索从头开始进行单重质谱N-聚糖鉴定的可能性,目标是将单重质谱或主要是单重质谱的鉴定扩展到非专业用户、新条件和未研究的组织。
我们提出并测试了三种基于聚糖合成生物学知识进行推断的卡通图分配算法。为了测试这些算法,我们使用了来自各种组织和生物体的71个单重质谱图,其中包含超过2800个手动注释的峰。最成功的算法在“卡通图”内计算连接最丰富的子图。该算法在71个光谱中的41个光谱中,将正确的卡通图唯一地分配给了超过一半的峰。