NPI-Semel Institute for Neuroscience and Human Behavior, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California 90024, United States.
Anal Chem. 2012 May 15;84(10):4383-95. doi: 10.1021/ac203337s. Epub 2012 May 2.
Top-down mass spectrometry has been used to investigate structural diversity within some abundant salivary protein families. In this study, we report the identification of two isoforms of protein II-2 which differed in mass by less than 1 Da, the determination of a sequence for protein IB8a that was best satisfied by including a mutation and a covalent modification in the C-terminal part, and the assignment of a sequence of a previously unreported protein of mass 10433 Da. The final characterization of Peptide P-J was achieved, and the discovery of a truncated form of this peptide was reported. The first sequence assignment was done at low resolution using a hybrid quadrupole time-of-flight instrument to quickly identify and characterize proteins, and data acquisition was switched to Fourier-transform ion cyclotron resonance (FTICR) for proteins that required additional sequence coverage and certainty of assignment. High-resolution and high mass accuracy mass spectrometry on a FTICR-mass spectrometry (MS) instrument combined with electron-capture dissociation (ECD) provided the most informative data sets, with the more frequent presence of "unique" ions that unambiguously define the primary structure. A mixture of predictable and unusual post-translational modifications in the protein sequence precluded the use of shotgun-annotated databases at this stage, requiring manual iterations of sequence refinement in many cases. This led us to propose guidelines for an iterative processing workflow of MS and MSMS data sets that allow researchers to completely assign the identity and the structure of a protein.
自上而下的质谱分析已被用于研究某些丰富的唾液蛋白家族内的结构多样性。在这项研究中,我们报告了两种质量相差不到 1 Da 的蛋白 II-2 同工型的鉴定,确定了蛋白 IB8a 的序列,最好通过在 C 末端包含突变和共价修饰来满足,以及以前未报道的分子量为 10433 Da 的蛋白的序列分配。肽 P-J 的最终特征得以确定,并报告了该肽的截断形式的发现。首次序列分配是在低分辨率下使用混合四极杆飞行时间仪器进行的,以快速识别和表征蛋白质,并且对于需要额外序列覆盖和分配确定性的蛋白质,数据采集切换到傅里叶变换离子回旋共振(FTICR)。在傅里叶变换离子回旋共振质谱仪(MS)上进行高分辨率和高质量精度 MS 与电子捕获解离(ECD)相结合,提供了最具信息量的数据组,其中“独特”离子的出现更为频繁,这些离子明确定义了一级结构。蛋白质序列中存在可预测和不寻常的翻译后修饰混合物,使得在现阶段无法使用 shotgun 注释数据库,在许多情况下需要手动迭代序列细化。这使我们提出了 MS 和 MSMS 数据集的迭代处理工作流程指南,使研究人员能够完全分配蛋白质的身份和结构。