Fagerquist Clifton K
Western Regional Research Center, Agricultural Research Service, United States Department of Agriculture, 800 Buchanan Street, Albany, California 94710, USA.
J Proteome Res. 2007 Jul;6(7):2539-49. doi: 10.1021/pr060700o. Epub 2007 May 18.
We have identified the protein biomarkers observed in the matrix-assisted laser desorption/ionization time-of-flight mass spectra (MALDI-TOF-MS) of cell lysates of five strains of Campylobacter upsaliensis and one strain of C. helveticus by "bottom-up" proteomic techniques. Only one C. upsaliensis strain had previously been genomically sequenced. The significant findings are as follows: (1) The protein biomarkers identified were: 10 kD chaperonin, protein of unknown function (DUF465), phnA protein, probable periplasmic protein, D-methionine-binding lipoprotein MetQ, cytochrome c family protein, DNA-binding protein HU, thioredoxin, asparigenase family protein, helix-turn-helix domain protein, as well as several ribosomal and conserved hypothetical proteins. (2) Amino acid substitutions in protein biomarkers across species and strains account for variations in biomarker ion mass-to-charge (m/z). (3) The most common post-translational modifications (PTMs) identified were cleavage of N-terminal methionine and N-terminal signal peptides. The rule that predicts N-terminal methionine cleavage, based on the penultimate residue, does not appear to apply to C. upsaliensis proteins when the penultimate residue is threonine. (4) It was discovered that some protein biomarker genes of the genomically sequenced C. upsaliensis strain were found to have nucleotide sequences with GTG or TTG "start" codons that were not the actual start codon (ATG) of the protein based on proteomic analysis. (5) Proteomic identification of the protein biomarkers of the non-genomically sequenced C. upsaliensis and C. helveticus strains involved identification of homologous protein amino acid sequences to that of the sequenced strain. Interestingly, some protein sequence regions that were not completely homologous to the sequenced strain, due to amino acid substitutions, were found to have homologous sequence regions from more phyogenetically distant species/strains, e.g., C. jejuni. Exploiting this partial homology of more distant species/strains, it was possible to construct a "composite" amino acid sequence using multiple non-overlapping sequence regions from both phylogenetically proximate and distant strains. The new composite sequence was confirmed by both MS and MS/MS data. Thus, it was possible in some cases to determine the amino acid sequence of an unknown protein biomarker from a genomically non-sequenced bacterial strain without the necessity of either genetically sequencing the biomarker gene or resorting to de novo MS/MS analysis of the full protein sequence.
我们通过“自下而上”的蛋白质组学技术,鉴定了在五株乌普萨拉弯曲杆菌和一株瑞士弯曲杆菌细胞裂解物的基质辅助激光解吸/电离飞行时间质谱(MALDI-TOF-MS)中观察到的蛋白质生物标志物。此前仅有一株乌普萨拉弯曲杆菌进行过基因组测序。重要发现如下:(1)鉴定出的蛋白质生物标志物有:10kD伴侣蛋白、功能未知蛋白(DUF465)、phnA蛋白、可能的周质蛋白、D-甲硫氨酸结合脂蛋白MetQ、细胞色素c家族蛋白、DNA结合蛋白HU、硫氧还蛋白、天冬酰胺酶家族蛋白、螺旋-转角-螺旋结构域蛋白,以及几种核糖体蛋白和保守的假定蛋白。(2)跨物种和菌株的蛋白质生物标志物中的氨基酸替换导致生物标志物离子质荷比(m/z)出现差异。(3)鉴定出的最常见的翻译后修饰(PTM)是N端甲硫氨酸和N端信号肽的切割。基于倒数第二个残基预测N端甲硫氨酸切割的规则,在倒数第二个残基为苏氨酸时,似乎不适用于乌普萨拉弯曲杆菌的蛋白质。(4)发现基因组测序的乌普萨拉弯曲杆菌菌株的一些蛋白质生物标志物基因具有GTG或TTG“起始”密码子,基于蛋白质组学分析,这些并非蛋白质的实际起始密码子(ATG)。(5)对未进行基因组测序的乌普萨拉弯曲杆菌和瑞士弯曲杆菌菌株的蛋白质生物标志物进行蛋白质组学鉴定,涉及鉴定与已测序菌株同源的蛋白质氨基酸序列。有趣的是,一些由于氨基酸替换而与已测序菌株不完全同源的蛋白质序列区域,被发现具有来自系统发育关系更远的物种/菌株(如空肠弯曲杆菌)的同源序列区域。利用更远的物种/菌株的这种部分同源性,有可能使用来自系统发育关系相近和较远菌株的多个不重叠序列区域构建一个“复合”氨基酸序列。新的复合序列通过MS和MS/MS数据得到证实。因此,在某些情况下,无需对生物标志物基因进行基因测序或对完整蛋白质序列进行从头MS/MS分析,就有可能确定来自未进行基因组测序的细菌菌株的未知蛋白质生物标志物的氨基酸序列。