Rison Stuart C G, Mattow Jens, Jungblut Peter R, Stoker Neil G
The Royal Veterinary College, Royal College Street, London NW1 0TU, UK.
Max-Planck-Institute for Infection Biology, Core Facility Protein Analysis, Campus Charité Mitte, Schumannstr. 21/22, D-10117 Berlin, Germany.
Microbiology (Reading). 2007 Feb;153(Pt 2):521-528. doi: 10.1099/mic.0.2006/001537-0.
Identification of protein translation start sites is largely a bioinformatics exercise, with relatively few confirmed by N-terminal sequencing. Translation start site determination is critical for defining both the protein sequence and the upstream DNA which may contain regulatory motifs. It is demonstrated here that translation start sites can be determined during routine protein identification, using MALDI-MS and MS/MS data to select the correct N-terminal sequence from a list of alternatives generated in silico. Applying the method to 13 proteins from Mycobacterium tuberculosis, 11 predicted translational start sites were confirmed, and two reassigned. The authors suggest that these data (be they confirmation or reassignments) are important for the annotation of both this genome and those of organisms with related genes. It was also shown that N-acetylation, reported to be rare in prokaryotes, was present in three of the 13 proteins (23 %), suggesting that in the mycobacteria this modification may be common, and an important regulator of protein function, although more proteins need to be analysed. This method can be performed with little or no additional experimental work during proteomics investigations.
蛋白质翻译起始位点的鉴定在很大程度上是一项生物信息学工作,通过N端测序确认的相对较少。翻译起始位点的确定对于定义蛋白质序列以及可能包含调控基序的上游DNA都至关重要。本文证明,在常规蛋白质鉴定过程中可以确定翻译起始位点,利用基质辅助激光解吸电离质谱(MALDI-MS)和串联质谱(MS/MS)数据从计算机模拟生成的备选列表中选择正确的N端序列。将该方法应用于结核分枝杆菌的13种蛋白质,11个预测的翻译起始位点得到确认,2个被重新分配。作者认为这些数据(无论是确认还是重新分配)对于该基因组以及具有相关基因的生物体基因组的注释都很重要。研究还表明,据报道在原核生物中很少见的N-乙酰化在13种蛋白质中的3种(23%)中存在,这表明在分枝杆菌中这种修饰可能很常见,并且是蛋白质功能的重要调节因子,尽管还需要分析更多的蛋白质。在蛋白质组学研究期间,该方法几乎不需要额外的实验工作或完全不需要额外的实验工作就能实施。