Russeth Kevin P, Higgins LeeAnn, Andrews Matthew T
Department of Biology, University of Minnesota Duluth, 1035 Kirby Drive, 55812, USA.
J Proteome Res. 2006 Apr;5(4):829-39. doi: 10.1021/pr050306a.
A major challenge in the life sciences is the extraction of detailed molecular information from plants and animals that are not among the handful of exhaustively studied "model organisms." As a consequence, certain species with novel phenotypes are often ignored due to the lack of searchable databases, tractable genetics, stock centers, and more recently, a sequenced genome. Characterization of phenotype at the molecular level commonly relies on the identification of differentially expressed proteins by combining database searching with tandem mass spectrometry (MS) of peptides derived from protein fragmentation. However, the identification of short peptides from nonmodel organisms can be hampered by the lack of sufficient amino acid sequence homology with proteins in existing databases; therefore, a database search strategy that encompasses both identity and homology can provide stronger evidence than a single search alone. The use of multiple algorithms for database searches may also increase the probability of correct protein identification since it is unlikely that each program would produce false negative or positive hits for the same peptides. In this study, four software packages, Mascot, Pro ID, Sequest, and Pro BLAST, were compared in their ability to identify proteins from the thirteen-lined ground squirrel (Spermophilus tridecemlineatus), a hibernating mammal that lacks a completely sequenced genome. Our results show similarities as well as the degree of variability among different software packages when the identical protein database is searched. In the process of this study, we identified the up-regulation of succinyl CoA-transferase (SCOT) in the heart of hibernators. SCOT is the rate-limiting enzyme in the catabolism of ketone bodies, an important alternative fuel source during hibernation.
生命科学中的一个主要挑战是,从少数经过详尽研究的“模式生物”之外的动植物中提取详细的分子信息。因此,某些具有新表型的物种往往因缺乏可搜索的数据库、易于处理的遗传学、保种中心以及最近的测序基因组而被忽视。在分子水平上对表型进行表征通常依赖于通过将数据库搜索与源自蛋白质片段化的肽段串联质谱(MS)相结合来鉴定差异表达的蛋白质。然而,从非模式生物中鉴定短肽可能会因与现有数据库中的蛋白质缺乏足够的氨基酸序列同源性而受到阻碍;因此,一种兼顾同一性和同源性的数据库搜索策略比单独的单一搜索能提供更有力的证据。使用多种算法进行数据库搜索也可能增加正确鉴定蛋白质的概率,因为不太可能每个程序对相同的肽段都产生假阴性或假阳性结果。在本研究中,对四种软件包Mascot、Pro ID、Sequest和Pro BLAST从十三条纹地松鼠(Spermophilus tridecemlineatus)中鉴定蛋白质的能力进行了比较,十三条纹地松鼠是一种缺乏完整测序基因组的冬眠哺乳动物。我们的结果显示,在搜索相同蛋白质数据库时,不同软件包之间存在相似性以及变异性程度。在本研究过程中,我们鉴定出冬眠动物心脏中琥珀酰辅酶A转移酶(SCOT)的上调。SCOT是酮体分解代谢中的限速酶,酮体是冬眠期间一种重要的替代燃料来源。