Department of Biomedical Informatics, ‡Vanderbilt-Ingram Cancer Center, and §Department of Cancer Biology, Vanderbilt University School of Medicine , Nashville, Tennessee 37232, United States.
J Proteome Res. 2014 Jun 6;13(6):2715-23. doi: 10.1021/pr500194t. Epub 2014 May 12.
Mass spectrometry (MS)-based shotgun proteomics is an effective technology for global proteome profiling. The ultimate goal is to assign tandem MS spectra to peptides and subsequently infer proteins and their abundance. In addition to database searching and protein assembly algorithms, computational approaches have been developed to integrate genomic, transcriptomic, and interactome information to improve peptide and protein identification. Earlier efforts focus primarily on making databases more comprehensive using publicly available genomic and transcriptomic data. More recently, with the increasing affordability of the Next Generation Sequencing (NGS) technologies, personalized protein databases derived from sample-specific genomic and transcriptomic data have emerged as an attractive strategy. In addition, incorporating interactome data not only improves protein identification but also puts identified proteins into their functional context and thus facilitates data interpretation. In this paper, we survey the major integrative bioinformatics approaches that have been developed during the past decade and discuss their merits and demerits.
基于质谱(MS)的鸟枪法蛋白质组学是一种用于进行全局蛋白质组分析的有效技术。其最终目标是将串联 MS 谱分配给肽,进而推断蛋白质及其丰度。除了数据库搜索和蛋白质组装算法外,还开发了计算方法来整合基因组、转录组和互作组信息,以提高肽和蛋白质的鉴定。早期的工作主要集中在使用公共基因组和转录组数据来使数据库更加全面。最近,随着下一代测序(NGS)技术的成本降低,从特定样本的基因组和转录组数据中衍生的个性化蛋白质数据库已成为一种很有吸引力的策略。此外,整合互作组数据不仅可以提高蛋白质的鉴定,还可以将鉴定出的蛋白质置于其功能背景下,从而便于解释数据。在本文中,我们调查了过去十年中开发的主要综合生物信息学方法,并讨论了它们的优缺点。