Instituto de Investigaciones Biomédicas Alberto Sols, Spanish National Research Council (CSIC), Madrid, Spain.
Proteomics. 2011 Feb;11(4):590-603. doi: 10.1002/pmic.201000287. Epub 2011 Jan 13.
Recent reviews pinpointed the enormous diversity of proteins found in living organisms, especially in higher eukaryotes. Protein diversity is driven through three main processes: first, at deoxyribonucleic acid (DNA) level (i.e. gene polymorphisms), second, at precursor messenger ribonucleic acid (pre-mRNA) or messenger ribonucleic acid (mRNA) level (i.e. alternative splicing, also termed as differential splicing) and, finally, at the protein level (i.e. PTM). Current proteomic technologies allow the identification, characterization and quantitation of up to several thousands of proteins in a single experiment. Nevertheless, the identification and characterization of protein species using these technologies are still hampered. Here, we review the use of the terms "protein species" and "protein isoform." We evidence that the appropriate selection of the database used for searches can impede or facilitate the identification of protein species. We also describe examples where protein identification search engines systematically fail in the attribution of protein species. We briefly review the characterization of protein species using proteomic technologies including gel-based, gel-free, bottom-up and top-down analysis and discuss their limitations. As an example, we discuss the theoretical characterization of the two human choline kinase species, α-1 and α-2, sharing the same catalytic activity but generated by alternative splicing on CHKA gene.
最近的评论指出了在生物体中发现的蛋白质的巨大多样性,尤其是在高等真核生物中。蛋白质多样性是通过三个主要过程驱动的:首先,在脱氧核糖核酸(DNA)水平上(即基因多态性),其次,在前体信使核糖核酸(pre-mRNA)或信使核糖核酸(mRNA)水平上(即选择性剪接,也称为差异剪接),最后,在蛋白质水平上(即 PTM)。当前的蛋白质组学技术允许在单个实验中鉴定、描述和定量多达几千种蛋白质。然而,使用这些技术鉴定和描述蛋白质种类仍然存在障碍。在这里,我们回顾了“蛋白质种类”和“蛋白质同工型”这两个术语的使用。我们证明,适当选择用于搜索的数据库可以阻碍或促进蛋白质种类的鉴定。我们还描述了一些例子,其中蛋白质鉴定搜索引擎在赋予蛋白质种类时系统地失败。我们简要回顾了使用蛋白质组学技术(包括基于凝胶的、无凝胶的、自上而下和自下而上的分析)来描述蛋白质种类,并讨论了它们的局限性。作为一个例子,我们讨论了具有相同催化活性但由 CHKA 基因的选择性剪接产生的两种人类胆碱激酶同工型α-1 和 α-2 的理论特征。