Jordan Bryen A., Fernholz Brian D., Neubert Thomas A., Ziff Edward B.
Most questions in modern cell biology have been approached using reductionist methods, i.e., by studying one gene, one protein or one specific protein modification at a time. This reductionism has been necessary, given the complexity of biological systems and lack of tools for developing more integrative methodologies. It is, nonetheless, responsible for most of our knowledge of biological systems. However, a more thorough understanding of complex systems will require the simultaneous observation of their many characteristics. Proteomic methodologies, especially those that are mass spectrometry based, have enabled the large-scale study of protein modifications, protein abundance and protein interactors. Only by using such integrative approaches will we develop rigorous models that more accurately reflect actual cellular processes. Proteomics is the study of the set or subset of all proteins expressed in an organism, tissue or cell culture. While proteomics is by no means a novel idea, recent advances in mass spectrometry and the determination of the complete genomic sequences of several organisms greatly enhance its usefulness [1–3]. Only by mass spectrometry can one efficiently identify the individual protein components derived from protein complexes, and only large-scale genomic data allows amino acid sequences to be reliably assigned to peptide fragments identified by mass spectrometry. Most proteomics-based studies today seek to answer four basic questions: Which proteins were found? What is the relative abundance of the proteins found? What modifications were found on the proteins? Which proteins physically associate with one another? One type of proteomics experiment, protein profiling, involves the identification of the proteins present in a complex, cell culture, tissue or organism. However, unlike genomes, proteomes are dynamic with changes that reflect their current functional state. Thus, analyzing changes in the abundance and modifications of proteins in a complex could be considered a functional study (functional proteomics). While several proteomic-based methods exist, most are based on a standard sequence of experiments: a protein complex is obtained by biochemical prefraction-ation or affinity purification; the sample is subjected to enzymatic digestion (usually trypsin); the resulting peptides can be labeled for future analysis; and the peptides are analyzed by mass spectrometry to determine their identity and characteristics. At the core of most modern proteomic studies lies mass spectrometry (MS) [2,3]. A mass spectrometer is, in essence, a detector that measures the mass-to-charge ratio (/) of ionized particles and detects the relative number of ions at each / ratio. In general, peptide fragments generated by enzymatic cleavage (typically trypsin) are ionized most commonly by matrix-assisted laser desorption/ionization (MALDI) or electrospray ionization (ESI) and injected into a mass spectrometer. In most cases, peptides with specific / ratios can be selected by quadrupole exclusion or time of flight (TOF) selection. These peptides can then be fragmented by cleavage at peptide bonds via collision with a gaseous matrix. The masses of the fragments can then be determined by a second mass analysis (often by TOF or ion trap MS) to determine the amino acid sequence of peptides. Prefractionation of samples by nanoflow high-performance liquid chromatography (HPLC) and increases in sensitivity and accuracy of mass spectrometers allows for the identification of up to 1000 or more peptides per sample. In this way, complex mixtures can be resolved and the protein compositions elucidated.
现代细胞生物学中的大多数问题都是采用还原论方法来研究的,即一次只研究一个基因、一种蛋白质或一种特定的蛋白质修饰。鉴于生物系统的复杂性以及缺乏开发更综合方法的工具,这种还原论是必要的。尽管如此,我们对生物系统的大部分认知都归功于它。然而,要更全面地理解复杂系统,就需要同时观察它们的众多特征。蛋白质组学方法,尤其是基于质谱的方法,使得对蛋白质修饰、蛋白质丰度和蛋白质相互作用分子进行大规模研究成为可能。只有通过使用这种综合方法,我们才能建立更严格的模型,更准确地反映实际的细胞过程。蛋白质组学是对生物体、组织或细胞培养物中表达的所有蛋白质的集合或子集的研究。虽然蛋白质组学绝不是一个新想法,但质谱技术的最新进展以及几种生物体完整基因组序列的测定极大地提高了它的实用性[1 - 3]。只有通过质谱分析,才能有效地鉴定出蛋白质复合物中的各个蛋白质成分,而且只有大规模的基因组数据才能将氨基酸序列可靠地分配给通过质谱鉴定出的肽片段。如今,大多数基于蛋白质组学的研究试图回答四个基本问题:发现了哪些蛋白质?所发现蛋白质的相对丰度是多少?在蛋白质上发现了哪些修饰?哪些蛋白质彼此发生物理关联?一种蛋白质组学实验,即蛋白质谱分析,涉及鉴定存在于复合物、细胞培养物、组织或生物体中的蛋白质。然而,与基因组不同,蛋白质组是动态的,其变化反映了它们当前的功能状态。因此,分析复合物中蛋白质丰度和修饰的变化可被视为一项功能研究(功能蛋白质组学)。虽然存在几种基于蛋白质组学的方法,但大多数都基于标准的实验流程:通过生化预分级或亲和纯化获得蛋白质复合物;对样品进行酶解(通常用胰蛋白酶);所得肽段可进行标记以便后续分析;然后通过质谱分析肽段以确定其身份和特征。大多数现代蛋白质组学研究的核心是质谱(MS)[2,3]。本质上,质谱仪是一种探测器,它测量离子化粒子的质荷比(m/z),并检测每个m/z比值下离子的相对数量。一般来说,酶解产生的肽片段(通常由胰蛋白酶产生)最常通过基质辅助激光解吸/电离(MALDI)或电喷雾电离(ESI)进行离子化,然后注入质谱仪。在大多数情况下,具有特定m/z比值的肽段可通过四极杆排除或飞行时间(TOF)选择来挑选。然后这些肽段可通过与气态基质碰撞在肽键处裂解而碎片化。然后通过第二次质谱分析(通常用TOF或离子阱质谱)确定片段的质量,以确定肽段的氨基酸序列。通过纳流高效液相色谱(HPLC)对样品进行预分级以及质谱仪灵敏度和准确性的提高,使得每个样品能够鉴定出多达1000个或更多的肽段。通过这种方式,可以解析复杂混合物并阐明蛋白质组成。