Yu Li-Rong, Conrads Thomas P, Uo Takuma, Kinoshita Yoshito, Morrison Richard S, Lucas David A, Chan King C, Blonder Josip, Issaq Haleem J, Veenstra Timothy D
Laboratory of Proteomics and Analytical Technologies, SAIC-Frederick, Inc., National Cancer Institute at Frederick, P.O. Box B, Frederick, MD 21702-1201, USA.
Mol Cell Proteomics. 2004 Sep;3(9):896-907. doi: 10.1074/mcp.M400034-MCP200. Epub 2004 Jun 30.
In this study, a multidimensional fractionation approach was combined with MS/MS to increase the capability of characterizing complex protein profiles of mammalian neuronal cells. Proteins extracted from primary cultures of cortical neurons were digested with trypsin followed by fractionation using strong cation exchange chromatography. Each of these fractions was analyzed by microcapillary reversed-phase LC-MS/MS. The analysis of the MS/MS data resulted in the identification of over 15,000 unique peptides from which 3,590 unique proteins were identified based on protein-specific peptide tags that are unique to a single protein in the searched database. In addition, 952 protein clusters were identified using cluster analysis of the proteins identified by the peptides not unique to a single protein. This identification revealed that a minimum of 4,542 proteins could be identified from this experiment, representing approximately 16% of all known mouse proteins. An evaluation of the number of false-positive identifications was undertaken by searching the entire MS/MS dataset against a database containing the sequences of over 12,000 proteins from archaea. This analysis allowed a systematic determination of the level of confidence in the identification of peptides as a function of SEQUEST cross correlation (Xcorr) and delta correlation (DeltaCn) scores. Correlation charts were also constructed to show the number of unique peptides identified for proteins from specific classes. The results show that low-abundance proteins involved in signal transduction and transcription are generally identified by fewer peptides than high-abundance proteins that play a role in maintaining mammalian cellular structure and motility. The results presented here provide the broadest proteome coverage for a mammalian cell to date and show that MS-based proteomics has the potential to provide high coverage of the proteins expressed within a cell.
在本研究中,将多维分级分离方法与串联质谱(MS/MS)相结合,以提高表征哺乳动物神经元细胞复杂蛋白质谱的能力。从皮质神经元原代培养物中提取的蛋白质先用胰蛋白酶消化,然后使用强阳离子交换色谱进行分级分离。这些级分中的每一个都通过微毛细管反相液相色谱-串联质谱进行分析。对串联质谱数据的分析导致鉴定出超过15,000个独特的肽段,基于搜索数据库中单个蛋白质特有的蛋白质特异性肽标签,从中鉴定出3590个独特的蛋白质。此外,使用对单个蛋白质不独特的肽段鉴定出的蛋白质进行聚类分析,鉴定出952个蛋白质簇。这一鉴定结果表明,从该实验中至少可以鉴定出4542种蛋白质,约占所有已知小鼠蛋白质的16%。通过将整个串联质谱数据集与包含来自古细菌的12,000多种蛋白质序列的数据库进行比对,对假阳性鉴定的数量进行了评估。该分析允许系统地确定肽段鉴定的置信水平与SEQUEST交叉相关(Xcorr)和δ相关(DeltaCn)分数的函数关系。还构建了相关图表以显示从特定类别蛋白质中鉴定出的独特肽段数量。结果表明,与参与维持哺乳动物细胞结构和运动的高丰度蛋白质相比,参与信号转导和转录的低丰度蛋白质通常由较少的肽段鉴定出来。此处呈现的结果提供了迄今为止哺乳动物细胞最广泛的蛋白质组覆盖范围,并表明基于质谱的蛋白质组学有潜力提供细胞内表达蛋白质的高覆盖率。