Hendrickson E L, Lamont R J, Hackett M
Departments of Chemical Engineering, Universityof Washington, Box 355014, Seattle, WA 98195, USA.
J Dent Res. 2008 Nov;87(11):1004-15. doi: 10.1177/154405910808701113.
Quantitative proteomic analysis of microbial systems generates large datasets that can be difficult and time-consuming to interpret. Fortunately, many of the data display and gene-clustering tools developed to analyze large transcriptome microarray datasets are also applicable to proteomes. Plots of abundance ratio vs. total signal or spectral counts can highlight regions of random error and putative change. Displaying data in the physical order of the genes in the genome sequence can highlight potential operons. At a basic level of transcriptional organization, identifying operons can give insights into regulatory pathways as well as provide corroborating evidence for proteomic results. Classification and clustering algorithms can group proteins together by their abundance changes under different conditions, helping to identify interesting expression patterns, but often work poorly with noisy data such as typically generated in a large-scale proteomic analysis. Biological interpretation can be aided more directly by overlaying differential protein abundance data onto metabolic pathways, indicating pathways with altered activities. More broadly, ontology tools detect altered levels of protein abundance for different metabolic pathways, molecular functions, and cellular localizations. In practice, pathway analysis and ontology are limited by the level of database curation associated with the organism of interest.
微生物系统的定量蛋白质组学分析会生成庞大的数据集,这些数据集可能难以解读且耗时。幸运的是,许多为分析大型转录组微阵列数据集而开发的数据显示和基因聚类工具也适用于蛋白质组。丰度比与总信号或光谱计数的图可以突出随机误差和假定变化的区域。按照基因组序列中基因的物理顺序显示数据可以突出潜在的操纵子。在转录组织的基本层面上,识别操纵子可以深入了解调控途径,并为蛋白质组学结果提供确证。分类和聚类算法可以根据蛋白质在不同条件下的丰度变化将它们分组在一起,有助于识别有趣的表达模式,但对于大规模蛋白质组学分析中通常产生的噪声数据往往效果不佳。通过将差异蛋白质丰度数据叠加到代谢途径上,可以更直接地辅助生物学解释,这表明了活性发生改变的途径。更广泛地说,本体工具可以检测不同代谢途径、分子功能和细胞定位的蛋白质丰度变化水平。实际上,途径分析和本体受到与感兴趣生物体相关的数据库编目水平的限制。