Department of Pathology and Laboratory Medicine, University of California Irvine, Irvine, California, USA.
PLoS One. 2012;7(9):e45178. doi: 10.1371/journal.pone.0045178. Epub 2012 Sep 28.
One of the major challenges in the development of prostate cancer prognostic biomarkers is the cellular heterogeneity in tissue samples. We developed an objective Cluster-Correlation (CC) analysis to identify gene expression changes in various cell types that are associated with progression. In the Cluster step, samples were clustered (unsupervised) based on the expression values of each gene through a mixture model combined with a multiple linear regression model in which cell-type percent data were used for decomposition. In the Correlation step, a Chi-square test was used to select potential prognostic genes. With CC analysis, we identified 324 significantly expressed genes (68 tumor and 256 stroma cell expressed genes) which were strongly associated with the observed biochemical relapse status. Significance Analysis of Microarray (SAM) was then utilized to develop a seven-gene classifier. The Classifier has been validated using two independent Data Sets. The overall prediction accuracy and sensitivity is 71% and 76%, respectively. The inclusion of the Gleason sum to the seven-gene classifier raised the prediction accuracy and sensitivity to 83% and 76% respectively based on independent testing. These results indicated that our prognostic model that includes cell type adjustments and using Gleason score and the seven-gene signature has some utility for predicting outcomes for prostate cancer for individual patients at the time of prognosis. The strategy could have applications for improving marker performance in other cancers and other diseases.
前列腺癌预后生物标志物开发面临的主要挑战之一是组织样本中的细胞异质性。我们开发了一种客观的聚类相关(CC)分析方法,以识别与进展相关的各种细胞类型的基因表达变化。在聚类步骤中,通过结合使用细胞类型百分比数据进行分解的混合模型和多元线性回归模型,根据每个基因的表达值对样本进行无监督聚类。在相关步骤中,使用卡方检验选择潜在的预后基因。通过 CC 分析,我们确定了 324 个显著表达的基因(68 个肿瘤和 256 个基质细胞表达基因),这些基因与观察到的生化复发状态密切相关。然后利用显著分析微阵列(SAM)开发了一个七基因分类器。该分类器已使用两个独立数据集进行了验证。总体预测准确率和灵敏度分别为 71%和 76%。基于独立测试,将 Gleason 总和纳入七基因分类器后,预测准确率和灵敏度分别提高到 83%和 76%。这些结果表明,我们的预后模型包括细胞类型调整以及使用 Gleason 评分和七个基因特征,对于在预后时预测个别患者的前列腺癌结局具有一定的应用价值。该策略可能适用于提高其他癌症和其他疾病中标志物的性能。