The Beatson Institute for Cancer Research, Glasgow, Scotland, United Kingdom.
J Clin Oncol. 2010 Jun 10;28(17):2881-8. doi: 10.1200/JCO.2009.24.8724. Epub 2010 May 10.
To identify functionally related prognostic gene sets for head and neck squamous cell carcinoma (HNSCC) by unsupervised statistical analysis of microarray data.
Microarray analysis was performed on 14 normal oral epithelium and 71 HNSCCs from patients with outcome data. Spectral clustering (SC) analysis of the data set identified multiple vectors representing distinct aspects of gene expression heterogeneity between samples. Gene ontology (GO) analysis of vector gene lists identified gene sets significantly enriched within defined biologic pathways. The prognostic significance of these was established by Cox survival analysis.
The most influential SC vectors were V2 and V3. V2 separated normal from tumor samples. GO analysis of V2 gene lists identified pathways with heterogeneous expression between HNSCCs, notably focal adhesion (FA)/extracellular matrix remodeling and cytokine-cytokine receptor (CR) interactions. Similar analysis of V3 gene lists identified further heterogeneity in CR pathways. V2CR genes represent an innate immune response, whereas high expression of V3CR genes represented an adaptive immune response that was not dependent on human papillomavirus status. Survival analysis demonstrated that the FA gene set was prognostic of poor outcome, whereas classification for adaptive immune response by the CR gene set was prognostic of good outcome. A combined FA&CR model dramatically exceeded the performance of current clinical classifiers (P < .001 in our cohort and, importantly, P = .007 in an independent cohort of 60 HNSCCs).
The application of SC and GO algorithms to HNSCC microarray data identified gene sets highly significant for predicting patient outcome. Further large-scale studies will establish the usefulness of these gene sets in the clinical management of HNSCC.
通过对微阵列数据进行无监督统计分析,鉴定与头颈部鳞状细胞癌(HNSCC)相关的功能预后基因集。
对 14 例正常口腔上皮和 71 例具有生存数据的 HNSCC 患者的微阵列数据进行分析。对数据集进行光谱聚类(SC)分析,确定了多个代表样本间基因表达异质性不同方面的向量。对向量基因列表进行基因本体论(GO)分析,确定了在特定生物途径中显著富集的基因集。通过 Cox 生存分析确定这些基因集的预后意义。
最具影响力的 SC 向量是 V2 和 V3。V2 将正常组织与肿瘤样本区分开来。V2 基因列表的 GO 分析确定了 HNSCC 之间表达存在差异的途径,特别是焦点黏附(FA)/细胞外基质重塑和细胞因子-细胞因子受体(CR)相互作用。对 V3 基因列表的类似分析确定了 CR 途径的进一步异质性。V2CR 基因代表先天免疫反应,而 V3CR 基因的高表达代表不依赖人乳头瘤病毒状态的适应性免疫反应。生存分析表明,FA 基因集与预后不良相关,而 CR 基因集的适应性免疫反应分类与预后良好相关。FA&CR 联合模型显著优于当前临床分类器的性能(在我们的队列中 P <.001,重要的是,在 60 例 HNSCC 的独立队列中 P =.007)。
将 SC 和 GO 算法应用于 HNSCC 微阵列数据,鉴定出与预测患者预后高度相关的基因集。进一步的大规模研究将确定这些基因集在 HNSCC 临床管理中的有用性。