West M, Blanchette C, Dressman H, Huang E, Ishida S, Spang R, Zuzan H, Olson J A, Marks J R, Nevins J R
Institute of Statistics and Decision Sciences, Duke University, Durham, NC 27708, USA.
Proc Natl Acad Sci U S A. 2001 Sep 25;98(20):11462-7. doi: 10.1073/pnas.201162998. Epub 2001 Sep 18.
Prognostic and predictive factors are indispensable tools in the treatment of patients with neoplastic disease. For the most part, such factors rely on a few specific cell surface, histological, or gross pathologic features. Gene expression assays have the potential to supplement what were previously a few distinct features with many thousands of features. We have developed Bayesian regression models that provide predictive capability based on gene expression data derived from DNA microarray analysis of a series of primary breast cancer samples. These patterns have the capacity to discriminate breast tumors on the basis of estrogen receptor status and also on the categorized lymph node status. Importantly, we assess the utility and validity of such models in predicting the status of tumors in crossvalidation determinations. The practical value of such approaches relies on the ability not only to assess relative probabilities of clinical outcomes for future samples but also to provide an honest assessment of the uncertainties associated with such predictive classifications on the basis of the selection of gene subsets for each validation analysis. This latter point is of critical importance in the ability to apply these methodologies to clinical assessment of tumor phenotype.
预后和预测因素是肿瘤疾病患者治疗中不可或缺的工具。在很大程度上,这些因素依赖于一些特定的细胞表面、组织学或大体病理特征。基因表达分析有潜力用数千个特征来补充以前为数不多的明显特征。我们已经开发出贝叶斯回归模型,该模型基于对一系列原发性乳腺癌样本进行DNA微阵列分析得出的基因表达数据提供预测能力。这些模式有能力根据雌激素受体状态以及分类后的淋巴结状态来区分乳腺肿瘤。重要的是,我们在交叉验证测定中评估此类模型在预测肿瘤状态方面的效用和有效性。此类方法的实际价值不仅依赖于评估未来样本临床结果的相对概率的能力,还依赖于基于每次验证分析中基因子集的选择,对与此类预测分类相关的不确定性进行如实评估的能力。后一点对于将这些方法应用于肿瘤表型的临床评估的能力至关重要。