Industrial Center, Shenzhen Polytechnic, Shenzhen, Guangdong 518055, China ; Department of Genitourinary Medical Oncology, Unit 1374, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, TX 77030-4009, USA.
Department of Genitourinary Medical Oncology, Unit 1374, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, TX 77030-4009, USA.
Comput Math Methods Med. 2013;2013:917502. doi: 10.1155/2013/917502. Epub 2013 Dec 4.
Predicting disease progression is one of the most challenging problems in prostate cancer research. Adding gene expression data to prediction models that are based on clinical features has been proposed to improve accuracy. In the current study, we applied a logistic regression (LR) model combining clinical features and gene co-expression data to improve the accuracy of the prediction of prostate cancer progression. The top-scoring pair (TSP) method was used to select genes for the model. The proposed models not only preserved the basic properties of the TSP algorithm but also incorporated the clinical features into the prognostic models. Based on the statistical inference with the iterative cross validation, we demonstrated that prediction LR models that included genes selected by the TSP method provided better predictions of prostate cancer progression than those using clinical variables only and/or those that included genes selected by the one-gene-at-a-time approach. Thus, we conclude that TSP selection is a useful tool for feature (and/or gene) selection to use in prognostic models and our model also provides an alternative for predicting prostate cancer progression.
预测疾病进展是前列腺癌研究中最具挑战性的问题之一。向基于临床特征的预测模型中添加基因表达数据,被提议用于提高准确性。在当前研究中,我们应用逻辑回归(LR)模型结合临床特征和基因共表达数据,以提高前列腺癌进展预测的准确性。使用最高分配对(TSP)方法选择用于模型的基因。所提出的模型不仅保留了 TSP 算法的基本性质,而且还将临床特征纳入了预后模型中。通过迭代交叉验证的统计推断,我们证明了包含 TSP 方法选择的基因的预测 LR 模型比仅使用临床变量和/或包含通过逐个基因选择方法选择的基因的预测模型提供了更好的前列腺癌进展预测。因此,我们得出结论,TSP 选择是用于预后模型中的特征(和/或基因)选择的有用工具,我们的模型也为预测前列腺癌进展提供了另一种选择。