Gründner Julian, Prokosch Hans-Ulrich, Stürzl Michael, Croner Roland, Christoph Jan, Toddenroth Dennis
Medical Informatics, Friedrich-Alexander University, Erlangen-Nürnberg, Erlangen, Germany.
Division of Molecular and Experimental Surgery, Department of Surgery, Friedrich-Alexander University, Erlangen-Nürnberg , Erlangen, Germany.
Stud Health Technol Inform. 2018;247:101-105.
Using gene markers and other patient features to predict clinical outcomes plays a vital role in enhancing clinical decision making and improving prognostic accuracy. This work uses a large set of colorectal cancer patient data to train predictive models using machine learning methods such as random forest, general linear model, and neural network for clinically relevant outcomes including disease free survival, survival, radio-chemotherapy response (RCT-R) and relapse. The most successful predictive models were created for dichotomous outcomes like relapse and RCT-R with accuracies of 0.71 and 0.70 on blinded test data respectively. The best prediction models regarding overall survival and disease-free survival had C-Index scores of 0.86 and 0.76 respectively. These models could be used in the future to aid a decision for or against chemotherapy and improve survival prognosis. We propose that future work should focus on creating reusable frameworks and infrastructure for training and delivering predictive models to physicians, so that they could be readily applied to other diseases in practice and be continuously developed integrating new data.
使用基因标记和其他患者特征来预测临床结果,在加强临床决策和提高预后准确性方面发挥着至关重要的作用。这项工作使用大量结直肠癌患者数据,通过随机森林、一般线性模型和神经网络等机器学习方法,针对包括无病生存期、总生存期、放化疗反应(RCT-R)和复发等临床相关结果训练预测模型。针对复发和RCT-R等二分结果创建了最成功的预测模型,在盲法测试数据上的准确率分别为0.71和0.70。关于总生存期和无病生存期的最佳预测模型的C指数得分分别为0.86和0.76。这些模型未来可用于辅助化疗决策并改善生存预后。我们建议未来的工作应专注于创建可重复使用的框架和基础设施,用于训练预测模型并将其提供给医生,以便它们能够在实践中轻松应用于其他疾病,并结合新数据不断发展。