Kim Woojae, Kim Ku Sang, Park Rae Woong
Department of Public Health and Medical Administration, Dongyang University, Yeongju, Korea.
Department of Biomedical Informatics, Ajou University School of Medicine, Suwon, Korea.; Breast Cancer Center, Ulsan City Hospital, Ulsan, Korea.
Healthc Inform Res. 2016 Apr;22(2):89-94. doi: 10.4258/hir.2016.22.2.89. Epub 2016 Apr 30.
Breast cancer has a high rate of recurrence, resulting in the need for aggressive treatment and close follow-up. However, previously established classification guidelines, based on expert panels or regression models, are controversial. Prediction models based on machine learning show excellent performance, but they are not widely used because they cannot explain their decisions and cannot be presented on paper in the way that knowledge is customarily represented in the clinical world. The principal objective of this study was to develop a nomogram based on a naïve Bayesian model for the prediction of breast cancer recurrence within 5 years after breast cancer surgery.
The nomogram can provide a visual explanation of the predicted probabilities on a sheet of paper. We used a data set from a Korean tertiary teaching hospital of 679 patients who had undergone breast cancer surgery between 1994 and 2002. Seven prognostic factors were selected as independent variables for the model.
The accuracy was 80%, and the area under the receiver operating characteristics curve (AUC) of the model was 0.81.
The nomogram can be easily used in daily practice to aid physicians and patients in making appropriate treatment decisions after breast cancer surgery.
乳腺癌复发率高,因此需要积极治疗和密切随访。然而,先前基于专家小组或回归模型制定的分类指南存在争议。基于机器学习的预测模型表现出色,但由于无法解释其决策过程,且不能以临床领域惯常表示知识的方式呈现在纸上,所以未得到广泛应用。本研究的主要目的是开发一种基于朴素贝叶斯模型的列线图,用于预测乳腺癌手术后5年内的复发情况。
该列线图可在一张纸上直观地解释预测概率。我们使用了韩国一家三级教学医院1994年至2002年间679例接受乳腺癌手术患者的数据集。选择七个预后因素作为模型的自变量。
准确率为80%,模型的受试者工作特征曲线(AUC)下面积为0.81。
该列线图可在日常实践中轻松使用,以帮助医生和患者在乳腺癌手术后做出适当的治疗决策。