Wang Kung-Min, Wang Kung-Jeng, Makond Bunjira
Department of Surgery, Shin-Kong Wu Ho-Su Memorial Hospital, Taipei, Taiwan, ROC.
Department of Industrial Management, National Taiwan University of Science and Technology, Taipei, 106, Taiwan, ROC.
Comput Methods Programs Biomed. 2020 Nov;196:105686. doi: 10.1016/j.cmpb.2020.105686. Epub 2020 Aug 1.
Multiple primary cancers significantly threat patient survivability. Predicting the survivability of patients with two cancers is challenging because its stochastic pattern relates with numerous variables.
In this study, a Bayesian network (BN) model was proposed to describe the occurrence of two primary cancers and predict the five-year survivability of patients using probabilistic evidence. Eleven types of major primary cancers and contingent occurrences of secondary cancers were investigated. A nationwide two-cancer database involving 7,845 patients in Taiwan was investigated. The BN topology is rigorously examined and imbalanced dataset is processed by the synthetic minority oversampling technique. The proposed BN survivability prognosis model was compared with benchmark approaches.
The proposed model significantly outperformed the back-propagation neural network, logistic regression, support vector machine, and naïve Bayes in terms of sensitivity, which is a critical performance index for the non-survival group.
Using the proposed BN model, one can estimate the posterior probabilities for every query provided appropriate prior evidences. The potential survivability information of patients, treatment effects, and socio-demographics factor effects predicted by the proposed model can help in cancer treatment assessment and cancer development monitoring.
多种原发性癌症严重威胁患者的生存能力。预测患有两种癌症患者的生存能力具有挑战性,因为其随机模式与众多变量相关。
在本研究中,提出了一种贝叶斯网络(BN)模型来描述两种原发性癌症的发生情况,并使用概率证据预测患者的五年生存能力。研究了11种主要原发性癌症以及继发性癌症的偶然发生情况。对台湾一个包含7845名患者的全国性双癌数据库进行了调查。对BN拓扑结构进行了严格检查,并通过合成少数过采样技术处理了不平衡数据集。将所提出的BN生存能力预后模型与基准方法进行了比较。
在所提出模型的敏感性方面,该模型显著优于反向传播神经网络、逻辑回归、支持向量机和朴素贝叶斯,敏感性是针对未存活组的关键性能指标。
使用所提出的BN模型,只要提供适当的先验证据,就可以估计每个查询的后验概率。所提出模型预测的患者潜在生存能力信息、治疗效果以及社会人口统计学因素影响,有助于癌症治疗评估和癌症发展监测。