Luo Yi, McShan Daniel, Ray Dipankar, Matuszak Martha, Jolly Shruti, Lawrence Theodore, Ming Kong Feng, Ten Haken Randall, El Naqa Issam
Department of Radiation Oncology, University of Michigan, Ann Arbor, USA,
Department of Radiation Oncology, University of Michigan, Ann Arbor, USA.
IEEE Trans Radiat Plasma Med Sci. 2019 Mar;3(2):232-241. doi: 10.1109/TRPMS.2018.2832609. Epub 2018 May 2.
The purpose of this study is to demonstrate that a Bayesian network (BN) approach can explore hierarchical biophysical relationships that influence tumor response and predict tumor local control (LC) in non-small-cell lung cancer (NSCLC) patients before and during radiotherapy from a large-scale dataset. Our BN building approach has two steps. First, relevant biophysical predictors influencing LC before and during the treatment are selected through an extended Markov blanket (eMB) method. From this eMB process, the most robust BN structure for LC prediction was found via a wrapper-based approach. Sixty-eight patients with complete feature information were used to identify a full BN model for LC prediction before and during the treatment. Fifty more recent patients with some missing information were reserved for independent testing of the developed pre- and during-therapy BNs. A nested cross-validation (N-CV) was developed to evaluate the performance of the two-step BN approach. An ensemble BN model is generated from the N-CV sampling process to assess its similarity with the corresponding full BN model, and thus evaluate the sensitivity of our BN approach. Our results show that the proposed BN development approach is a stable and robust approach to identify hierarchical relationships among biophysical features for LC prediction. Furthermore, BN predictions can be improved by incorporating during treatment information.
本研究的目的是证明贝叶斯网络(BN)方法能够探索影响肿瘤反应的层次生物物理关系,并在放疗前和放疗期间从大规模数据集中预测非小细胞肺癌(NSCLC)患者的肿瘤局部控制(LC)。我们构建BN的方法有两个步骤。首先,通过扩展马尔可夫毯(eMB)方法选择治疗前和治疗期间影响LC的相关生物物理预测因子。通过这个eMB过程,通过基于包装器的方法找到了用于LC预测的最稳健的BN结构。68例具有完整特征信息的患者被用于确定治疗前和治疗期间LC预测的完整BN模型。另外50例有一些缺失信息的近期患者被留作独立测试所开发的治疗前和治疗期间BN。开发了一种嵌套交叉验证(N-CV)来评估两步BN方法的性能。从N-CV采样过程中生成一个集成BN模型,以评估其与相应完整BN模型的相似性,从而评估我们BN方法的敏感性。我们的结果表明,所提出的BN开发方法是一种稳定且稳健的方法,用于识别生物物理特征之间的层次关系以进行LC预测。此外,通过纳入治疗期间的信息可以改善BN预测。