Dalian University of Technology, School of Software, China.
Chengdu Second People's Hospital, Department of Oncology, China.
Comput Math Methods Med. 2023 Jan 18;2023:7931321. doi: 10.1155/2023/7931321. eCollection 2023.
Overall survival (OS) in cancer is crucial for cancer treatment. Many machine learning methods have been applied to predict OS, but there are still the challenges of dealing with multiview data and overfitting. To overcome these problems, we propose a multiview deep forest (MVDF) in this paper. MVDF can learn the features of each view and fuse them with integrated learning and multiple kernel learning. Then, a gradient boost forest based on the information bottleneck theory is proposed to reduce redundant information and avoid overfitting. In addition, a pruning strategy for a cascaded forest is used to limit the impact of outlier data. Comprehensive experiments have been carried out on a data set from West China Hospital of Sichuan University and two public data sets. Results have demonstrated that our method outperforms the compared methods in predicting overall survival.
癌症的总生存期(OS)对癌症治疗至关重要。许多机器学习方法已被应用于预测 OS,但仍面临处理多视图数据和过拟合的挑战。为了克服这些问题,我们在本文中提出了一种多视图深度森林(MVDF)。MVDF 可以学习每个视图的特征,并通过集成学习和多核学习将它们融合。然后,基于信息瓶颈理论提出了一个梯度提升森林,以减少冗余信息并避免过拟合。此外,还使用级联森林的修剪策略来限制异常数据的影响。在四川大学华西医院的数据集中和两个公共数据集上进行了综合实验。结果表明,我们的方法在预测总生存期方面优于比较方法。