Wang Tao, Jiao Mengyu, Wang Xiaoxia
School of Mathematics and Physics, North China Electric Power University, Baoding 071003, China.
Hebei Key Laboratory of Physics and Energy Technology, North China Electric Power University, Baoding 071000, China.
Entropy (Basel). 2022 Aug 15;24(8):1124. doi: 10.3390/e24081124.
Link prediction is an important task in the field of network analysis and modeling, and predicts missing links in current networks and new links in future networks. In order to improve the performance of link prediction, we integrate global, local, and quasi-local topological information of networks. Here, a novel stacking ensemble framework is proposed for link prediction in this paper. Our approach employs random forest-based recursive feature elimination to select relevant structural features associated with networks and constructs a two-level stacking ensemble model involving various machine learning methods for link prediction. The lower level is composed of three base classifiers, i.e., logistic regression, gradient boosting decision tree, and XGBoost, and their outputs are then integrated with an XGBoost model in the upper level. Extensive experiments were conducted on six networks. Comparison results show that the proposed method can obtain better prediction results and applicability robustness.
链路预测是网络分析与建模领域中的一项重要任务,它能预测当前网络中缺失的链路以及未来网络中的新链路。为了提高链路预测的性能,我们整合了网络的全局、局部和准局部拓扑信息。在此,本文提出了一种用于链路预测的新型堆叠集成框架。我们的方法采用基于随机森林的递归特征消除来选择与网络相关的结构特征,并构建一个涉及各种机器学习方法的两级堆叠集成模型用于链路预测。下层由三个基分类器组成,即逻辑回归、梯度提升决策树和XGBoost,然后将它们的输出与上层的一个XGBoost模型进行集成。在六个网络上进行了广泛的实验。比较结果表明,所提出的方法能够获得更好的预测结果和适用性鲁棒性。