College of Mathematics and Computer Science, Northwest Minzu University, Lanzhou, China.
Key Laboratory of China's Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, Lanzhou, China.
PLoS One. 2024 Jan 2;19(1):e0287385. doi: 10.1371/journal.pone.0287385. eCollection 2024.
Link prediction in complex network is an important issue in network science. Recently, various structure-based similarity methods have been proposed. Most of algorithms are used to analyze the topology of the network, and to judge whether there is any connection between nodes by calculating the similarity of two nodes. However, it is necessary to get the extra attribute information of the node in advance, which is very difficult. Compared to the difficulty in obtaining the attribute information of the node itself, the topology of the network is easy to obtain, and the structure of the network is an inherent attribute of the network and is more reliable. The proposed method measures kinds of similarity between nodes based on non-trivial eigenvectors of Laplacian Matrix of the network, such as Euclidean distance, Manhattan distance and Angular distance. Then the classical machine learning algorithm can be used for classification prediction (two classification in this case), so as to achieve the purpose of link prediction. Based on this process, a spectral analysis-based link prediction algorithm is proposed, and named it LPbSA (Link Prediction based on Spectral Analysis). The experimental results on seven real-world networks demonstrated that LPbSA has better performance on Accuracy, Precision, Receiver Operating Curve(ROC), area under the ROC curve(AUC), Precision and Recall curve(PR curve) and balanced F Score(F-score curve) evaluation metrics than other ten classic methods.
在复杂网络中,链路预测是网络科学中的一个重要问题。最近,已经提出了各种基于结构的相似性方法。大多数算法用于分析网络的拓扑结构,并通过计算两个节点的相似性来判断节点之间是否存在连接。但是,需要预先获得节点的额外属性信息,这非常困难。与获取节点自身属性信息的难度相比,网络的拓扑结构更容易获得,并且网络的结构是网络的固有属性,更加可靠。
所提出的方法基于网络拉普拉斯矩阵的非平凡特征向量来测量节点之间的各种相似性,例如欧几里得距离、曼哈顿距离和角度距离。然后可以使用经典的机器学习算法进行分类预测(在这种情况下是两种分类),从而达到链路预测的目的。
基于这个过程,提出了一种基于谱分析的链路预测算法,并将其命名为 LPbSA(基于谱分析的链路预测)。在七个真实网络上的实验结果表明,LPbSA 在准确性、精度、接收器工作曲线(ROC)、ROC 曲线下面积(AUC)、精度和召回率曲线(PR 曲线)以及平衡 F 分数曲线(F-score 曲线)等评估指标上的性能均优于其他十种经典方法。