Inconsistency among evaluation metrics in link prediction.

Authors

Bi Yilin, Jiao Xinshan, Lee Yan-Li, Zhou Tao

Affiliations

CompleX Lab, School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China.

School of Computer and Software Engineering, Xihua University, Chengdu 610039, China.

Publication information

PNAS Nexus. 2024 Nov 6;3(11):pgae498. doi: 10.1093/pnasnexus/pgae498. eCollection 2024 Nov.

DOI: 10.1093/pnasnexus/pgae498
PMID: 39564572
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11574622/

Abstract

Link prediction is a paradigmatic and challenging problem in network science, which aims to predict missing links, future links, and temporal links based on known topology. Along with the increasing number of link prediction algorithms, a critical yet previously ignored risk is that the evaluation metrics for algorithm performance are usually chosen at will. This paper implements extensive experiments on hundreds of real networks and 26 well-known algorithms, revealing significant inconsistency among evaluation metrics, namely different metrics probably produce remarkably different rankings of algorithms. Therefore, we conclude that any single metric cannot comprehensively or credibly evaluate algorithm performance. In terms of information content, we suggest the usage of at least two metrics: one is the area under the receiver operating characteristic curve, and the other is one of the following three candidates, say the area under the precision-recall curve, the area under the precision curve, and the normalized discounted cumulative gain. When the data are imbalanced, say the number of negative samples significantly outweighs the number of positive samples, the area under the generalized Receiver Operating Characteristic curve should also be used. In addition, as we have proved the essential equivalence of threshold-dependent metrics, if in a link prediction task, some specific thresholds are meaningful, we can consider any one threshold-dependent metric with those thresholds. This work completes a missing part in the landscape of link prediction, and provides a starting point toward a well-accepted criterion or standard to select proper evaluation metrics for link prediction.
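
The inconsistency described in the abstract can be illustrated with a small, purely hypothetical example. The sketch below is not the authors' experimental code: the labels, the two score vectors, and the function names are assumptions made for illustration, and average precision is used as a common stand-in for the area under the precision-recall curve. It computes AUC, average precision, and NDCG (with binary relevance) for two made-up predictors over ten candidate links, two of which are true links.

```python
import math

def auc(labels, scores):
    """Probability that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def ranked_labels(labels, scores):
    """Labels reordered from highest to lowest score."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [labels[i] for i in order]

def average_precision(labels, scores):
    """Mean of precision@k over the ranks k at which a true link appears."""
    hits, total = 0, 0.0
    for k, y in enumerate(ranked_labels(labels, scores), start=1):
        if y == 1:
            hits += 1
            total += hits / k
    return total / hits

def ndcg(labels, scores):
    """Normalized discounted cumulative gain with binary relevance."""
    ranked = ranked_labels(labels, scores)
    dcg = sum(y / math.log2(k + 1) for k, y in enumerate(ranked, start=1))
    ideal = sum(1 / math.log2(k + 1) for k in range(1, sum(labels) + 1))
    return dcg / ideal

# Hypothetical toy data: 10 candidate links, 2 of them true (label 1).
labels = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]
# Predictor A ranks one true link first and the other last.
scores_a = [10, 9, 8, 7, 6, 5, 4, 3, 2, 1]
# Predictor B ranks both true links at positions 3 and 4.
scores_b = [8, 10, 9, 6, 5, 4, 3, 2, 1, 7]

results = {
    name: {
        "AUC": auc(labels, s),
        "AP": average_precision(labels, s),
        "NDCG": ndcg(labels, s),
    }
    for name, s in (("A", scores_a), ("B", scores_b))
}

for metric in ("AUC", "AP", "NDCG"):
    best = max(results, key=lambda name: results[name][metric])
    print(f"{metric}: A={results['A'][metric]:.3f} "
          f"B={results['B'][metric]:.3f} -> prefers {best}")
```

On this toy data AUC prefers predictor B, while average precision and NDCG prefer predictor A, because the latter two reward a true link placed at the very top of the ranking more heavily. This is consistent with the abstract's suggestion to pair AUC with one of the other three metrics, although a ten-link example says nothing about how often such disagreements occur on real networks.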

Figures

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90d6/11574622/338bd38a6524/pgae498f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90d6/11574622/2194b3385c2a/pgae498f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90d6/11574622/9c6e78bdb97e/pgae498f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/90d6/11574622/2d605ca44711/pgae498f4.jpg
