Institute of Mathematics and Computer Science, USP, São Carlos, SP, Brazil.
The Observatory on Social Media (OSoMe), Indiana University, Bloomington, Indiana, United States of America.
PLoS One. 2024 Nov 6;19(11):e0312863. doi: 10.1371/journal.pone.0312863. eCollection 2024.
Random walks find extensive applications across various complex network domains, including embedding generation and link prediction. Despite the widespread utilization of random walks, the precise impact of distinct biases on embedding generation from sequence data and their subsequent effects on link prediction remain elusive. We conduct a comparative analysis of several random walk strategies, including the true self-avoiding random walk and the traditional random walk. We also analyze walks biased towards node degree and those with inverse node degree bias. Diverse adaptations of the node2vec algorithm to induce distinct exploratory behaviors were also investigated. Our empirical findings demonstrate that despite the varied behaviors inherent in these embeddings, only slight performance differences manifest in the context of link prediction. This implies the resilient recovery of network structure, regardless of the specific walk heuristic employed to traverse the network. Consequently, the results suggest that data generated from sequences governed by unknown mechanisms can be successfully reconstructed.
随机游走在各种复杂网络领域都有广泛的应用,包括嵌入生成和链接预测。尽管随机游走被广泛应用,但不同的偏差对序列数据嵌入生成的精确影响及其对链接预测的后续影响仍不清楚。我们对几种随机游走策略进行了比较分析,包括真实的自回避随机游走和传统的随机游走。我们还分析了偏向节点度和具有逆节点度偏差的游走。还研究了 node2vec 算法的多种自适应方法,以诱导出不同的探索行为。我们的实证研究结果表明,尽管这些嵌入中存在不同的行为,但在链接预测的情况下,只有细微的性能差异。这意味着网络结构具有很强的恢复能力,无论使用哪种特定的游走启发式来遍历网络。因此,结果表明可以成功地重建由未知机制生成的数据。