AnacletoLab, Dipartimento di Informatica, Università degli Studi di Milano, Milan, Italy.
National Laboratory in Artificial Intelligence and Intelligent Systems, Consorzio Interuniversitario Nazionale per l'Informatica, Rome, Italy.
Nat Comput Sci. 2023 Jun;3(6):552-568. doi: 10.1038/s43588-023-00465-8. Epub 2023 Jun 26.
Graph representation learning methods opened new avenues for addressing complex, real-world problems represented by graphs. However, many graphs used in these applications comprise millions of nodes and billions of edges and are beyond the capabilities of current methods and software implementations. We present GRAPE (Graph Representation Learning, Prediction and Evaluation), a software resource for graph processing and embedding that is able to scale with big graphs by using specialized and smart data structures, algorithms, and a fast parallel implementation of random-walk-based methods. Compared with state-of-the-art software resources, GRAPE shows an improvement of orders of magnitude in empirical space and time complexity, as well as competitive edge- and node-label prediction performance. GRAPE comprises approximately 1.7 million well-documented lines of Python and Rust code and provides 69 node-embedding methods, 25 inference models, a collection of efficient graph-processing utilities, and over 80,000 graphs from the literature and other sources. Standardized interfaces allow a seamless integration of third-party libraries, while ready-to-use and modular pipelines permit an easy-to-use evaluation of graph-representation-learning methods, therefore also positioning GRAPE as a software resource that performs a fair comparison between methods and libraries for graph processing and embedding.
图表示学习方法为解决由图表示的复杂现实世界问题开辟了新途径。然而,这些应用中使用的许多图包含数百万个节点和数十亿个边,超出了当前方法和软件实现的能力。我们提出了 GRAPE(Graph Representation Learning, Prediction and Evaluation),这是一个用于图处理和嵌入的软件资源,它能够通过使用专门的和智能的数据结构、算法以及快速的基于随机游走的方法并行实现来扩展到大型图。与最先进的软件资源相比,GRAPE 在经验空间和时间复杂度方面显示出数量级的改进,以及具有竞争力的边和节点标签预测性能。GRAPE 包含大约 170 万行经过良好记录的 Python 和 Rust 代码,并提供了 69 种节点嵌入方法、25 种推理模型、一系列高效的图处理实用程序以及来自文献和其他来源的 80,000 多个图。标准化接口允许无缝集成第三方库,而即用型和模块化管道允许轻松评估图表示学习方法,因此 GRAPE 也作为一个软件资源,可在图处理和嵌入方面对方法和库进行公平比较。