Suppr超能文献

基于吸收随机游走的有向图上的转换。

Transduction on Directed Graphs via Absorbing Random Walks.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2018 Jul;40(7):1770-1784. doi: 10.1109/TPAMI.2017.2730871. Epub 2017 Aug 11.

Abstract

In this paper we consider the problem of graph-based transductive classification, and we are particularly interested in the directed graph scenario which is a natural form for many real world applications. Different from existing research efforts that either only deal with undirected graphs or circumvent directionality by means of symmetrization, we propose a novel random walk approach on directed graphs using absorbing Markov chains, which can be regarded as maximizing the accumulated expected number of visits from the unlabeled transient states. Our algorithm is simple, easy to implement, and works with large-scale graphs on binary, multiclass, and multi-label prediction problems. Moreover, it is capable of preserving the graph structure even when the input graph is sparse and changes over time, as well as retaining weak signals presented in the directed edges. We present its intimate connections to a number of existing methods, including graph kernels, graph Laplacian based methods, and spanning forest of graphs. Its computational complexity and the generalization error are also studied. Empirically, our algorithm is evaluated on a wide range of applications, where it has shown to perform competitively comparing to a suite of state-of-the-art methods. In particular, our algorithm is shown to work exceptionally well with large sparse directed graphs with e.g., millions of nodes and tens of millions of edges, where it significantly outperforms other state-of-the-art methods. In the dynamic graph setting involving insertion or deletion of nodes and edge-weight changes over time, it also allows efficient online updates that produce the same results as of the batch update counterparts.

摘要

在本文中,我们考虑基于图的转导分类问题,特别关注于有向图场景,这是许多实际应用的自然形式。与仅处理无向图或通过对称化来规避方向性的现有研究工作不同,我们提出了一种基于有向图的新随机游走方法,使用吸收马尔可夫链,可以视为最大化未标记瞬态状态的访问累积期望数。我们的算法简单、易于实现,适用于二进制、多类和多标签预测问题的大规模图。此外,即使输入图稀疏且随时间变化,它也能够保留图结构,并保留有向边中呈现的弱信号。我们展示了它与许多现有方法的紧密联系,包括图核、基于图拉普拉斯的方法和图的生成树森林。还研究了它的计算复杂性和泛化误差。在广泛的应用中评估了我们的算法,它在与一系列最先进的方法的比较中表现出了竞争力。特别是,我们的算法在具有例如百万个节点和数千万条边的大型稀疏有向图中表现出色,明显优于其他最先进的方法。在涉及节点插入或删除以及边权重随时间变化的动态图设置中,它还允许高效的在线更新,产生与批量更新对应的相同结果。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验