Justice Derek, Hero Alfred
Department of Electrical Engineering and Computer Science, University of Michigan, 1301 Beal Avenue, Ann Arbor, MI 48109-2122, USA.
IEEE Trans Pattern Anal Mach Intell. 2006 Aug;28(8):1200-14. doi: 10.1109/TPAMI.2006.152.
A binary linear programming formulation of the graph edit distance for unweighted, undirected graphs with vertex attributes is derived and applied to a graph recognition problem. A general formulation for editing graphs is used to derive a graph edit distance that is proven to be a metric, provided the cost function for individual edit operations is a metric. Then, a binary linear program is developed for computing this graph edit distance, and polynomial time methods for determining upper and lower bounds on the solution of the binary program are derived by applying solution methods for standard linear programming and the assignment problem. A recognition problem of comparing a sample input graph to a database of known prototype graphs in the context of a chemical information system is presented as an application of the new method. The costs associated with various edit operations are chosen by using a minimum normalized variance criterion applied to pairwise distances between nearest neighbors in the database of prototypes. The new metric is shown to perform quite well in comparison to existing metrics when applied to a database of chemical graphs.
推导了具有顶点属性的无加权、无向图的图编辑距离的二元线性规划公式,并将其应用于图识别问题。使用一种通用的图编辑公式来推导图编辑距离,证明只要单个编辑操作的成本函数是度量,则该距离就是一种度量。然后,开发了一个用于计算此图编辑距离的二元线性规划,并通过应用标准线性规划和分配问题的求解方法,推导了确定该二元规划解的上下界的多项式时间方法。作为新方法的应用,提出了在化学信息系统中,将样本输入图与已知原型图数据库进行比较的识别问题。通过对原型数据库中最近邻之间的成对距离应用最小归一化方差准则,选择与各种编辑操作相关的成本。当应用于化学图数据库时,新度量与现有度量相比表现良好。