Wang Bao, Wei Guo-Wei
Department of Mathematics Michigan State University, MI 48824, USA.
Mathematical Biosciences Institute, The Ohio State University, Columbus, Ohio 43210, USA.
J Comput Phys. 2016 Jan 15;305:276-299. doi: 10.1016/j.jcp.2015.10.036.
Persistent homology provides a new approach for the topological simplification of big data via measuring the life time of intrinsic topological features in a filtration process and has found its success in scientific and engineering applications. However, such a success is essentially limited to qualitative data classification and analysis. Indeed, persistent homology has rarely been employed for quantitative modeling and prediction. Additionally, the present persistent homology is a passive tool, rather than a proactive technique, for classification and analysis. In this work, we outline a general protocol to construct object-oriented persistent homology methods. By means of differential geometry theory of surfaces, we construct an objective functional, namely, a surface free energy defined on the data of interest. The minimization of the objective functional leads to a Laplace-Beltrami operator which generates a multiscale representation of the initial data and offers an objective oriented filtration process. The resulting differential geometry based object-oriented persistent homology is able to preserve desirable geometric features in the evolutionary filtration and enhances the corresponding topological persistence. The cubical complex based homology algorithm is employed in the present work to be compatible with the Cartesian representation of the Laplace-Beltrami flow. The proposed Laplace-Beltrami flow based persistent homology method is extensively validated. The consistence between Laplace-Beltrami flow based filtration and Euclidean distance based filtration is confirmed on the Vietoris-Rips complex for a large amount of numerical tests. The convergence and reliability of the present Laplace-Beltrami flow based cubical complex filtration approach are analyzed over various spatial and temporal mesh sizes. The Laplace-Beltrami flow based persistent homology approach is utilized to study the intrinsic topology of proteins and fullerene molecules. Based on a quantitative model which correlates the topological persistence of fullerene central cavity with the total curvature energy of the fullerene structure, the proposed method is used for the prediction of fullerene isomer stability. The efficiency and robustness of the present method are verified by more than 500 fullerene molecules. It is shown that the proposed persistent homology based quantitative model offers good predictions of total curvature energies for ten types of fullerene isomers. The present work offers the first example to design object-oriented persistent homology to enhance or preserve desirable features in the original data during the filtration process and then automatically detect or extract the corresponding topological traits from the data.
持久同调通过在过滤过程中测量内在拓扑特征的寿命,为大数据的拓扑简化提供了一种新方法,并已在科学和工程应用中取得成功。然而,这种成功本质上仅限于定性数据分类和分析。实际上,持久同调很少用于定量建模和预测。此外,当前的持久同调是一种用于分类和分析的被动工具,而非主动技术。在这项工作中,我们概述了构建面向对象持久同调方法的一般方案。借助曲面的微分几何理论,我们构建了一个目标泛函,即在感兴趣的数据上定义的表面自由能。目标泛函的最小化导致一个拉普拉斯 - 贝尔特拉米算子,该算子生成初始数据的多尺度表示,并提供一个面向目标的过滤过程。由此产生的基于微分几何的面向对象持久同调能够在演化过滤中保留理想的几何特征,并增强相应的拓扑持久性。在本工作中采用基于立方体复形的同调算法,以与拉普拉斯 - 贝尔特拉米流的笛卡尔表示兼容。所提出的基于拉普拉斯 - 贝尔特拉米流的持久同调方法得到了广泛验证。通过大量数值测试,在 Vietoris - Rips 复形上证实了基于拉普拉斯 - 贝尔特拉米流的过滤与基于欧几里得距离的过滤之间的一致性。在各种空间和时间网格尺寸上分析了当前基于拉普拉斯 - 贝尔特拉米流的立方体复形过滤方法的收敛性和可靠性。基于拉普拉斯 - 贝尔特拉米流的持久同调方法被用于研究蛋白质和富勒烯分子的内在拓扑。基于一个将富勒烯中心腔的拓扑持久性与富勒烯结构的总曲率能量相关联的定量模型,所提出的方法用于预测富勒烯异构体的稳定性。通过 500 多个富勒烯分子验证了本方法的效率和稳健性。结果表明,所提出的基于持久同调的定量模型对十种类型的富勒烯异构体的总曲率能量提供了良好的预测。本工作提供了第一个设计面向对象持久同调的示例,以在过滤过程中增强或保留原始数据中的理想特征,然后自动从数据中检测或提取相应的拓扑特征。