Struski Lukasz, Sadowski Michal, Danel Tomasz, Tabor Jacek, Podolak Igor T
IEEE Trans Neural Netw Learn Syst. 2024 Sep;35(9):12068-12082. doi: 10.1109/TNNLS.2023.3251848. Epub 2024 Sep 3.
Interpolating between points is a problem connected simultaneously with finding geodesics and study of generative models. In the case of geodesics, we search for the curves with the shortest length, while in the case of generative models, we typically apply linear interpolation in the latent space. However, this interpolation uses implicitly the fact that Gaussian is unimodal. Thus, the problem of interpolating in the case when the latent density is non-Gaussian is an open problem. In this article, we present a general and unified approach to interpolation, which simultaneously allows us to search for geodesics and interpolating curves in latent space in the case of arbitrary density. Our results have a strong theoretical background based on the introduced quality measure of an interpolating curve. In particular, we show that maximizing the quality measure of the curve can be equivalently understood as a search of geodesic for a certain redefinition of the Riemannian metric on the space. We provide examples in three important cases. First, we show that our approach can be easily applied to finding geodesics on manifolds. Next, we focus our attention in finding interpolations in pretrained generative models. We show that our model effectively works in the case of arbitrary density. Moreover, we can interpolate in the subset of the space consisting of data possessing a given feature. The last case is focused on finding interpolation in the space of chemical compounds.
在点之间进行插值是一个与寻找测地线和生成模型研究同时相关的问题。在测地线的情况下,我们寻找长度最短的曲线,而在生成模型的情况下,我们通常在潜在空间中应用线性插值。然而,这种插值隐含地利用了高斯分布是单峰的这一事实。因此,在潜在密度为非高斯的情况下进行插值的问题仍然是一个未解决的问题。在本文中,我们提出了一种通用且统一的插值方法,该方法同时允许我们在任意密度的情况下在潜在空间中寻找测地线和插值曲线。我们的结果基于所引入的插值曲线质量度量具有强大的理论背景。特别是,我们表明最大化曲线的质量度量可以等效地理解为在空间上对黎曼度量进行特定重新定义时寻找测地线。我们在三个重要案例中提供了示例。首先,我们表明我们的方法可以很容易地应用于在流形上寻找测地线。接下来,我们将注意力集中在预训练生成模型中寻找插值。我们表明我们的模型在任意密度的情况下都能有效工作。此外,我们可以在由具有给定特征的数据组成的空间子集中进行插值。最后一个案例集中在寻找化合物空间中的插值。