Huguet Guillaume, Magruder D S, Tong Alexander, Fasina Oluwadamilola, Kuchroo Manik, Wolf Guy, Krishnaswamy Smita
Université de Montréal; Mila - Quebec AI Institute.
Yale University.
Adv Neural Inf Process Syst. 2022 Dec;35:29705-29718.
We present a method called Manifold Interpolating Optimal-Transport Flow (MIOFlow) that learns stochastic, continuous population dynamics from static snapshot samples taken at sporadic timepoints. MIOFlow combines dynamic models, manifold learning, and optimal transport by training neural ordinary differential equations (Neural ODE) to interpolate between static population snapshots as penalized by optimal transport with manifold ground distance. Further, we ensure that the flow follows the geometry by operating in the latent space of an autoencoder that we call a geodesic autoencoder (GAE). In GAE the latent space distance between points is regularized to match a novel multiscale geodesic distance on the data manifold that we define. We show that this method is superior to normalizing flows, Schrödinger bridges and other generative models that are designed to flow from noise to data in terms of interpolating between populations. Theoretically, we link these trajectories with dynamic optimal transport. We evaluate our method on simulated data with bifurcations and merges, as well as scRNA-seq data from embryoid body differentiation, and acute myeloid leukemia treatment.
我们提出了一种名为流形插值最优传输流(MIOFlow)的方法,该方法可从在零星时间点采集的静态快照样本中学习随机、连续的群体动态。MIOFlow通过训练神经常微分方程(Neural ODE)来结合动态模型、流形学习和最优传输,以便在静态群体快照之间进行插值,这种插值受到带有流形地面距离的最优传输的惩罚。此外,我们通过在一个我们称为测地线自动编码器(GAE)的自动编码器的潜在空间中进行操作,确保流遵循几何形状。在GAE中,点之间的潜在空间距离被正则化,以匹配我们定义的数据流形上的一种新型多尺度测地线距离。我们表明,在群体之间进行插值方面,该方法优于归一化流、薛定谔桥以及其他旨在从噪声流向数据的生成模型。从理论上讲,我们将这些轨迹与动态最优传输联系起来。我们在具有分支和合并的模拟数据以及来自胚状体分化和急性髓细胞白血病治疗的单细胞RNA测序(scRNA-seq)数据上评估了我们的方法。