Bhaskar Dhananjay, Zhang William Y, Wong Ian Y
School of Engineering, Center for Biomedical Engineering, Brown University, 184 Hope St Box D, Providence, RI 02912, USA.
Department of Computer Science, Brown University, 184 Hope St Box D, Providence, RI 02912, USA.
Soft Matter. 2021 May 5;17(17):4653-4664. doi: 10.1039/d1sm00072a.
Interacting, self-propelled particles such as epithelial cells can dynamically self-organize into complex multicellular patterns, which are challenging to classify without a priori information. Classically, different phases and phase transitions have been described based on local ordering, which may not capture structural features at larger length scales. Instead, topological data analysis (TDA) determines the stability of spatial connectivity at varying length scales (i.e. persistent homology), and can compare different particle configurations based on the "cost" of reorganizing one configuration into another. Here, we demonstrate a topology-based machine learning approach for unsupervised profiling of individual and collective phases based on large-scale loops. We show that these topological loops (i.e. dimension 1 homology) are robust to variations in particle number and density, particularly in comparison to connected components (i.e. dimension 0 homology). We use TDA to map out phase diagrams for simulated particles with varying adhesion and propulsion, at constant population size as well as when proliferation is permitted. Next, we use this approach to profile our recent experiments on the clustering of epithelial cells in varying growth factor conditions, which are compared to our simulations. Finally, we characterize the robustness of this approach at varying length scales, with sparse sampling, and over time. Overall, we envision TDA will be broadly applicable as a model-agnostic approach to analyze active systems with varying population size, from cytoskeletal motors to motile cells to flocking or swarming animals.
相互作用的自推进粒子,如上皮细胞,可以动态地自组织成复杂的多细胞模式,在没有先验信息的情况下对其进行分类具有挑战性。传统上,基于局部有序性描述了不同的相和相变,而这可能无法捕捉更大长度尺度上的结构特征。相反,拓扑数据分析(TDA)确定了不同长度尺度上空间连通性的稳定性(即持久同调),并且可以基于将一种构型重新组织成另一种构型的“成本”来比较不同的粒子构型。在这里,我们展示了一种基于拓扑的机器学习方法,用于基于大规模环对个体和集体相进行无监督分析。我们表明,这些拓扑环(即一维同调)对粒子数量和密度的变化具有鲁棒性,特别是与连通分量(即零维同调)相比。我们使用TDA来绘制具有不同粘附和推进力的模拟粒子的相图,包括在恒定种群大小以及允许增殖的情况下。接下来,我们使用这种方法对我们最近在不同生长因子条件下上皮细胞聚类的实验进行分析,并与我们的模拟结果进行比较。最后,我们表征了这种方法在不同长度尺度、稀疏采样以及随时间变化时的鲁棒性。总体而言,我们设想TDA将作为一种与模型无关的方法广泛应用于分析具有不同种群大小的活性系统,从细胞骨架马达到运动细胞再到成群或结队的动物。