Institute for Data, Systems and Society, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.
Department of Electrical Engineering & Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.
PLoS Comput Biol. 2020 Apr 28;16(4):e1007828. doi: 10.1371/journal.pcbi.1007828. eCollection 2020 Apr.
Lineage tracing involves the identification of all ancestors and descendants of a given cell, and is an important tool for studying biological processes such as development and disease progression. However, in many settings, controlled time-course experiments are not feasible, for example when working with tissue samples from patients. Here we present ImageAEOT, a computational pipeline based on autoencoders and optimal transport for predicting the lineages of cells using time-labeled datasets from different stages of a cellular process. Given a single-cell image from one of the stages, ImageAEOT generates an artificial lineage of this cell based on the population characteristics of the other stages. These lineages can be used to connect subpopulations of cells through the different stages and identify image-based features and biomarkers underlying the biological process. To validate our method, we apply ImageAEOT to a benchmark task based on nuclear and chromatin images during the activation of fibroblasts by tumor cells in engineered 3D tissues. We further validate ImageAEOT on chromatin images of various breast cancer cell lines and human tissue samples, thereby linking alterations in chromatin condensation patterns to different stages of tumor progression. Our results demonstrate the promise of computational methods based on autoencoding and optimal transport principles for lineage tracing in settings where existing experimental strategies cannot be used.
谱系追踪涉及鉴定给定细胞的所有祖先和后代,是研究发育和疾病进展等生物学过程的重要工具。然而,在许多情况下,受控的时间过程实验是不可行的,例如在处理来自患者的组织样本时。在这里,我们提出了 ImageAEOT,这是一个基于自动编码器和最优传输的计算管道,用于使用细胞过程的不同阶段的时间标记数据集来预测细胞的谱系。给定一个来自其中一个阶段的单细胞图像,ImageAEOT 根据其他阶段的群体特征生成该细胞的人工谱系。这些谱系可用于通过不同阶段连接细胞的亚群,并确定生物过程中基于图像的特征和生物标志物。为了验证我们的方法,我们将 ImageAEOT 应用于基于肿瘤细胞在工程化 3D 组织中激活成纤维细胞期间的核和染色质图像的基准任务。我们进一步在各种乳腺癌细胞系和人类组织样本的染色质图像上验证了 ImageAEOT,从而将染色质凝聚模式的改变与肿瘤进展的不同阶段联系起来。我们的结果表明,基于自动编码和最优传输原理的计算方法在无法使用现有实验策略的情况下进行谱系追踪具有很大的潜力。