Gao Yingnan, Feder Alison F
Department of Genome Sciences, University of Washington, Seattle, WA.
Herbold Computational Biology Program, Fred Hutchinson Cancer Center, Seattle, WA.
bioRxiv. 2024 Aug 6:2024.06.27.601073. doi: 10.1101/2024.06.27.601073.
Understanding cellular birth rate differences is crucial for predicting cancer progression and interpreting tumor-derived genetic data. Lineage tracing experiments enable detailed reconstruction of cellular genealogies, offering new opportunities to measure branching rate heterogeneity. However, the lineage tracing process can introduce complex tree features that complicate this effort. Here, we examine tree characteristics in lineage tracing-derived genealogies and find that editing window placement leads to multifurcations at a tree's root or tips. We propose several ways in which existing tree topology-based metrics can be extended to test for rate heterogeneity on trees even in the presence of lineage-tracing associated distortions. Although these methods vary in power and robustness, a test based on the statistic effectively detects branching rate heterogeneity in simulated lineage tracing data. Tests based on other common statistics ( and the Sackin index) show interior performance to . We apply our validated methods to xenograft experimental data and find widespread rate heterogeneity across multiple study systems. Our results demonstrate the potential of tree topology statistics in analyzing lineage tracing data, and highlight the challenges associated with adapting phylogenetic methods to these systems.
了解细胞出生率差异对于预测癌症进展和解读肿瘤衍生的遗传数据至关重要。谱系追踪实验能够详细重建细胞谱系,为测量分支速率异质性提供了新机会。然而,谱系追踪过程可能会引入复杂的树状特征,使这项工作变得复杂。在这里,我们研究了谱系追踪衍生谱系中的树状特征,发现编辑窗口的放置会导致树的根部或末梢出现多分支。我们提出了几种方法,通过这些方法可以扩展现有的基于树拓扑的指标,以测试即使在存在与谱系追踪相关的扭曲情况下树的速率异质性。尽管这些方法在功效和稳健性方面各不相同,但基于统计量的检验能够有效地检测模拟谱系追踪数据中的分支速率异质性。基于其他常见统计量(和萨克林指数)的检验显示出中等性能。我们将经过验证的方法应用于异种移植实验数据,发现在多个研究系统中广泛存在速率异质性。我们的结果证明了树拓扑统计在分析谱系追踪数据方面的潜力,并突出了将系统发育方法应用于这些系统所面临的挑战。