Santella Anthony, Du Zhuo, Bao Zhirong
Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, New York, New York 10065, USA.
BMC Bioinformatics. 2014 Jun 25;15:217. doi: 10.1186/1471-2105-15-217.
Advances in fluorescence labeling and imaging have made it possible to acquire in vivo records of complex biological processes. Analysis has lagged behind acquisition in part because of the difficulty and computational expense of accurate cell tracking. In vivo analysis requires, at minimum, tracking hundreds of cells over hundreds of time points in complex three dimensional environments. We address this challenge with a computational framework capable of efficiently and accurately tracing entire cell lineages.
The bulk of the tracking problem-tracking cells during interphase-is straightforward and can be executed with simple and fast methods. Difficult cases originate from detection errors and relatively rare large motions. Therefore, our method focuses computational effort on difficult cases identified by local increases in cell number. We force these cases into tentative cell track bifurcations, which define natural semi-local neighborhoods that permit Bayesian judgment about the underlying cell behavior. The bifurcation judgment process not only correctly tracks through cell divisions and large movements, but also offers corrections to detection errors. We demonstrate that this method enables large scale analysis of Caenorhabditis elegans development, an ideal validation platform because of an invariant cell lineage.
The high accuracy achieved by our method suggests that a bifurcation-based semi-local neighborhood provides sufficient information to recognize dependencies between nearby tracking choices, and to interpret difficult tracking cases without reverting to global optimization. Our method makes large amounts of lineage data accessible and opens the door to new types of statistical analysis of complex in vivo processes.
荧光标记和成像技术的进步使得获取复杂生物过程的体内记录成为可能。分析工作在一定程度上滞后于数据获取,部分原因是精确的细胞追踪存在困难且计算成本高昂。体内分析至少需要在复杂的三维环境中,对数百个细胞在数百个时间点进行追踪。我们通过一个能够高效且准确地追踪整个细胞谱系的计算框架来应对这一挑战。
追踪问题的主要部分——间期细胞追踪——较为简单,可以用简单快速的方法执行。困难情况源于检测错误和相对罕见的大幅度移动。因此,我们的方法将计算精力集中在通过细胞数量局部增加识别出的困难情况上。我们将这些情况强制纳入暂定的细胞轨迹分支,这些分支定义了自然的半局部邻域,允许对潜在的细胞行为进行贝叶斯判断。分支判断过程不仅能正确追踪细胞分裂和大幅度移动,还能对检测错误进行校正。我们证明,这种方法能够对秀丽隐杆线虫的发育进行大规模分析,由于其细胞谱系不变,这是一个理想的验证平台。
我们的方法所实现的高精度表明,基于分支的半局部邻域提供了足够的信息来识别附近追踪选择之间的依赖性,并解释困难的追踪情况,而无需诉诸全局优化。我们的方法使大量的谱系数据易于获取,并为复杂体内过程的新型统计分析打开了大门。