Hendrix Zachary H, Xu Lance W Q, Pressé Steve
ArXiv. 2025 Jul 25:arXiv:2507.05599v3.
Motion models (i.e., transition probability densities) are often deduced from fluorescence widefield tracking experiments by analyzing single-particle trajectories post-processed from data. This analysis immediately raises the question: To what degree is our ability to learn motion models impacted by analyzing post-processed trajectories versus raw measurements? To answer this question, we mathematically formulate a data likelihood for diffraction-limited fluorescence widefield tracking experiments. In particular, we make the likelihood's dependence on the motion model versus the emission (or measurement) model explicit. The emission model describes how photons emitted by biomolecules are distributed in space according to the optical point spread function, with intensities subsequently integrated over a pixel, and convoluted with camera noise. Logic dictates that if the likelihood is primarily informed by the motion model, it should be straightforward to learn the motion model from the post-processed trajectory. Contrarily, if the majority of the likelihood is dominated by the emission model, the post-processed trajectory inferred from data is primarily informed by the emission model, and very little information on the motion model permeates into the post-processed trajectories analyzed downstream to learn motion models. Indeed, we find that for typical diffraction-limited fluorescence experiments, the emission model often robustly contributes approximately 99% to the likelihood, leaving motion models to explain a meager 1% of the data. This result immediately casts doubt on our ability to reliably learn motion models from post-processed data, raising further questions on the significance of motion models learned thus far from post-processed single-particle trajectories from single-molecule widefield fluorescence tracking experiments.
运动模型(即转移概率密度)通常是通过分析从数据中后处理得到的单粒子轨迹,从荧光宽场跟踪实验中推导出来的。这种分析立即引发了一个问题:与分析原始测量值相比,我们学习运动模型的能力在多大程度上受到分析后处理轨迹的影响?为了回答这个问题,我们从数学上为衍射极限荧光宽场跟踪实验制定了一个数据似然性。特别是,我们明确了似然性对运动模型与发射(或测量)模型的依赖性。发射模型描述了生物分子发射的光子如何根据光学点扩散函数在空间中分布,其强度随后在一个像素上进行积分,并与相机噪声进行卷积。逻辑表明,如果似然性主要由运动模型决定,那么从后处理轨迹中学习运动模型应该是直接的。相反,如果似然性的大部分由发射模型主导,那么从数据中推断出的后处理轨迹主要由发射模型决定,并且关于运动模型的信息很少渗透到下游分析的后处理轨迹中以学习运动模型。事实上,我们发现对于典型的衍射极限荧光实验,发射模型通常对似然性的贡献约为99%,而运动模型只能解释1%的数据。这一结果立即让人怀疑我们从后处理数据中可靠学习运动模型的能力,引发了关于迄今为止从单分子宽场荧光跟踪实验的后处理单粒子轨迹中学习到的运动模型的意义的进一步问题。