Department of Physics, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany.
Department of Mathematics & Computer Science, Technische Universiteit Eindhoven, Eindhoven, Netherlands.
PLoS One. 2023 May 15;18(5):e0284736. doi: 10.1371/journal.pone.0284736. eCollection 2023.
Biological processes involve movements across all measurable scales. Similarity measures can be applied to compare and analyze these movements but differ in how differences in movement are aggregated across space and time. The present study reviews frequently-used similarity measures, such as the Hausdorff distance, Fréchet distance, Dynamic Time Warping, and Longest Common Subsequence, jointly with several measures less used in biological applications (Wasserstein distance, weak Fréchet distance, and Kullback-Leibler divergence), and provides computational tools for each of them that may be used in computational biology. We illustrate the use of the selected similarity measures in diagnosing differences within two extremely contrasting sets of biological data, which, remarkably, may both be relevant for magnetic field perception by migratory birds. Specifically, we assess and discuss cryptochrome protein conformational dynamics and extreme migratory trajectories of songbirds between Alaska and Africa. We highlight how similarity measures contrast regarding computational complexity and discuss those which can be useful in noise elimination or, conversely, are sensitive to spatiotemporal scales.
生物过程涉及跨越所有可测量尺度的运动。相似性度量可以用来比较和分析这些运动,但在如何在空间和时间上聚合运动差异方面有所不同。本研究回顾了常用的相似性度量,如 Hausdorff 距离、Fréchet 距离、动态时间规整和最长公共子序列,以及在生物应用中较少使用的几种度量(Wasserstein 距离、弱 Fréchet 距离和 Kullback-Leibler 散度),并为每个度量提供了可能在计算生物学中使用的计算工具。我们通过两个极端对比的生物数据集说明了所选相似性度量的用途,这两个数据集都可能与候鸟对磁场的感知有关。具体来说,我们评估和讨论了隐花色素蛋白构象动力学以及阿拉斯加和非洲之间鸣禽的极端迁徙轨迹。我们强调了相似性度量在计算复杂性方面的差异,并讨论了那些在消除噪声方面有用的或相反地对时空尺度敏感的度量。