Stegmaier Johannes, Mikut Ralf
Institute for Applied Computer Science, Karlsruhe Institute of Technology, Eggenstein-Leopoldshafen, Germany.
PLoS One. 2017 Nov 2;12(11):e0187535. doi: 10.1371/journal.pone.0187535. eCollection 2017.
Many automatically analyzable scientific questions are well-posed and a variety of information about expected outcomes is available a priori. Although often neglected, this prior knowledge can be systematically exploited to make automated analysis operations sensitive to a desired phenomenon or to evaluate extracted content with respect to this prior knowledge. For instance, the performance of processing operators can be greatly enhanced by a more focused detection strategy and by direct information about the ambiguity inherent in the extracted data. We present a new concept that increases the result quality awareness of image analysis operators by estimating and distributing the degree of uncertainty involved in their output based on prior knowledge. This allows the use of simple processing operators that are suitable for analyzing large-scale spatiotemporal (3D+t) microscopy images without compromising result quality. On the foundation of fuzzy set theory, we transform available prior knowledge into a mathematical representation and extensively use it to enhance the result quality of various processing operators. These concepts are illustrated on a typical bioimage analysis pipeline comprised of seed point detection, segmentation, multiview fusion and tracking. The functionality of the proposed approach is further validated on a comprehensive simulated 3D+t benchmark data set that mimics embryonic development and on large-scale light-sheet microscopy data of a zebrafish embryo. The general concept introduced in this contribution represents a new approach to efficiently exploit prior knowledge to improve the result quality of image analysis pipelines. The generality of the concept makes it applicable to practically any field with processing strategies that are arranged as linear pipelines. The automated analysis of terabyte-scale microscopy data will especially benefit from sophisticated and efficient algorithms that enable a quantitative and fast readout.
许多可自动分析的科学问题都表述清晰,并且关于预期结果的各种信息在事先就可获取。尽管这种先验知识常常被忽视,但它可以被系统地利用,以使自动化分析操作对期望的现象敏感,或者根据这种先验知识评估提取的内容。例如,通过更具针对性的检测策略以及关于提取数据中固有模糊性的直接信息,可以大大提高处理算子的性能。我们提出了一个新概念,即通过基于先验知识估计和分布图像分析算子输出中所涉及的不确定性程度,来提高其结果质量意识。这使得我们能够使用适合分析大规模时空(3D+t)显微镜图像的简单处理算子,而不影响结果质量。基于模糊集理论,我们将可用的先验知识转化为数学表示,并广泛利用它来提高各种处理算子的结果质量。这些概念在一个典型的生物图像分析流程中得到了说明,该流程包括种子点检测、分割、多视图融合和跟踪。所提出方法的功能在一个模拟胚胎发育的综合3D+t基准数据集以及斑马鱼胚胎的大规模光片显微镜数据上得到了进一步验证。本论文中引入的一般概念代表了一种有效利用先验知识来提高图像分析流程结果质量的新方法。该概念的通用性使其适用于几乎任何采用线性流程处理策略的领域。对万亿字节规模显微镜数据的自动分析将特别受益于能够实现定量和快速读出的复杂高效算法。