Bachthaler Sven, Weiskopf Daniel
VISUS (Visualization Research Center), Universität Stuttgart, Stuttgart, Germany.
IEEE Trans Vis Comput Graph. 2008 Nov-Dec;14(6):1428-35. doi: 10.1109/TVCG.2008.119.
Scatterplots are well established means of visualizing discrete data values with two data variables as a collection of discrete points. We aim at generalizing the concept of scatterplots to the visualization of spatially continuous input data by a continuous and dense plot. An example of a continuous input field is data defined on an n-D spatial grid with respective interpolation or reconstruction of in-between values. We propose a rigorous, accurate, and generic mathematical model of continuous scatterplots that considers an arbitrary density defined on an input field on an n-D domain and that maps this density to m-D scatterplots. Special cases are derived from this generic model and discussed in detail: scatterplots where the n-D spatial domain and the m-D data attribute domain have identical dimension, 1-D scatterplots as a way to define continuous histograms, and 2-D scatterplots of data on 3-D spatial grids. We show how continuous histograms are related to traditional discrete histograms and to the histograms of isosurface statistics. Based on the mathematical model of continuous scatterplots, respective visualization algorithms are derived, in particular for 2-D scatterplots of data from 3-D tetrahedral grids. For several visualization tasks, we show the applicability of continuous scatterplots. Since continuous scatterplots do not only sample data at grid points but interpolate data values within cells, a dense and complete visualization of the data set is achieved that scales well with increasing data set size. Especially for irregular grids with varying cell size, improved results are obtained when compared to conventional scatterplots. Therefore, continuous scatterplots are a suitable extension of a statistics visualization technique to be applied to typical data from scientific computation.
散点图是一种成熟的方法,用于将具有两个数据变量的离散数据值可视化为离散点的集合。我们旨在将散点图的概念推广到通过连续且密集的绘图来可视化空间连续输入数据。连续输入场的一个示例是在n维空间网格上定义的数据,并对中间值进行相应的插值或重构。我们提出了一种严格、准确且通用的连续散点图数学模型,该模型考虑在n维域上的输入场上定义的任意密度,并将此密度映射到m维散点图。从这个通用模型导出特殊情况并进行详细讨论:n维空间域和m维数据属性域具有相同维度的散点图、作为定义连续直方图的一种方式的1维散点图以及3维空间网格上数据的2维散点图。我们展示了连续直方图与传统离散直方图以及等值面统计直方图之间的关系。基于连续散点图的数学模型,推导了相应的可视化算法,特别是针对来自3维四面体网格数据的2维散点图。对于几个可视化任务,我们展示了连续散点图的适用性。由于连续散点图不仅在网格点对数据进行采样,还在单元格内对数据值进行插值,因此可以实现数据集的密集且完整的可视化,并且随着数据集大小的增加,其扩展性良好。特别是对于单元格大小不同的不规则网格,与传统散点图相比,可以获得更好的结果。因此,连续散点图是统计可视化技术的一种合适扩展,可应用于科学计算中的典型数据。