SCI Institute, University of Utah, USA.
IEEE Trans Vis Comput Graph. 2011 Dec;17(12):1902-11. doi: 10.1109/TVCG.2011.177.
Large observations and simulations in scientific research give rise to high-dimensional data sets that present many challenges and opportunities in data analysis and visualization. Researchers in application domains such as engineering, computational biology, climate study, imaging and motion capture are faced with the problem of how to discover compact representations of high-dimensional data while preserving their intrinsic structure. In many applications, the original data is projected onto low-dimensional space via dimensionality reduction techniques prior to modeling. One problem with this approach is that the projection step in the process can fail to preserve structure in the data that is only apparent in high dimensions. Conversely, such techniques may create structural illusions in the projection, implying structure not present in the original high-dimensional data. Our solution is to utilize topological techniques to recover important structures in high-dimensional data that contains non-trivial topology. Specifically, we are interested in high-dimensional branching structures. We construct local circle-valued coordinate functions to represent such features. Subsequently, we perform dimensionality reduction on the data while ensuring such structures are visually preserved. Additionally, we study the effects of global circular structures on visualizations. Our results reveal never-before-seen structures on real-world data sets from a variety of applications.
科学研究中的大规模观测和模拟产生了高维数据集,这在数据分析和可视化方面带来了许多挑战和机遇。工程、计算生物学、气候研究、成像和运动捕捉等应用领域的研究人员面临着如何在保留数据内在结构的同时发现高维数据的紧凑表示的问题。在许多应用中,在建模之前,原始数据通过降维技术投影到低维空间。该方法存在一个问题,即该过程中的投影步骤可能无法保留仅在高维中才明显的结构。相反,此类技术可能会在投影中产生结构错觉,暗示原始高维数据中不存在结构。我们的解决方案是利用拓扑技术来恢复高维数据中包含非平凡拓扑的重要结构。具体来说,我们对高维分支结构感兴趣。我们构建局部圆值坐标函数来表示这些特征。随后,我们在数据上执行降维,同时确保视觉上保留这些结构。此外,我们还研究全局圆形结构对可视化的影响。我们的结果揭示了来自各种应用的真实世界数据集上以前从未见过的结构。