IEEE Trans Vis Comput Graph. 2018 Jan;24(1):131-141. doi: 10.1109/TVCG.2017.2745258. Epub 2017 Aug 29.
Dimension reduction algorithms and clustering algorithms are both frequently used techniques in visual analytics. Both families of algorithms assist analysts in performing related tasks regarding the similarity of observations and finding groups in datasets. Though initially used independently, recent works have incorporated algorithms from each family into the same visualization systems. However, these algorithmic combinations are often ad hoc or disconnected, working independently and in parallel rather than integrating some degree of interdependence. A number of design decisions must be addressed when employing dimension reduction and clustering algorithms concurrently in a visualization system, including the selection of each algorithm, the order in which they are processed, and how to present and interact with the resulting projection. This paper contributes an overview of combining dimension reduction and clustering into a visualization system, discussing the challenges inherent in developing a visualization system that makes use of both families of algorithms.
降维算法和聚类算法都是可视化分析中常用的技术。这两类算法都能帮助分析师处理数据集中观测值相似性和发现分组的相关任务。尽管它们最初是独立使用的,但最近的工作已经将来自每个家族的算法整合到同一个可视化系统中。然而,这些算法的组合通常是特定的或不连贯的,它们独立且并行地工作,而不是集成一定程度的相互依赖。在可视化系统中同时使用降维和聚类算法时,必须解决许多设计决策,包括选择每个算法、它们的处理顺序以及如何呈现和与生成的投影交互。本文贡献了将降维和聚类结合到一个可视化系统中的概述,讨论了开发利用这两类算法的可视化系统所固有的挑战。