两相映射用于投影大数据集。

Two-phase mapping for projecting massive data sets.

机构信息

Universidade de São Paulo, São Carlos, Brazil.

出版信息

IEEE Trans Vis Comput Graph. 2010 Nov-Dec;16(6):1281-90. doi: 10.1109/TVCG.2010.207.

DOI:10.1109/TVCG.2010.207

PMID:20975168

Abstract

Most multidimensional projection techniques rely on distance (dissimilarity) information between data instances to embed high-dimensional data into a visual space. When data are endowed with Cartesian coordinates, an extra computational effort is necessary to compute the needed distances, making multidimensional projection prohibitive in applications dealing with interactivity and massive data. The novel multidimensional projection technique proposed in this work, called Part-Linear Multidimensional Projection (PLMP), has been tailored to handle multivariate data represented in Cartesian high-dimensional spaces, requiring only distance information between pairs of representative samples. This characteristic renders PLMP faster than previous methods when processing large data sets while still being competitive in terms of precision. Moreover, knowing the range of variation for data instances in the high-dimensional space, we can make PLMP a truly streaming data projection technique, a trait absent in previous methods.

摘要

大多数多维投影技术都依赖于数据实例之间的距离（相似度）信息，将高维数据嵌入到可视空间中。当数据具有笛卡尔坐标时，需要额外的计算工作量来计算所需的距离，这使得多维投影在处理交互性和海量数据的应用中变得不可行。本文提出的一种新的多维投影技术，称为部分线性多维投影（PLMP），专门用于处理以笛卡尔高维空间表示的多元数据，只需要成对的代表样本之间的距离信息。当处理大型数据集时，该特性使 PLMP 比以前的方法更快，同时在精度方面仍具有竞争力。此外，通过了解高维空间中数据实例的变化范围，我们可以使 PLMP 成为一种真正的流数据投影技术，这是以前的方法所没有的特点。

相似文献

Two-phase mapping for projecting massive data sets.

IEEE Trans Vis Comput Graph. 2010 Nov-Dec;16(6):1281-90. doi: 10.1109/TVCG.2010.207.

Least square projection: a fast high-precision multidimensional projection technique and its application to document mapping.

IEEE Trans Vis Comput Graph. 2008 May-Jun;14(3):564-75. doi: 10.1109/TVCG.2007.70443.

Congruent qualitative behavior of complete and reconstructed phase space trajectories from biomolecular dynamics simulation.

Proteins. 2002 Apr 1;47(1):25-30.

Spatial symmetries in vestibular projections to the uvula-nodulus.

Biol Cybern. 2007 Apr;96(4):439-53. doi: 10.1007/s00422-006-0136-y. Epub 2007 Jan 5.

Local Affine Multidimensional Projection.

IEEE Trans Vis Comput Graph. 2011 Dec;17(12):2563-71. doi: 10.1109/TVCG.2011.220.

Distance approximating dimension reduction of Riemannian manifolds.

IEEE Trans Syst Man Cybern B Cybern. 2010 Feb;40(1):208-17. doi: 10.1109/TSMCB.2009.2025028. Epub 2009 Jul 17.

HiPP: a novel hierarchical point placement strategy and its application to the exploration of document collections.

IEEE Trans Vis Comput Graph. 2008 Nov-Dec;14(6):1229-36. doi: 10.1109/TVCG.2008.138.

TopoMap: A 0-dimensional Homology Preserving Projection of High-Dimensional Data.

IEEE Trans Vis Comput Graph. 2021 Feb;27(2):561-571. doi: 10.1109/TVCG.2020.3030441. Epub 2021 Jan 28.

Incremental online learning in high dimensions.

Neural Comput. 2005 Dec;17(12):2602-34. doi: 10.1162/089976605774320557.

Topographic mapping of large dissimilarity data sets.

Neural Comput. 2010 Sep 1;22(9):2229-84. doi: 10.1162/NECO_a_00012.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

两相映射用于投影大数据集。

Two-phase mapping for projecting massive data sets.

机构信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献