• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通用并行图形处理单元线性复杂度t-SNE优化

GPGPU Linear Complexity t-SNE Optimization.

作者信息

Pezzotti Nicola, Thijssen Julian, Mordvintsev Alexander, Hollt Thomas, Van Lew Baldur, Lelieveldt Boudewijn P F, Eisemann Elmar, Vilanova Anna

出版信息

IEEE Trans Vis Comput Graph. 2020 Jan;26(1):1172-1181. doi: 10.1109/TVCG.2019.2934307. Epub 2019 Aug 23.

DOI:10.1109/TVCG.2019.2934307
PMID:31449023
Abstract

In recent years the t-distributed Stochastic Neighbor Embedding (t-SNE) algorithm has become one of the most used and insightful techniques for exploratory data analysis of high-dimensional data. It reveals clusters of high-dimensional data points at different scales while only requiring minimal tuning of its parameters. However, the computational complexity of the algorithm limits its application to relatively small datasets. To address this problem, several evolutions of t-SNE have been developed in recent years, mainly focusing on the scalability of the similarity computations between data points. However, these contributions are insufficient to achieve interactive rates when visualizing the evolution of the t-SNE embedding for large datasets. In this work, we present a novel approach to the minimization of the t-SNE objective function that heavily relies on graphics hardware and has linear computational complexity. Our technique decreases the computational cost of running t-SNE on datasets by orders of magnitude and retains or improves on the accuracy of past approximated techniques. We propose to approximate the repulsive forces between data points by splatting kernel textures for each data point. This approximation allows us to reformulate the t-SNE minimization problem as a series of tensor operations that can be efficiently executed on the graphics card. An efficient implementation of our technique is integrated and available for use in the widely used Google TensorFlow.js, and an open-source C++ library.

摘要

近年来,t分布随机邻域嵌入(t-SNE)算法已成为高维数据探索性数据分析中使用最广泛且最具洞察力的技术之一。它能揭示不同尺度下高维数据点的聚类,同时只需对其参数进行最少的调整。然而,该算法的计算复杂度限制了它在相对较小数据集上的应用。为解决这个问题,近年来已开发出t-SNE的几种改进版本,主要集中在数据点间相似度计算的可扩展性上。然而,在可视化大型数据集的t-SNE嵌入演变时,这些改进不足以实现交互式速率。在这项工作中,我们提出了一种全新的方法来最小化t-SNE目标函数,该方法严重依赖图形硬件且具有线性计算复杂度。我们的技术将在数据集上运行t-SNE的计算成本降低了几个数量级,并保持或提高了以往近似技术的准确性。我们建议通过为每个数据点平铺内核纹理来近似数据点之间的排斥力。这种近似使我们能够将t-SNE最小化问题重新表述为一系列可在图形卡上高效执行的张量运算。我们技术的高效实现已集成到广泛使用的谷歌TensorFlow.js以及一个开源C++库中,可供使用。

相似文献

1
GPGPU Linear Complexity t-SNE Optimization.通用并行图形处理单元线性复杂度t-SNE优化
IEEE Trans Vis Comput Graph. 2020 Jan;26(1):1172-1181. doi: 10.1109/TVCG.2019.2934307. Epub 2019 Aug 23.
2
An Efficient Dual-Hierarchy t-SNE Minimization.一种高效的双层次t-SNE最小化方法。
IEEE Trans Vis Comput Graph. 2022 Jan;28(1):614-622. doi: 10.1109/TVCG.2021.3114817. Epub 2021 Dec 24.
3
Joint t-SNE for Comparable Projections of Multiple High-Dimensional Datasets.用于多个高维数据集可比投影的联合t-SNE
IEEE Trans Vis Comput Graph. 2022 Jan;28(1):623-632. doi: 10.1109/TVCG.2021.3114765. Epub 2021 Dec 24.
4
Heavy-tailed kernels reveal a finer cluster structure in t-SNE visualisations.重尾核在t-SNE可视化中揭示了更精细的聚类结构。
Mach Learn Knowl Discov Databases. 2020;11906:124-139. doi: 10.1007/978-3-030-46150-8_8. Epub 2020 Apr 30.
5
Application of t-SNE to human genetic data.t-SNE在人类遗传数据中的应用。
J Bioinform Comput Biol. 2017 Aug;15(4):1750017. doi: 10.1142/S0219720017500172. Epub 2017 Jun 23.
6
t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections.t-viSNE:t-SNE投影的交互式评估与解读
IEEE Trans Vis Comput Graph. 2020 Aug;26(8):2696-2714. doi: 10.1109/TVCG.2020.2986996. Epub 2020 Apr 13.
7
Analyzing the similarity of samples and genes by MG-PCC algorithm, t-SNE-SS and t-SNE-SG maps.通过 MG-PCC 算法、t-SNE-SS 和 t-SNE-SG 图谱分析样本和基因的相似性。
BMC Bioinformatics. 2018 Dec 17;19(1):512. doi: 10.1186/s12859-018-2495-5.
8
Dimensionality reduction and visualisation of hyperspectral ink data using t-SNE.使用 t-SNE 对高光谱墨水数据进行降维和可视化。
Forensic Sci Int. 2020 Jun;311:110194. doi: 10.1016/j.forsciint.2020.110194. Epub 2020 Feb 12.
9
Conditional t-SNE: more informative t-SNE embeddings.条件t-SNE:更具信息性的t-SNE嵌入
Mach Learn. 2021;110(10):2905-2940. doi: 10.1007/s10994-020-05917-0. Epub 2020 Dec 6.
10
Vibration-Based Structural Health Monitoring Using Piezoelectric Transducers and Parametric -SNE.基于压电传感器和参数-SNE 的振动结构健康监测。
Sensors (Basel). 2020 Mar 19;20(6):1716. doi: 10.3390/s20061716.

引用本文的文献

1
A Spatio-Temporal Joint Diagnosis Framework for Bearing Faults via Graph Convolution and Attention-Enhanced Bidirectional Gated Networks.一种基于图卷积和注意力增强双向门控网络的轴承故障时空联合诊断框架
Sensors (Basel). 2025 Jun 23;25(13):3908. doi: 10.3390/s25133908.
2
Rapid discrimination of spp. and label-free surface enhanced Raman spectroscopy coupled with machine learning algorithms.[物种名称]的快速鉴别以及无标记表面增强拉曼光谱与机器学习算法相结合。 (你提供的原文中“spp.”和“label-free surface enhanced Raman spectroscopy”前面应该有具体物种名称等相关内容,这里翻译是根据现有内容尽量完整呈现意思)
Front Microbiol. 2023 Mar 8;14:1101357. doi: 10.3389/fmicb.2023.1101357. eCollection 2023.
3
Visinity: Visual Spatial Neighborhood Analysis for Multiplexed Tissue Imaging Data.
毗邻性分析:用于多重组织成像数据的可视化空间邻域分析。
IEEE Trans Vis Comput Graph. 2023 Jan;29(1):106-116. doi: 10.1109/TVCG.2022.3209378. Epub 2022 Dec 16.
4
Comprehensive analysis of the potential cuproptosis-related biomarker LIAS that regulates prognosis and immunotherapy of pan-cancers.对潜在的铜死亡相关生物标志物LIAS进行全面分析,该标志物可调节泛癌的预后和免疫治疗。
Front Oncol. 2022 Aug 2;12:952129. doi: 10.3389/fonc.2022.952129. eCollection 2022.
5
Research on E-Commerce Database Marketing Based on Machine Learning Algorithm.基于机器学习算法的电子商务数据库营销研究。
Comput Intell Neurosci. 2022 Jun 29;2022:7973446. doi: 10.1155/2022/7973446. eCollection 2022.
6
MOLGENGO: Finding Novel Molecules with Desired Electronic Properties by Capitalizing on Their Global Optimization.MOLGENGO:通过利用全局优化来寻找具有所需电子特性的新型分子。
ACS Omega. 2021 Oct 5;6(41):27454-27465. doi: 10.1021/acsomega.1c04347. eCollection 2021 Oct 19.
7
Stochastic neighbor embedding as a tool for visualizing the encoding capability of magnetic resonance fingerprinting dictionaries.随机邻居嵌入作为一种用于可视化磁共振指纹字典编码能力的工具。
MAGMA. 2022 Apr;35(2):223-234. doi: 10.1007/s10334-021-00963-8. Epub 2021 Oct 23.
8
A machine learning method for the discovery of minimum marker gene combinations for cell type identification from single-cell RNA sequencing.一种基于机器学习的方法,用于从单细胞 RNA 测序中发现用于细胞类型鉴定的最小标记基因组合。
Genome Res. 2021 Oct;31(10):1767-1780. doi: 10.1101/gr.275569.121. Epub 2021 Jun 4.
9
qSNE: quadratic rate t-SNE optimizer with automatic parameter tuning for large datasets.qSNE:具有自动参数调整的二次速率 t-SNE 优化器,适用于大型数据集。
Bioinformatics. 2020 Dec 22;36(20):5086-5092. doi: 10.1093/bioinformatics/btaa637.