A General Framework for Comparing Embedding Visualizations Across Class-Label Hierarchies.

Authors

Manz Trevor, Lekschas Fritz, Greene Evan, Finak Greg, Gehlenborg Nils

Publication information

IEEE Trans Vis Comput Graph. 2025 Jan;31(1):283-293. doi: 10.1109/TVCG.2024.3456370. Epub 2024 Dec 3.

DOI:10.1109/TVCG.2024.3456370

PMID:39255153

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11875997/

Abstract

Projecting high-dimensional vectors into two dimensions for visualization, known as embedding visualization, facilitates perceptual reasoning and interpretation. Comparing multiple embedding visualizations drives decision-making in many domains, but traditional comparison methods are limited by a reliance on direct point correspondences. This requirement precludes comparisons without point correspondences, such as two different datasets of annotated images, and fails to capture meaningful higher-level relationships among point groups. To address these shortcomings, we propose a general framework for comparing embedding visualizations based on shared class labels rather than individual points. Our approach partitions points into regions corresponding to three key class concepts (confusion, neighborhood, and relative size) to characterize intra- and inter-class relationships. Informed by a preliminary user study, we implemented our framework using perceptual neighborhood graphs to define these regions and introduced metrics to quantify each concept. We demonstrate the generality of our framework with usage scenarios from machine learning and single-cell biology, highlighting our metrics' ability to draw insightful comparisons across label hierarchies. To assess the effectiveness of our approach, we conducted an evaluation study with five machine learning researchers and six single-cell biologists using an interactive and scalable prototype built with Python, JavaScript, and Rust. Our metrics enable more structured comparisons through visual guidance and increased participants' confidence in their findings.
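
The label-based comparison described in the abstract can be illustrated with a minimal sketch. It assumes a brute-force k-nearest-neighbor graph as a simple stand-in for the perceptual neighborhood graphs, and the functions `label_confusion`, `relative_size`, and `compare_confusion` are hypothetical illustrations, not the metrics defined in the paper. The sketch only shows that per-label summaries can be compared across two embeddings that share class labels but have no point-to-point correspondence.

```python
from collections import Counter, defaultdict
from math import dist


def knn_indices(points, k):
    """For each point, return the indices of its k nearest neighbors (brute force)."""
    neighbors = []
    for i, p in enumerate(points):
        others = (j for j in range(len(points)) if j != i)
        order = sorted(others, key=lambda j: dist(p, points[j]))
        neighbors.append(order[:k])
    return neighbors


def label_confusion(points, labels, k=1):
    """Hypothetical confusion score: share of a label's neighbor links that reach other labels."""
    cross = defaultdict(int)
    total = defaultdict(int)
    for i, nbrs in enumerate(knn_indices(points, k)):
        for j in nbrs:
            total[labels[i]] += 1
            cross[labels[i]] += labels[j] != labels[i]
    return {lab: cross[lab] / total[lab] for lab in total}


def relative_size(labels):
    """Fraction of all points that carry each label."""
    counts = Counter(labels)
    return {lab: c / len(labels) for lab, c in counts.items()}


def compare_confusion(emb_a, emb_b, k=1):
    """Per-label change in confusion (b minus a), restricted to labels present in both embeddings."""
    conf_a = label_confusion(*emb_a, k=k)
    conf_b = label_confusion(*emb_b, k=k)
    shared = sorted(set(conf_a) & set(conf_b))
    return {lab: conf_b[lab] - conf_a[lab] for lab in shared}


# Toy data (made up for illustration): the two embeddings have different
# numbers of points, so no point correspondence exists between them.
points_a = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
labels_a = ["cat", "cat", "cat", "dog", "dog", "dog"]
points_b = [(0, 0), (1, 0), (0.5, 0.1), (5, 5), (5, 6)]
labels_b = ["cat", "cat", "dog", "dog", "dog"]

delta = compare_confusion((points_a, labels_a), (points_b, labels_b), k=1)
print(delta)
print(relative_size(labels_b))
```

In this toy data the two classes are cleanly separated in the first embedding and partly intermixed in the second, so `delta` is positive for both labels. A real analysis would substitute the neighborhood graph and metric definitions from the paper.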

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a03/11875997/c17fc848c346/nihms-2039886-f0001.jpg
