Plumb Gregory, Terhorst Jonathan, Sankararaman Sriram, Talwalkar Ameet
Carnegie Mellon University.
University of Michigan.
Proc Mach Learn Res. 2020 Jul;119:7762-7771.
A common workflow in data exploration is to learn a low-dimensional representation of the data, identify groups of points in that representation, and examine the differences between the groups to determine what they represent. We treat this workflow as an interpretable machine learning problem by leveraging the model that learned the low-dimensional representation to help identify the key differences between the groups. To solve this problem, we introduce a new type of explanation, a Global Counterfactual Explanation (GCE), and our algorithm, Transitive Global Translations (TGT), for computing GCEs. TGT identifies the differences between each pair of groups using compressed sensing but constrains those pairwise differences to be consistent among all of the groups. Empirically, we demonstrate that TGT is able to identify explanations that accurately explain the model while being relatively sparse, and that these explanations match real patterns in the data.
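The abstract's description of TGT maps naturally onto a small optimization problem. The sketch below is a minimal illustration under simplifying assumptions, not the paper's implementation: the learned representation is replaced by a fixed linear map `W`, each group is summarized by its latent mean, transitivity is enforced by parameterizing every group with a single translation relative to a reference group, and sparsity comes from an l1 subgradient penalty. All variable names, the synthetic data, and the optimizer here are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setup -- every name below is illustrative, not from the paper's code.
# A fixed linear map W stands in for the learned low-dimensional
# representation; three synthetic groups differ along a few input features.
d_in, d_lat, n = 20, 2, 100
W = rng.normal(size=(d_lat, d_in)) / np.sqrt(d_in)
base = rng.normal(size=(n, d_in))
groups = {
    0: base,
    1: base + 2.0 * np.eye(d_in)[3],   # group 1: feature 3 shifted up
    2: base - 1.5 * np.eye(d_in)[7],   # group 2: feature 7 shifted down
}
latent_mean = {g: (X @ W.T).mean(axis=0) for g, X in groups.items()}

# Parameterize each group g by one translation t[g] relative to a reference
# group (t[0] is pinned to zero), so the explanation for any pair is
# delta[a -> b] = t[b] - t[a].  This makes the pairwise explanations
# consistent (transitive) by construction.
t = np.zeros((3, d_in))
l1, lr = 0.05, 0.05
for _ in range(1000):
    grad = np.zeros_like(t)
    for a in range(3):
        for b in range(3):
            if a == b:
                continue
            # Residual of mapping group a's latent mean onto group b's.
            resid = latent_mean[a] + W @ (t[b] - t[a]) - latent_mean[b]
            g = W.T @ resid
            grad[b] += g
            grad[a] -= g
    # Subgradient step; the l1 term plays the compressed-sensing role of
    # selecting a sparse translation among the many that fit the latent shift.
    t -= lr * (grad + l1 * np.sign(t))
    t[0] = 0.0  # keep the reference group fixed

print("delta 0 -> 1:", np.round(t[1] - t[0], 2))
print("delta 0 -> 2:", np.round(t[2] - t[0], 2))
print("delta 1 -> 2:", np.round(t[2] - t[1], 2))  # = (0 -> 2) - (0 -> 1)
```

Because delta[a -> b] is defined as t[b] - t[a], explanations composed around any cycle of groups cancel exactly, which is the consistency property the abstract describes; the actual TGT objective operates on a trained (generally nonlinear) encoder rather than on latent means under a linear map.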