• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

确定性信息瓶颈

The Deterministic Information Bottleneck.

作者信息

Strouse D J, Schwab David J

机构信息

Department of Physics, Princeton University, Princeton, NJ 08544, U.S.A.

Department of Physics, Northwestern University, Evanston, IL 60208, U.S.A.

出版信息

Neural Comput. 2017 Jun;29(6):1611-1630. doi: 10.1162/NECO_a_00961. Epub 2017 Apr 14.

DOI:10.1162/NECO_a_00961
PMID:28410050
Abstract

Lossy compression and clustering fundamentally involve a decision about which features are relevant and which are not. The information bottleneck method (IB) by Tishby, Pereira, and Bialek ( 1999 ) formalized this notion as an information-theoretic optimization problem and proposed an optimal trade-off between throwing away as many bits as possible and selectively keeping those that are most important. In the IB, compression is measured by mutual information. Here, we introduce an alternative formulation that replaces mutual information with entropy, which we call the deterministic information bottleneck (DIB) and argue better captures this notion of compression. As suggested by its name, the solution to the DIB problem turns out to be a deterministic encoder, or hard clustering, as opposed to the stochastic encoder, or soft clustering, that is optimal under the IB. We compare the IB and DIB on synthetic data, showing that the IB and DIB perform similarly in terms of the IB cost function, but that the DIB significantly outperforms the IB in terms of the DIB cost function. We also empirically find that the DIB offers a considerable gain in computational efficiency over the IB, over a range of convergence parameters. Our derivation of the DIB also suggests a method for continuously interpolating between the soft clustering of the IB and the hard clustering of the DIB.

摘要

有损压缩和聚类从根本上涉及到一个关于哪些特征相关、哪些特征不相关的决策。Tishby、Pereira和Bialek(1999)提出的信息瓶颈方法(IB)将这一概念形式化为一个信息论优化问题,并提出了在尽可能丢弃更多比特与有选择地保留最重要比特之间的最优权衡。在IB中,压缩通过互信息来衡量。在这里,我们引入一种替代公式,用熵取代互信息,我们将其称为确定性信息瓶颈(DIB),并认为它能更好地捕捉这种压缩概念。顾名思义,DIB问题的解决方案是一个确定性编码器,即硬聚类,这与在IB下最优的随机编码器(即软聚类)形成对比。我们在合成数据上比较了IB和DIB,结果表明,在IB成本函数方面,IB和DIB的表现相似,但在DIB成本函数方面,DIB明显优于IB。我们还通过实验发现,在一系列收敛参数范围内,DIB在计算效率上比IB有显著提升。我们对DIB的推导还提出了一种在IB的软聚类和DIB的硬聚类之间进行连续插值的方法。

相似文献

1
The Deterministic Information Bottleneck.确定性信息瓶颈
Neural Comput. 2017 Jun;29(6):1611-1630. doi: 10.1162/NECO_a_00961. Epub 2017 Apr 14.
2
The Information Bottleneck and Geometric Clustering.信息瓶颈与几何聚类
Neural Comput. 2019 Mar;31(3):596-612. doi: 10.1162/neco_a_01136. Epub 2018 Oct 12.
3
Pareto-Optimal Clustering with the Primal Deterministic Information Bottleneck.基于原始确定性信息瓶颈的帕累托最优聚类
Entropy (Basel). 2022 May 30;24(6):771. doi: 10.3390/e24060771.
4
Feature characterization in fMRI data: the Information Bottleneck approach.功能磁共振成像数据中的特征表征:信息瓶颈方法。
Med Image Anal. 2004 Dec;8(4):403-19. doi: 10.1016/j.media.2004.09.001.
5
Information bottleneck based incremental fuzzy clustering for large biomedical data.基于信息瓶颈的大型生物医学数据增量模糊聚类
J Biomed Inform. 2016 Aug;62:48-58. doi: 10.1016/j.jbi.2016.05.009. Epub 2016 May 31.
6
Embo: a Python package for empirical data analysis using the Information Bottleneck.Embo:一个使用信息瓶颈进行实证数据分析的Python包。
J Open Res Softw. 2021;9(1). doi: 10.5334/jors.322. Epub 2021 May 31.
7
The Convex Information Bottleneck Lagrangian.凸信息瓶颈拉格朗日函数。
Entropy (Basel). 2020 Jan 14;22(1):98. doi: 10.3390/e22010098.
8
Pareto-Optimal Data Compression for Binary Classification Tasks.用于二元分类任务的帕累托最优数据压缩
Entropy (Basel). 2019 Dec 19;22(1):7. doi: 10.3390/e22010007.
9
Learning Representations for Neural Network-Based Classification Using the Information Bottleneck Principle.使用信息瓶颈原理学习基于神经网络的分类表示。
IEEE Trans Pattern Anal Mach Intell. 2020 Sep;42(9):2225-2239. doi: 10.1109/TPAMI.2019.2909031. Epub 2019 Apr 2.
10
A Survey on Information Bottleneck.关于信息瓶颈的一项调查
IEEE Trans Pattern Anal Mach Intell. 2024 Aug;46(8):5325-5344. doi: 10.1109/TPAMI.2024.3366349. Epub 2024 Jul 2.

引用本文的文献

1
Spatiotemporal feature learning for actin dynamics.肌动蛋白动力学的时空特征学习
PLoS One. 2025 Mar 5;20(3):e0318036. doi: 10.1371/journal.pone.0318036. eCollection 2025.
2
Exploring the Trade-Off in the Variational Information Bottleneck for Regression with a Single Training Run.单次训练运行下探索回归变分信息瓶颈中的权衡
Entropy (Basel). 2024 Nov 30;26(12):1043. doi: 10.3390/e26121043.
3
A Survey on Error Exponents in Distributed Hypothesis Testing: Connections with Information Theory, Interpretations, and Applications.
分布式假设检验中误差指数的综述:与信息论的联系、解释及应用
Entropy (Basel). 2024 Jul 12;26(7):596. doi: 10.3390/e26070596.
4
On Neural Networks Fitting, Compression, and Generalization Behavior via Information-Bottleneck-like Approaches.基于类信息瓶颈方法的神经网络拟合、压缩与泛化行为研究
Entropy (Basel). 2023 Jul 14;25(7):1063. doi: 10.3390/e25071063.
5
Embo: a Python package for empirical data analysis using the Information Bottleneck.Embo:一个使用信息瓶颈进行实证数据分析的Python包。
J Open Res Softw. 2021;9(1). doi: 10.5334/jors.322. Epub 2021 May 31.
6
A Generalized Information-Theoretic Framework for the Emergence of Hierarchical Abstractions in Resource-Limited Systems.资源受限系统中分层抽象出现的广义信息论框架。
Entropy (Basel). 2022 Jun 9;24(6):809. doi: 10.3390/e24060809.
7
Pareto-Optimal Clustering with the Primal Deterministic Information Bottleneck.基于原始确定性信息瓶颈的帕累托最优聚类
Entropy (Basel). 2022 May 30;24(6):771. doi: 10.3390/e24060771.
8
Discrete Infomax Codes for Supervised Representation Learning.用于监督表示学习的离散信息最大化编码
Entropy (Basel). 2022 Apr 2;24(4):501. doi: 10.3390/e24040501.
9
Gaussian Information Bottleneck and the Non-Perturbative Renormalization Group.高斯信息瓶颈与非微扰重整化群
New J Phys. 2022 Mar;24(3). doi: 10.1088/1367-2630/ac395d. Epub 2022 Mar 9.
10
Measuring the repertoire of age-related behavioral changes in Drosophila melanogaster.测量黑腹果蝇与年龄相关的行为变化的 repertoire。
PLoS Comput Biol. 2022 Feb 24;18(2):e1009867. doi: 10.1371/journal.pcbi.1009867. eCollection 2022 Feb.