Holtzman Roi, Giulini Marco, Potestio Raffaello
Department of Physics of Complex Systems, Weizmann Institute of Science, Rehovot 76100, Israel.
Physics Department, University of Trento, via Sommarive, 14 I-38123 Trento, Italy.
Phys Rev E. 2022 Oct;106(4-1):044101. doi: 10.1103/PhysRevE.106.044101.
Complex systems are characterized by a tight, nontrivial interplay of their constituents, which gives rise to a multiscale spectrum of emergent properties. In this scenario, it is practically and conceptually difficult to identify those degrees of freedom that mostly determine the behavior of the system and separate them from less prominent players. Here, we tackle this problem making use of three measures of statistical information: Resolution, relevance, and mapping entropy. We address the links existing among them, taking the moves from the established relation between resolution and relevance and further developing novel connections between resolution and mapping entropy; by these means we can identify, in a quantitative manner, the number and selection of degrees of freedom of the system that preserve the largest information content about the generative process that underlies an empirical dataset. The method, which is implemented in a freely available software, is fully general, as it is shown through the application to three very diverse systems, namely, a toy model of independent binary spins, a coarse-grained representation of the financial stock market, and a fully atomistic simulation of a protein.
复杂系统的特征在于其组成部分之间紧密且不平凡的相互作用,这产生了多尺度的涌现性质谱。在这种情况下,从实际和概念上讲,很难识别那些最能决定系统行为的自由度,并将它们与不太突出的因素区分开来。在这里,我们利用三种统计信息度量来解决这个问题:分辨率、相关性和映射熵。我们探讨它们之间存在的联系,从分辨率和相关性之间已确立的关系出发,并进一步发展分辨率和映射熵之间的新联系;通过这些方法,我们可以定量地确定系统中保留关于构成经验数据集基础的生成过程最大信息量的自由度的数量和选择。该方法在一个免费软件中实现,具有完全通用性,这通过应用于三个非常不同的系统得到了证明,即独立二元自旋的玩具模型、金融股票市场的粗粒度表示以及蛋白质的全原子模拟。