Ferraz Mariana Sacrini Ayres, Kihara Alexandre Hiroaki
Centro de Matemática, Computação e Cognição (CMCC), Universidade Federal do ABC (UFABC), São Bernardo do Campo, São Paulo, Brazil.
Phys Rev E. 2022 Apr;105(4-1):044101. doi: 10.1103/PhysRevE.105.044101.
The enormous amount of currently available data demands efforts to extract meaningful information. For this purpose, different measurements are applied, including Shannon's entropy, permutation entropy, and the Lempel-Ziv complexity. These methods have been used in many applications, such as pattern recognition, series classification, and several other areas (e.g., physical, financial, and biomedical). Data in these applications are often presented in binary series with temporal correlations. Herein, we compare the measures of information entropy in binary series conveying short- and long-range temporal correlations characterized by the Hurst exponent H. Combining numerical and analytical approaches, we scrutinize different methods that were not efficient in detecting temporal correlations. To surpass this limitation, we propose a measure called the binary permutation index (BPI). We will demonstrate that BPI efficiently discriminates patterns embedded in the series, offering advantages over previous methods. Subsequently, we collect stock market time series and rain precipitation data as well as perform in vivo electrophysiological recordings in the hippocampus of an experimental animal model of temporal lobe epilepsy, in which the BPI application in both public open source and experimental data is demonstrated. An index is proposed to evaluate information entropy, allowing the ability to discriminate randomness and extract meaningful information in binary time series.
当前可用的数据量巨大,需要努力提取有意义的信息。为此,人们应用了不同的测量方法,包括香农熵、排列熵和莱姆尔-齐夫复杂度。这些方法已被用于许多应用中,如模式识别、序列分类以及其他几个领域(如物理、金融和生物医学)。这些应用中的数据通常以具有时间相关性的二进制序列形式呈现。在此,我们比较了以赫斯特指数H为特征的、传达短期和长期时间相关性的二进制序列中的信息熵度量。结合数值和分析方法,我们仔细研究了在检测时间相关性方面效率不高的不同方法。为了克服这一局限性,我们提出了一种称为二进制排列指数(BPI)的度量。我们将证明BPI能够有效地区分序列中嵌入的模式,比以前的方法具有优势。随后,我们收集了股票市场时间序列和降雨数据,并在颞叶癫痫实验动物模型的海马体中进行了体内电生理记录,展示了BPI在公开开源数据和实验数据中的应用。我们提出了一个评估信息熵的指标,使其能够区分随机性并提取二进制时间序列中的有意义信息。