Lee Byounghwa, Lee Jung-Hoon, Lee Sungyup, Kim Cheol Ho
CybreBrain Research Section, Electronics and Telecommunications Research Institute, Daejeon, Republic of Korea.
Front Comput Neurosci. 2023 Dec 12;17:1292842. doi: 10.3389/fncom.2023.1292842. eCollection 2023.
Burst patterns, characterized by their temporal heterogeneity, have been observed across a wide range of domains, encompassing event sequences from neuronal firing to various facets of human activities. Recent research on predicting event sequences leveraged a Transformer based on the Hawkes process, incorporating a self-attention mechanism to capture long-term temporal dependencies. To effectively handle bursty temporal patterns, we propose a Burst and Memory-aware Transformer (BMT) model, designed to explicitly address temporal heterogeneity. The BMT model embeds the burstiness and memory coefficient into the self-attention module, enhancing the learning process with insights derived from the bursty patterns. Furthermore, we employed a novel loss function designed to optimize the burstiness and memory coefficient values, as well as their corresponding discretized one-hot vectors, both individually and jointly. Numerical experiments conducted on diverse synthetic and real-world datasets demonstrated the outstanding performance of the BMT model in terms of accurately predicting event times and intensity functions compared to existing models and control groups. In particular, the BMT model exhibits remarkable performance for temporally heterogeneous data, such as those with power-law inter-event time distributions. Our findings suggest that the incorporation of burst-related parameters assists the Transformer in comprehending heterogeneous event sequences, leading to an enhanced predictive performance.
爆发模式以其时间异质性为特征,已在广泛的领域中被观察到,涵盖从神经元放电到人类活动各个方面的事件序列。最近关于预测事件序列的研究利用了基于霍克斯过程的Transformer,纳入了自注意力机制以捕捉长期时间依赖性。为了有效处理突发的时间模式,我们提出了一种突发和记忆感知Transformer(BMT)模型,旨在明确解决时间异质性问题。BMT模型将突发性和记忆系数嵌入自注意力模块,利用从突发模式中获得的见解增强学习过程。此外,我们采用了一种新颖的损失函数,旨在分别和联合优化突发性和记忆系数值及其相应的离散化独热向量。在各种合成和真实世界数据集上进行的数值实验表明,与现有模型和对照组相比,BMT模型在准确预测事件时间和强度函数方面表现出色。特别是,BMT模型对于具有时间异质性的数据,如具有幂律事件间时间分布的数据,表现出卓越的性能。我们的研究结果表明,纳入与突发相关的参数有助于Transformer理解异质事件序列,从而提高预测性能。