College of Computer Science, Chongqing University, Chongqing, China.
Department of Computing, Hong Kong Polytechnic University, Hongkong, China.
Neural Netw. 2023 May;162:162-174. doi: 10.1016/j.neunet.2023.02.008. Epub 2023 Feb 17.
Sentiment analysis refers to the mining of textual context, which is conducted with the aim of identifying and extracting subjective opinions in textual materials. However, most existing methods neglect other important modalities, e.g., the audio modality, which can provide intrinsic complementary knowledge for sentiment analysis. Furthermore, much work on sentiment analysis cannot continuously learn new sentiment analysis tasks or discover potential correlations among distinct modalities. To address these concerns, we propose a novel Lifelong Text-Audio Sentiment Analysis (LTASA) model to continuously learn text-audio sentiment analysis tasks, which effectively explores intrinsic semantic relationships from both intra-modality and inter-modality perspectives. More specifically, a modality-specific knowledge dictionary is developed for each modality to obtain shared intra-modality representations among various text-audio sentiment analysis tasks. Additionally, based on information dependence between text and audio knowledge dictionaries, a complementarity-aware subspace is developed to capture the latent nonlinear inter-modality complementary knowledge. To sequentially learn text-audio sentiment analysis tasks, a new online multi-task optimization pipeline is designed. Finally, we verify our model on three common datasets to show its superiority. Compared with some baseline representative methods, the capability of the LTASA model is significantly boosted in terms of five measurement indicators.
情感分析是指对文本语境的挖掘,旨在识别和提取文本材料中的主观意见。然而,大多数现有的方法忽略了其他重要的模态,例如音频模态,它可以为情感分析提供内在的补充知识。此外,情感分析的许多工作无法持续学习新的情感分析任务或发现不同模态之间的潜在相关性。为了解决这些问题,我们提出了一种新的终身文本-音频情感分析(LTASA)模型,以持续学习文本-音频情感分析任务,从内-外-内-外视角有效地探索内在语义关系。更具体地说,为每个模态开发了一个模态特定的知识字典,以获得各种文本-音频情感分析任务之间的共享内-内模态表示。此外,基于文本和音频知识字典之间的信息依赖关系,开发了一个互补感知子空间,以捕获潜在的非线性内-外模态互补知识。为了顺序学习文本-音频情感分析任务,设计了一个新的在线多任务优化管道。最后,我们在三个常见数据集上验证了我们的模型,以展示其优越性。与一些基线代表性方法相比,LTASA 模型在五个测量指标方面的性能显著提高。