Peng Yuxiang, Fang Jie, Li Bingxiang
School of Economics and Management, Xi'an University of Technology, Xi'an, 710048, China.
School of Telecommunication and Information Engineering, Xi'an University of Posts and Telecommunication, Xi'an, 710121, China.
Sci Rep. 2024 Oct 27;14(1):25673. doi: 10.1038/s41598-024-77025-1.
Recently, social media has gradually become an important topic in public opinion analysis field. Among social media, Microblog is one of the most important platforms because it is short, convenient, mobile and instantaneous. Social media microblog recognition well reflects the attitudes from an enormous big colony to a specific incident, either positive or negative, which can be used for deriving competitive intelligence, marketing strategies, detecting depression and so on. However, the existing methods usually use only text or image from internet but not take advantages of their complementary information to finalize the recognition, it limits the performance and robustness of the algorithms. In this paper, we present a collaborate decision network (CDN) based on cross-modal attention to exploit the discriminative attributes of multi modalities by data- and knowledge joint driven strategy in depth, and further improve the recognition performance. In addition, we collect and construct a visual-text microblog recognition dataset with 2854 samples to support the subsequent research of related fields. Finally, experimental reuslts on the collected dataset show the effectiveness and superiority of the proposed CDN.
近年来,社交媒体逐渐成为舆情分析领域的一个重要话题。在社交媒体中,微博是最重要的平台之一,因为它具有简短、便捷、可移动和即时性的特点。社交媒体微博识别很好地反映了大量群体对特定事件的态度,无论是积极的还是消极的,这些态度可用于获取竞争情报、制定营销策略、检测抑郁症等。然而,现有方法通常仅使用来自互联网的文本或图像,而没有利用它们的互补信息来完成识别,这限制了算法的性能和鲁棒性。在本文中,我们提出了一种基于跨模态注意力的协作决策网络(CDN),通过数据和知识联合驱动策略深入挖掘多模态的判别属性,进一步提高识别性能。此外,我们收集并构建了一个包含2854个样本的视觉文本微博识别数据集,以支持相关领域的后续研究。最后,在收集到的数据集上的实验结果表明了所提出的CDN的有效性和优越性。