Tiwari Dimple, Nagpal Bharti
Research Scholar, Ambedkar Institute of Advanced Communication Technologies and Research (GGSIPU), New Delhi, India.
NSUT East Campus (Formerly Ambedkar Institute of Advanced Communication Technologies and Research), New Delhi, India.
New Gener Comput. 2022;40(4):1165-1202. doi: 10.1007/s00354-022-00182-2. Epub 2022 Jul 11.
Social media materialized as an influential platform that allows people to share their views on global and local issues. Sentiment analysis can handle these massive amounts of unstructured reviews and convert them into meaningful opinions. Undoubtedly, COVID-19 originated as the enormous challenge across the world that physically and financially bruted humankind. Meanwhile, farmers' protests shook up the world against three pieces of legislation passed by the Indian government. Hence, an artificial intelligence-based sentiment model is needed for suggesting the right direction toward outbreaks. Although Deep Neural Network (DNN) gained popularity in sentiment analysis applications, these still have a limitation of sequential training, high-dimension feature space, and equal feature importance distribution. In addition, inaccurate polarity scoring and utility-based topic modeling are other challenging aspects of sentiment analysis. It motivates us to propose a Knowledge-Enriched Attention-based Hybrid Transformer (KEAHT) model by enriching the explicit knowledge of Latent Dirichlet Allocation (LDA) topic modeling and lexicalized domain ontology. A pre-trained Bidirectional Encoder Representation from Transformer (BERT) is employed to train within a minimum training corpus. It provides the facility of attention mechanism and can solve complex text problems accurately. A comparative study with existing baselines and recent hybrid models affirms the credibility of the proposed KEAHT in the field of Natural Language Processing (NLP). This model emphasizes artificial intelligence's role in handling the situation of the global pandemic and democratic dispute in a country. Furthermore, two benchmark datasets, namely "COVID-19-Vaccine-Labelled-Tweets" and "Indian-Farmer-Protest-Labelled-Tweets", are also constructed to accommodate future researchers for outlining the essential facts associated with the outbreaks.
社交媒体成为一个有影响力的平台,人们可以在上面分享对全球和本地问题的看法。情感分析能够处理这些大量的非结构化评论,并将其转化为有意义的观点。毫无疑问,新冠疫情成为全球范围内的巨大挑战,在身体和经济上重创了人类。与此同时,农民抗议活动震撼了世界,抗议印度政府通过的三项立法。因此,需要一个基于人工智能的情感模型来为应对疫情指明正确方向。尽管深度神经网络(DNN)在情感分析应用中颇受欢迎,但这些应用仍存在顺序训练、高维特征空间和特征重要性分布均等的局限性。此外,情感分析的其他具有挑战性的方面包括极性评分不准确和基于效用的主题建模。这促使我们通过丰富潜在狄利克雷分配(LDA)主题建模和词汇化领域本体的显性知识,提出一种基于知识增强注意力的混合变压器(KEAHT)模型。使用预训练的来自变压器的双向编码器表示(BERT)在最小训练语料库中进行训练。它提供了注意力机制功能,能够准确解决复杂的文本问题。与现有基线和近期混合模型的比较研究证实了所提出的KEAHT在自然语言处理(NLP)领域的可信度。该模型强调了人工智能在应对全球疫情和一个国家的民主争端情况中的作用。此外,还构建了两个基准数据集,即“新冠疫苗标记推文”和“印度农民抗议标记推文”,以方便未来的研究人员勾勒与疫情相关的基本事实。