ArabBert-LSTM: improving Arabic sentiment analysis based on transformer model and Long Short-Term Memory.

Author information

Alosaimi Wael, Saleh Hager, Hamzah Ali A, El-Rashidy Nora, Alharb Abdullah, Elaraby Ahmed, Mostafa Sherif

Affiliations

Department of Information Technology, College of Computers and Information Technology, Taif University, Taif, Saudi Arabia.

Faculty of Computers and Artificial Intelligence, South Valley University, Hurghada, Egypt.

Publication information

Front Artif Intell. 2024 Jul 2;7:1408845. doi: 10.3389/frai.2024.1408845. eCollection 2024.

DOI:10.3389/frai.2024.1408845

PMID:39015364

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11250580/

Abstract

Sentiment analysis, also referred to as opinion mining, plays a significant role in automating the identification of negative, positive, or neutral sentiments expressed in textual data. The proliferation of social networks, review sites, and blogs has made these platforms valuable resources for mining opinions. Sentiment analysis is applied across many domains and languages, including English and Arabic. However, Arabic presents unique challenges because of its complex morphology, which is characterized by inflectional and derivational patterns, and sentiment analysis techniques for Arabic text must account for this complexity. This paper proposes a model built from a transformer model and deep learning (DL) techniques. Word embeddings are produced by the Transformer-based Model for Arabic Language Understanding (AraBERT). The output of AraBERT is then fed into a Long Short-Term Memory (LSTM) model, followed by feedforward neural networks and an output layer. AraBERT is used to capture rich contextual information, and the LSTM is used to enhance sequence modeling and retain long-term dependencies within the text. We compared the proposed model with machine learning (ML) and DL algorithms, and with different vectorization techniques, namely term frequency-inverse document frequency (TF-IDF), AraBERT, Continuous Bag-of-Words (CBOW), and Skip-gram, on four Arabic benchmark datasets. Extensive experiments on these Arabic sentiment analysis datasets demonstrate the effectiveness of our approach. The results show significant improvements in sentiment analysis accuracy and highlight the potential of transformer models for Arabic sentiment analysis, contributing to more accurate and reliable sentiment analysis of Arabic text. The proposed framework achieves an accuracy of over 97% in sentiment classification.
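The data flow described in the abstract (encoder output, then an LSTM, then a feedforward layer and an output layer) can be sketched in a few lines of dependency-free Python. This is only an illustration under stated assumptions, not the authors' implementation: random vectors stand in for AraBERT's token representations, the weights are untrained, biases are omitted, the ReLU activation is assumed, and every name (`TinyLSTMClassifier`, `predict_proba`, the layer sizes) is hypothetical. The three output classes mirror the negative, positive, and neutral labels mentioned above.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]


def random_matrix(rows, cols, rng):
    """Small random weights; a real model would learn these by training."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class TinyLSTMClassifier:
    """Hypothetical sketch: encoder vectors -> LSTM -> feedforward -> softmax."""

    def __init__(self, input_dim, hidden_dim, ff_dim, num_classes, seed=0):
        rng = random.Random(seed)
        self.hidden_dim = hidden_dim
        # One weight matrix per gate, applied to the concatenation [h_prev, x_t].
        # Biases are omitted to keep the sketch short (an assumption).
        self.gates = {
            name: random_matrix(hidden_dim, hidden_dim + input_dim, rng)
            for name in ("input", "forget", "output", "candidate")
        }
        self.ff = random_matrix(ff_dim, hidden_dim, rng)
        self.out = random_matrix(num_classes, ff_dim, rng)

    def lstm(self, sequence):
        """Run the standard LSTM recurrence and return the final hidden state."""
        h = [0.0] * self.hidden_dim
        c = [0.0] * self.hidden_dim
        for x in sequence:
            hx = h + x  # list concatenation: [h_prev, x_t]
            i = [sigmoid(v) for v in matvec(self.gates["input"], hx)]
            f = [sigmoid(v) for v in matvec(self.gates["forget"], hx)]
            o = [sigmoid(v) for v in matvec(self.gates["output"], hx)]
            g = [math.tanh(v) for v in matvec(self.gates["candidate"], hx)]
            # Cell state keeps part of the old memory and adds new candidates.
            c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
            h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
        return h

    def predict_proba(self, sequence):
        h = self.lstm(sequence)
        hidden = [max(0.0, v) for v in matvec(self.ff, h)]  # assumed ReLU
        logits = matvec(self.out, hidden)
        peak = max(logits)  # subtract the max for a numerically stable softmax
        exps = [math.exp(v - peak) for v in logits]
        total = sum(exps)
        return [e / total for e in exps]


# Stand-in for AraBERT output: 5 tokens, each an 8-dimensional vector.
rng = random.Random(1)
fake_encoder_output = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(5)]

model = TinyLSTMClassifier(input_dim=8, hidden_dim=6, ff_dim=4, num_classes=3)
probs = model.predict_proba(fake_encoder_output)
print(probs)  # one probability per sentiment class
```

In practice the stand-in vectors would be replaced by the per-token hidden states of a pretrained encoder, and the weights would be trained with a deep learning framework rather than left at their random initial values.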

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d591/11250580/3d1aa1c6ad5c/frai-07-1408845-g0001.jpg
