Deng Haisheng, Alkhayyat Ahmed
Xijing University, Xi'an, 710123, Shaanxi, China.
College of Technical Engineering, The Islamic University, Najaf, Iraq.
Sci Rep. 2025 May 17;15(1):17206. doi: 10.1038/s41598-025-01834-1.
The use of machine learning to analyze sentiments has attained considerable interest in the past few years. The task of analyzing sentiments has becfigome increasingly important and challenging. Due to the specific attributes of this type of data, including length of text, spelling errors, and abbreviations, unconventional methods and multiple steps are required for effectively analyzing sentiment in such a complex environment. In this research, two distinct word embedding models, GloVe and Word2Vec, were utilized for vectorization. To enhance the performance long short-term memory (LSTM), the model was optimized using the amended dwarf mongoose optimization (ADMO) algorithm, leading to improvements in the hyperparameters. The LSTM-ADMO achieved the accuracy values of 97.74 and 97.47 using Word2Vec and GloVe, respectively on IMDB, and it could gain the accuracy values of 97.84 and 97.51 using Word2Vec and GloVe, respectively on SST-2. In general, it was determined that the proposed model significantly outperformed other models, and there was very little difference between the two different word embedding techniques.
在过去几年中,使用机器学习来分析情感已经引起了相当大的关注。分析情感的任务变得越来越重要且具有挑战性。由于这类数据的特定属性,包括文本长度、拼写错误和缩写,在如此复杂的环境中有效分析情感需要非常规方法和多个步骤。在本研究中,使用了两种不同的词嵌入模型GloVe和Word2Vec进行向量化。为了提高长短期记忆(LSTM)的性能,使用改进的矮猫鼬优化(ADMO)算法对模型进行了优化,从而改进了超参数。LSTM-ADMO在IMDB上分别使用Word2Vec和GloVe时,准确率分别达到97.74和97.47,在SST-2上分别使用Word2Vec和GloVe时,准确率分别达到97.84和97.51。总体而言,确定所提出的模型明显优于其他模型,并且两种不同的词嵌入技术之间差异很小。