Al-Qudah Dana A, Al-Zoubi Ala' M, Cristea Alexandra I, Merelo-Guervós Juan J, Castillo Pedro A, Faris Hossam
King Abdullah II School for Information Technology, The University of Jordan, Amman, Jordan.
Faculty of Information Technology, Applied Science Private University, Amman, Jordan.
PeerJ Comput Sci. 2025 Jan 9;11:e2370. doi: 10.7717/peerj-cs.2370. eCollection 2025.
As the business world shifts to the web and tremendous amounts of data become available on multilingual mobile applications, new business and research challenges and opportunities have been explored. This research aims to intensify the usage of data analytics, machine learning, and sentiment analysis of textual data to classify customers' reviews, feedback, and ratings of businesses in Jordan's food and restaurant industry. The main methods used in this research were sentiment polarity (to address the challenges posed by businesses to automatically apply text analysis) and bio-metric techniques (to systematically identify users' emotional states, so reviews can be thoroughly understood). The research was extended to deal with reviews in Arabic, dialectic Arabic, and English, with the main focus on the Arabic language, as the application examined (Talabat) is based in Jordan. Arabic and English reviews were collected from the application, and a new model was proposed to sentimentally analyze reviews. The proposed model has four main stages: data collection, data preparation, model building, and model evaluation. The main purpose of this research is to study the problem expressed above using a model of ordinal regression to overcome issues related to misclassification. Additionally, an automatic multi-language prediction approach for online restaurant reviews was proposed by combining the eXtreme gradient boosting (XGBoost) and particle swarm optimization (PSO) techniques for the ordinal regression of these reviews. The proposed PSO-XGB algorithm showed superior results when compared to support vector machine (SVM) and other optimization methods in terms of root mean square error (RMSE) for the English and Arabic datasets. Specifically, for the Arabic dataset, PSO-XGB achieved an RMSE value of 0.7722, whereas PSO-SVM achieved an RSME value of 0.9988.
随着商业世界向网络转移,大量数据可在多语言移动应用程序上获取,新的商业和研究挑战与机遇也得到了探索。本研究旨在加强对文本数据的数据分析、机器学习和情感分析的运用,以对约旦食品和餐饮行业企业的客户评论、反馈及评级进行分类。本研究使用的主要方法是情感极性(以应对企业在自动应用文本分析方面面临的挑战)和生物识别技术(以系统识别用户的情绪状态,从而透彻理解评论)。该研究扩展到处理阿拉伯语、方言阿拉伯语和英语的评论,主要聚焦于阿拉伯语,因为所考察的应用程序(Talabat)总部位于约旦。从该应用程序收集了阿拉伯语和英语评论,并提出了一种新模型用于对评论进行情感分析。所提出的模型有四个主要阶段:数据收集、数据准备、模型构建和模型评估。本研究的主要目的是使用有序回归模型研究上述问题,以克服与错误分类相关的问题。此外,通过结合极端梯度提升(XGBoost)和粒子群优化(PSO)技术对这些评论进行有序回归,提出了一种用于在线餐厅评论的自动多语言预测方法。与支持向量机(SVM)和其他优化方法相比,所提出的PSO-XGB算法在英语和阿拉伯语数据集的均方根误差(RMSE)方面表现出更优的结果。具体而言,对于阿拉伯语数据集,PSO-XGB的RMSE值为0.7722,而PSO-SVM的RSME值为0.9988。