• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于机器学习的方法用于对阿拉伯语推文的远程学习进行情感分析。

A machine learning-based approach for sentiment analysis on distance learning from Arabic Tweets.

作者信息

Almalki Jameel

机构信息

Department of Computer Science, College of Computer in Al-Leith, Umm Al-Qura University, Makkah, Saudi Arabia.

出版信息

PeerJ Comput Sci. 2022 Jul 26;8:e1047. doi: 10.7717/peerj-cs.1047. eCollection 2022.

DOI:10.7717/peerj-cs.1047
PMID:36092011
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9454973/
Abstract

Social media platforms such as Twitter, YouTube, Instagram and Facebook are leading sources of large datasets nowadays. Twitter's data is one of the most reliable due to its privacy policy. Tweets have been used for sentiment analysis and to identify meaningful information within the dataset. Our study focused on the distance learning domain in Saudi Arabia by analyzing Arabic tweets about distance learning. This work proposes a model for analyzing people's feedback using a Twitter dataset in the distance learning domain. The proposed model is based on the Apache Spark product to manage the large dataset. The proposed model uses the Twitter API to get the tweets as raw data. These tweets were stored in the Apache Spark server. A regex-based technique for preprocessing removed retweets, links, hashtags, English words and numbers, usernames, and emojis from the dataset. After that, a Logistic-based Regression model was trained on the pre-processed data. This Logistic Regression model, from the field of machine learning, was used to predict the sentiment inside the tweets. Finally, a Flask application was built for sentiment analysis of the Arabic tweets. The proposed model gives better results when compared to various applied techniques. The proposed model is evaluated on test data to calculate Accuracy, F1 Score, Precision, and Recall, obtaining scores of 91%, 90%, 90%, and 89%, respectively.

摘要

如今,推特、优兔、照片墙和脸书等社交媒体平台是大型数据集的主要来源。由于推特的隐私政策,其数据是最可靠的数据之一。推文已被用于情感分析以及识别数据集中有意义的信息。我们的研究通过分析关于远程学习的阿拉伯语推文,聚焦于沙特阿拉伯的远程学习领域。这项工作提出了一个在远程学习领域使用推特数据集来分析人们反馈的模型。所提出的模型基于Apache Spark产品来管理大型数据集。该模型使用推特应用程序编程接口获取推文作为原始数据。这些推文存储在Apache Spark服务器中。一种基于正则表达式的预处理技术从数据集中删除了转发、链接、主题标签、英语单词和数字、用户名以及表情符号。之后,在预处理后的数据上训练了一个基于逻辑回归的模型。这个来自机器学习领域的逻辑回归模型被用于预测推文中的情感。最后,构建了一个用于对阿拉伯语推文进行情感分析的Flask应用程序。与各种应用技术相比,所提出的模型给出了更好的结果。在测试数据上对所提出的模型进行评估,以计算准确率、F1分数、精确率和召回率,分别获得了91%、90%、90%和89%的分数。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/2a78e3f5b009/peerj-cs-08-1047-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/2ce480c70eab/peerj-cs-08-1047-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/06fe3f74a506/peerj-cs-08-1047-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/973a136887a1/peerj-cs-08-1047-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/96efca8be51b/peerj-cs-08-1047-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/9fadb048e6c8/peerj-cs-08-1047-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/2a78e3f5b009/peerj-cs-08-1047-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/2ce480c70eab/peerj-cs-08-1047-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/06fe3f74a506/peerj-cs-08-1047-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/973a136887a1/peerj-cs-08-1047-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/96efca8be51b/peerj-cs-08-1047-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/9fadb048e6c8/peerj-cs-08-1047-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/187f/9454973/2a78e3f5b009/peerj-cs-08-1047-g006.jpg

相似文献

1
A machine learning-based approach for sentiment analysis on distance learning from Arabic Tweets.一种基于机器学习的方法用于对阿拉伯语推文的远程学习进行情感分析。
PeerJ Comput Sci. 2022 Jul 26;8:e1047. doi: 10.7717/peerj-cs.1047. eCollection 2022.
2
Sentiment Analysis of Arabic Tweets Regarding Distance Learning in Saudi Arabia during the COVID-19 Pandemic.新冠疫情期间沙特阿拉伯远程学习的阿拉伯推文的情感分析。
Sensors (Basel). 2021 Aug 11;21(16):5431. doi: 10.3390/s21165431.
3
Quantum computing and machine learning for Arabic language sentiment classification in social media.量子计算和机器学习在社交媒体中对阿拉伯语情感分类的应用。
Sci Rep. 2023 Oct 12;13(1):17305. doi: 10.1038/s41598-023-44113-7.
4
Heterogeneous Ensemble Deep Learning Model for Enhanced Arabic Sentiment Analysis.用于增强阿拉伯语情感分析的异质集成深度学习模型。
Sensors (Basel). 2022 May 12;22(10):3707. doi: 10.3390/s22103707.
5
COVID-19 vaccine sentiment analysis using public opinions on Twitter.利用推特上的公众舆论对新冠疫苗进行情绪分析。
Mater Today Proc. 2022;64:448-451. doi: 10.1016/j.matpr.2022.04.809. Epub 2022 Apr 28.
6
Depression detection for twitter users using sentiment analysis in English and Arabic tweets.使用英语和阿拉伯语推文的情感分析检测 Twitter 用户的抑郁情绪。
Artif Intell Med. 2024 Jan;147:102716. doi: 10.1016/j.artmed.2023.102716. Epub 2023 Nov 19.
7
Deep learning based sentiment analysis of public perception of working from home through tweets.基于深度学习的通过推文对公众在家工作看法的情感分析。
J Intell Inf Syst. 2023;60(1):255-274. doi: 10.1007/s10844-022-00736-2. Epub 2022 Aug 24.
8
MonkeyPox2022Tweets: A Large-Scale Twitter Dataset on the 2022 Monkeypox Outbreak, Findings from Analysis of Tweets, and Open Research Questions.猴痘2022年推文:关于2022年猴痘疫情的大规模推特数据集、推文分析结果及开放性研究问题
Infect Dis Rep. 2022 Nov 14;14(6):855-883. doi: 10.3390/idr14060087.
9
Identifying Potential Lyme Disease Cases Using Self-Reported Worldwide Tweets: Deep Learning Modeling Approach Enhanced With Sentimental Words Through Emojis.利用自我报告的全球推文识别潜在莱姆病病例:通过表情符号增强带有情感词汇的深度学习模型。
J Med Internet Res. 2023 Oct 16;25:e47014. doi: 10.2196/47014.
10
Using deep learning to analyze the psychological effects of COVID-19.利用深度学习分析新型冠状病毒肺炎的心理影响。
Front Psychol. 2023 Aug 14;14:962854. doi: 10.3389/fpsyg.2023.962854. eCollection 2023.

引用本文的文献

1
DistilRoBiLSTMFuse: an efficient hybrid deep learning approach for sentiment analysis.蒸馏循环神经网络长短期记忆网络融合模型:一种用于情感分析的高效混合深度学习方法。
PeerJ Comput Sci. 2024 Sep 26;10:e2349. doi: 10.7717/peerj-cs.2349. eCollection 2024.

本文引用的文献

1
Sentiment Analysis of Arabic Tweets Regarding Distance Learning in Saudi Arabia during the COVID-19 Pandemic.新冠疫情期间沙特阿拉伯远程学习的阿拉伯推文的情感分析。
Sensors (Basel). 2021 Aug 11;21(16):5431. doi: 10.3390/s21165431.
2
A Sentiment Analysis Approach to Predict an Individual's Awareness of the Precautionary Procedures to Prevent COVID-19 Outbreaks in Saudi Arabia.一种用于预测个体对沙特阿拉伯预防 COVID-19 爆发的预防措施意识的情感分析方法。
Int J Environ Res Public Health. 2020 Dec 30;18(1):218. doi: 10.3390/ijerph18010218.
3
The sudden transition to synchronized online learning during the COVID-19 pandemic in Saudi Arabia: a qualitative study exploring medical students' perspectives.
沙特阿拉伯在 COVID-19 大流行期间突然转向同步在线学习:一项探索医学生观点的定性研究。
BMC Med Educ. 2020 Aug 28;20(1):285. doi: 10.1186/s12909-020-02208-z.
4
The future of social media in marketing.社交媒体在营销领域的未来。
J Acad Mark Sci. 2020;48(1):79-95. doi: 10.1007/s11747-019-00695-1. Epub 2019 Oct 12.
5
Distance learning in the era of COVID-19.COVID-19 时代的远程学习。
Arch Dermatol Res. 2021 Jul;313(5):389-390. doi: 10.1007/s00403-020-02088-9. Epub 2020 May 8.