Sentiment analysis of financial Twitter posts on Twitter with the machine learning classifiers.

Authors

Cam Handan, Cam Alper Veli, Demirel Ugur, Ahmed Sana

Affiliations

Department of Management Information Systems, Faculty of Economic and Administrative Science, Gumushane University, 29000, Gumushane, Turkey.

Department of Health Care Management, Faculty of Health Sciences, Gumushane University, 29000, Gumushane, Turkey.

Publication information

Heliyon. 2023 Dec 17;10(1):e23784. doi: 10.1016/j.heliyon.2023.e23784. eCollection 2024 Jan 15.

DOI:10.1016/j.heliyon.2023.e23784

PMID:38205287

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10776998/

Abstract

This paper presents a sentiment analysis of Turkish-language tweets that combines lexicon-based and machine learning (ML)-based approaches to investigate public mood for predicting stock market behavior in BIST30, Borsa Istanbul. The main motivation of the study is to apply sentiment analysis to finance-related tweets in Turkish. We import 17189 tweets posted with the hashtags "#Borsaistanbul, #Bist, #Bist30, #Bist100" on Twitter between November 7, 2022, and November 15, 2022, via MAXQDA 2020, a qualitative data analysis program. For the lexicon-based side, we use the multilingual sentiment analysis offered by the Orange program to label the polarity of each of the 17189 samples as positive, negative, or neutral. Neutral samples are discarded for the machine learning experiments. For the machine learning side, we select 9076 samples labeled positive or negative and treat the task as a classification problem, using six different supervised machine learning classifiers run in Python 3.6 with the sklearn library. In the experiments, 80% of the selected data is used for the training phase and the rest is used for the testing and validation phase. The results show that the Support Vector Machine and Multilayer Perceptron classifiers perform better than the other classifiers, with accuracies of 0.89 and 0.88 and AUC values of 0.8729 and 0.8647, respectively. The other classifiers obtain an accuracy of approximately 78.5%. Sentiment analysis accuracy could be increased through parameter optimization and changes to the pre-processing steps on a larger, cleaner, and more balanced dataset. This work can be expanded in the future to develop better sentiment analysis using deep learning approaches.
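The data flow described in the abstract (lexicon labeling, discarding neutral samples, an 80/20 split, and evaluation by accuracy and AUC) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the toy lexicon, the sample tweets, and the helper names `lexicon_score`, `label`, `split_80_20`, `accuracy`, and `auc` are hypothetical and are not taken from the paper, Orange, or sklearn. The classifier training step itself, which the authors carried out with sklearn, is deliberately left out.

```python
import random

# Hypothetical toy lexicon (word -> polarity). The paper relied on the
# multilingual sentiment analysis in Orange, not on this word list.
LEXICON = {"yükseliş": 1, "kazanç": 1, "rekor": 1,
           "düşüş": -1, "kayıp": -1, "zarar": -1}


def lexicon_score(text):
    """Sum the polarities of known words; unknown words count as 0."""
    return sum(LEXICON.get(word, 0) for word in text.split())


def label(text):
    """Map a lexicon score to positive, negative, or neutral."""
    score = lexicon_score(text)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def split_80_20(items, seed=0):
    """Shuffle reproducibly, then keep 80% for training and 20% for testing."""
    items = list(items)
    random.Random(seed).shuffle(items)
    cut = int(len(items) * 0.8)
    return items[:cut], items[cut:]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def auc(y_true, scores):
    """Chance that a random positive outscores a random negative (ties = 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Made-up lowercase sample tweets, for illustration only.
tweets = [
    "#bist30 rekor kazanç", "#bist100 düşüş zarar", "#bist yükseliş",
    "#borsaistanbul kayıp", "#bist30 bugün açıldı", "#bist100 rekor yükseliş",
    "#bist zarar kayıp düşüş", "#bist30 kazanç", "#bist100 kayıp",
    "#borsaistanbul seans kapandı", "#bist rekor", "#bist30 düşüş",
]

labeled = [(text, label(text)) for text in tweets]
# Neutral samples are dropped before any classifier would be trained.
polar = [(text, lab) for text, lab in labeled if lab != "neutral"]
train, test = split_80_20(polar)
```

Accuracy depends on a single decision threshold, while AUC summarizes how well the classifier ranks positives above negatives across all thresholds, which is why the two reported figures for each classifier need not coincide.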
