与COP9相关推文的情感分析：预训练模型与传统技术的比较研究

Sentiment analysis of COP9-related tweets: a comparative study of pre-trained models and traditional techniques.

作者信息

Elmitwalli Sherif, Mehegan John

机构信息

Tobacco Control Research Group, Department for Health, University of Bath, Bath, United Kingdom.

出版信息

Front Big Data. 2024 Mar 20;7:1357926. doi: 10.3389/fdata.2024.1357926. eCollection 2024.

DOI:10.3389/fdata.2024.1357926

PMID:38572292

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10987730/

Abstract

INTRODUCTION

Sentiment analysis has become a crucial area of research in natural language processing in recent years. The study aims to compare the performance of various sentiment analysis techniques, including lexicon-based, machine learning, Bi-LSTM, BERT, and GPT-3 approaches, using two commonly used datasets, IMDB reviews and Sentiment140. The objective is to identify the best-performing technique for an exemplar dataset, tweets associated with the WHO Framework Convention on Tobacco Control Ninth Conference of the Parties in 2021 (COP9).

METHODS

A two-stage evaluation was conducted. In the first stage, various techniques were compared on standard sentiment analysis datasets using standard evaluation metrics such as accuracy, F1-score, and precision. In the second stage, the best-performing techniques from the first stage were applied to partially annotated COP9 conference-related tweets.

RESULTS

In the first stage, BERT achieved the highest F1-scores (0.9380 for IMDB and 0.8114 for Sentiment 140), followed by GPT-3 (0.9119 and 0.7913) and Bi-LSTM (0.8971 and 0.7778). In the second stage, GPT-3 performed the best for sentiment analysis on partially annotated COP9 conference-related tweets, with an F1-score of 0.8812.

DISCUSSION

The study demonstrates the effectiveness of pre-trained models like BERT and GPT-3 for sentiment analysis tasks, outperforming traditional techniques on standard datasets. Moreover, the better performance of GPT-3 on the partially annotated COP9 tweets highlights its ability to generalize well to domain-specific data with limited annotations. This provides researchers and practitioners with a viable option of using pre-trained models for sentiment analysis in scenarios with limited or no annotated data across different domains.

摘要

引言

近年来，情感分析已成为自然语言处理中一个至关重要的研究领域。本研究旨在使用两个常用数据集（IMDB影评和Sentiment140）比较各种情感分析技术的性能，包括基于词典的、机器学习、双向长短期记忆网络（Bi-LSTM）、BERT和GPT-3方法。目标是为一个示例数据集（与2021年世界卫生组织《烟草控制框架公约》第九届缔约方会议（COP9）相关的推文）确定性能最佳的技术。

方法

进行了两阶段评估。在第一阶段，使用诸如准确率、F1分数和精确率等标准评估指标，在标准情感分析数据集上比较各种技术。在第二阶段，将第一阶段中性能最佳的技术应用于部分标注的与COP9会议相关的推文。

结果

在第一阶段，BERT获得了最高的F1分数（IMDB数据集为0.9380，Sentiment140数据集为0.8114），其次是GPT-3（分别为0.9119和0.7913）和Bi-LSTM（分别为0.8971和0.7778）。在第二阶段，GPT-3在部分标注的与COP9会议相关的推文的情感分析中表现最佳，F1分数为0.8812。

讨论

该研究证明了像BERT和GPT-3这样的预训练模型在情感分析任务中的有效性，在标准数据集上优于传统技术。此外，GPT-3在部分标注的COP9推文中的更好性能突出了其对有限标注的特定领域数据的良好泛化能力。这为研究人员和从业者提供了一个可行的选择，即在不同领域中有限或无标注数据的情况下，使用预训练模型进行情感分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b198/10987730/1f8d8712d262/fdata-07-1357926-g0001.jpg

相似文献

Sentiment analysis of COP9-related tweets: a comparative study of pre-trained models and traditional techniques.

Front Big Data. 2024 Mar 20;7:1357926. doi: 10.3389/fdata.2024.1357926. eCollection 2024.

Multi-class sentiment analysis of urdu text using multilingual BERT.

Sci Rep. 2022 Mar 31;12(1):5436. doi: 10.1038/s41598-022-09381-9.

Efficacy of ChatGPT in Cantonese Sentiment Analysis: Comparative Study.

J Med Internet Res. 2024 Jan 30;26:e51069. doi: 10.2196/51069.

Quantum computing and machine learning for Arabic language sentiment classification in social media.

Sci Rep. 2023 Oct 12;13(1):17305. doi: 10.1038/s41598-023-44113-7.

Fuzzy ensemble of fined tuned BERT models for domain-specific sentiment analysis of software engineering dataset.

PLoS One. 2024 May 28;19(5):e0300279. doi: 10.1371/journal.pone.0300279. eCollection 2024.

Vaccine sentiment analysis using BERT + NBSVM and geo-spatial approaches.

J Supercomput. 2023 May 7:1-31. doi: 10.1007/s11227-023-05319-8.

A BERT Framework to Sentiment Analysis of Tweets.

Sensors (Basel). 2023 Jan 2;23(1):506. doi: 10.3390/s23010506.

A comparative study of large language model-based zero-shot inference and task-specific supervised classification of breast cancer pathology reports.

J Am Med Inform Assoc. 2024 Oct 1;31(10):2315-2327. doi: 10.1093/jamia/ocae146.

Sentiment Analysis Methods for HPV VaccinesRelated Tweets Based on Transfer Learning.

Healthcare (Basel). 2020 Aug 28;8(3):307. doi: 10.3390/healthcare8030307.

Topic prediction for tobacco control based on COP9 tweets using machine learning techniques.

PLoS One. 2024 Feb 15;19(2):e0298298. doi: 10.1371/journal.pone.0298298. eCollection 2024.

引用本文的文献

Scalable evaluation framework for retrieval augmented generation in tobacco research using large Language models.

Sci Rep. 2025 Jul 2;15(1):22760. doi: 10.1038/s41598-025-05726-2.

Enhancing sentiment and intent analysis in public health via fine-tuned Large Language Models on tobacco and e-cigarette-related tweets.

Front Big Data. 2024 Nov 28;7:1501154. doi: 10.3389/fdata.2024.1501154. eCollection 2024.

Analyzing digital societal interactions and sentiment classification in Twitter (X) during critical events in Chile.

Heliyon. 2024 Jun 11;10(12):e32572. doi: 10.1016/j.heliyon.2024.e32572. eCollection 2024 Jun 30.

本文引用的文献

Topic prediction for tobacco control based on COP9 tweets using machine learning techniques.

PLoS One. 2024 Feb 15;19(2):e0298298. doi: 10.1371/journal.pone.0298298. eCollection 2024.

A systematic review of social network sentiment analysis with comparative study of ensemble-based techniques.

Artif Intell Rev. 2023 Apr 12:1-55. doi: 10.1007/s10462-023-10472-w.

Improving the Polarity of Text through word2vec Embedding for Primary Classical Arabic Sentiment Analysis.

Neural Process Lett. 2023 Jan 23:1-16. doi: 10.1007/s11063-022-11111-1.

A BERT Framework to Sentiment Analysis of Tweets.

Sensors (Basel). 2023 Jan 2;23(1):506. doi: 10.3390/s23010506.

Character gated recurrent neural networks for Arabic sentiment analysis.

Sci Rep. 2022 Jun 13;12(1):9779. doi: 10.1038/s41598-022-13153-w.

New meaning for NLP: the trials and tribulations of natural language processing with GPT-3 in ophthalmology.

Br J Ophthalmol. 2022 Jul;106(7):889-892. doi: 10.1136/bjophthalmol-2022-321141. Epub 2022 May 6.

Multi-class sentiment analysis of urdu text using multilingual BERT.

Sci Rep. 2022 Mar 31;12(1):5436. doi: 10.1038/s41598-022-09381-9.

Where next for the WHO Framework Convention on Tobacco Control?

Tob Control. 2022 Mar;31(2):183-186. doi: 10.1136/tobaccocontrol-2021-056545.

Comparing machine learning algorithms for predicting COVID-19 mortality.

BMC Med Inform Decis Mak. 2022 Jan 4;22(1):2. doi: 10.1186/s12911-021-01742-0.

COVID-19 Sensing: Negative Sentiment Analysis on Social Media in China via BERT Model.

IEEE Access. 2020 Jul 28;8:138162-138169. doi: 10.1109/ACCESS.2020.3012595. eCollection 2020.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

与COP9相关推文的情感分析：预训练模型与传统技术的比较研究

Sentiment analysis of COP9-related tweets: a comparative study of pre-trained models and traditional techniques.

作者信息

Elmitwalli Sherif, Mehegan John

机构信息

Tobacco Control Research Group, Department for Health, University of Bath, Bath, United Kingdom.