通过在与烟草和电子烟相关的推文上微调大型语言模型来增强公共卫生领域的情感和意图分析。

Enhancing sentiment and intent analysis in public health via fine-tuned Large Language Models on tobacco and e-cigarette-related tweets.

作者信息

Elmitwalli Sherif, Mehegan John, Gallagher Allen, Alebshehy Raouf

机构信息

Tobacco Control Research Group, Department for Health, University of Bath, Bath, United Kingdom.

出版信息

Front Big Data. 2024 Nov 28;7:1501154. doi: 10.3389/fdata.2024.1501154. eCollection 2024.

DOI:10.3389/fdata.2024.1501154

PMID:39669437

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11634848/

Abstract

BACKGROUND

Accurate sentiment analysis and intent categorization of tobacco and e-cigarette-related social media content are critical for public health research, yet they necessitate specialized natural language processing approaches.

OBJECTIVE

To compare pre-trained and fine-tuned Flan-T5 models for intent classification and sentiment analysis of tobacco and e-cigarette tweets, demonstrating the effectiveness of pre-training a lightweight large language model for domain specific tasks.

METHODS

Three Flan-T5 classification models were developed: (1) tobacco intent, (2) e-cigarette intent, and (3) sentiment analysis. Domain-specific datasets with tobacco and e-cigarette tweets were created using GPT-4 and validated by tobacco control specialists using a rigorous evaluation process. A standardized rubric and consensus mechanism involving domain specialists ensured high-quality datasets. The Flan-T5 Large Language Models were fine-tuned using Low-Rank Adaptation and evaluated against pre-trained baselines on the datasets using accuracy performance metrics. To further assess model generalizability and robustness, the fine-tuned models were evaluated on real-world tweets collected around the COP9 event.

RESULTS

In every task, fine-tuned models performed much better than pre-trained models. Compared to the pre-trained model's accuracy of 0.33, the fine-tuned model achieved an overall accuracy of 0.91 for tobacco intent classification. The fine-tuned model achieved an accuracy of 0.93 for e-cigarette intent, which is higher than the accuracy of 0.36 for the pre-trained model. The fine-tuned model significantly outperformed the pre-trained model's accuracy of 0.65 in sentiment analysis, achieving an accuracy of 0.94 for sentiments.

CONCLUSION

The effectiveness of lightweight Flan-T5 models in analyzing tweets associated with tobacco and e-cigarette is significantly improved by domain-specific fine-tuning, providing highly accurate instruments for tracking public conversation on tobacco and e-cigarette. The involvement of domain specialists in dataset validation ensured that the generated content accurately represented real-world discussions, thereby enhancing the quality and reliability of the results. Research on tobacco control and the formulation of public policy could be informed by these findings.

摘要

背景

对烟草和电子烟相关社交媒体内容进行准确的情感分析和意图分类对公共卫生研究至关重要，但这需要专门的自然语言处理方法。

目的

比较预训练和微调的Flan-T5模型在烟草和电子烟推文的意图分类和情感分析方面的表现，证明为特定领域任务预训练轻量级大语言模型的有效性。

方法

开发了三个Flan-T5分类模型：（1）烟草意图，（2）电子烟意图，以及（3）情感分析。使用GPT-4创建了包含烟草和电子烟推文的特定领域数据集，并由烟草控制专家通过严格的评估过程进行验证。涉及领域专家的标准化评分标准和共识机制确保了高质量的数据集。使用低秩适应对Flan-T5大语言模型进行微调，并使用准确率性能指标在数据集上与预训练基线进行评估。为了进一步评估模型的通用性和稳健性，在COP9活动期间收集的真实世界推文中对微调模型进行评估。

结果

在每项任务中，微调模型的表现都比预训练模型好得多。与预训练模型0.33的准确率相比，微调模型在烟草意图分类方面的总体准确率达到了0.91。微调模型在电子烟意图方面的准确率为0.93，高于预训练模型0.36的准确率。在情感分析中，微调模型的表现明显优于预训练模型0.65的准确率，情感准确率达到了0.94。