Elmitwalli Sherif, Mehegan John, Gallagher Allen, Alebshehy Raouf
Tobacco Control Research Group, Department for Health, University of Bath, Bath, United Kingdom.
Front Big Data. 2024 Nov 28;7:1501154. doi: 10.3389/fdata.2024.1501154. eCollection 2024.
Accurate sentiment analysis and intent categorization of tobacco and e-cigarette-related social media content are critical for public health research, yet they necessitate specialized natural language processing approaches.
To compare pre-trained and fine-tuned Flan-T5 models for intent classification and sentiment analysis of tobacco and e-cigarette tweets, demonstrating the effectiveness of pre-training a lightweight large language model for domain specific tasks.
Three Flan-T5 classification models were developed: (1) tobacco intent, (2) e-cigarette intent, and (3) sentiment analysis. Domain-specific datasets with tobacco and e-cigarette tweets were created using GPT-4 and validated by tobacco control specialists using a rigorous evaluation process. A standardized rubric and consensus mechanism involving domain specialists ensured high-quality datasets. The Flan-T5 Large Language Models were fine-tuned using Low-Rank Adaptation and evaluated against pre-trained baselines on the datasets using accuracy performance metrics. To further assess model generalizability and robustness, the fine-tuned models were evaluated on real-world tweets collected around the COP9 event.
In every task, fine-tuned models performed much better than pre-trained models. Compared to the pre-trained model's accuracy of 0.33, the fine-tuned model achieved an overall accuracy of 0.91 for tobacco intent classification. The fine-tuned model achieved an accuracy of 0.93 for e-cigarette intent, which is higher than the accuracy of 0.36 for the pre-trained model. The fine-tuned model significantly outperformed the pre-trained model's accuracy of 0.65 in sentiment analysis, achieving an accuracy of 0.94 for sentiments.
The effectiveness of lightweight Flan-T5 models in analyzing tweets associated with tobacco and e-cigarette is significantly improved by domain-specific fine-tuning, providing highly accurate instruments for tracking public conversation on tobacco and e-cigarette. The involvement of domain specialists in dataset validation ensured that the generated content accurately represented real-world discussions, thereby enhancing the quality and reliability of the results. Research on tobacco control and the formulation of public policy could be informed by these findings.
对烟草和电子烟相关社交媒体内容进行准确的情感分析和意图分类对公共卫生研究至关重要,但这需要专门的自然语言处理方法。
比较预训练和微调的Flan-T5模型在烟草和电子烟推文的意图分类和情感分析方面的表现,证明为特定领域任务预训练轻量级大语言模型的有效性。
开发了三个Flan-T5分类模型:(1)烟草意图,(2)电子烟意图,以及(3)情感分析。使用GPT-4创建了包含烟草和电子烟推文的特定领域数据集,并由烟草控制专家通过严格的评估过程进行验证。涉及领域专家的标准化评分标准和共识机制确保了高质量的数据集。使用低秩适应对Flan-T5大语言模型进行微调,并使用准确率性能指标在数据集上与预训练基线进行评估。为了进一步评估模型的通用性和稳健性,在COP9活动期间收集的真实世界推文中对微调模型进行评估。
在每项任务中,微调模型的表现都比预训练模型好得多。与预训练模型0.33的准确率相比,微调模型在烟草意图分类方面的总体准确率达到了0.91。微调模型在电子烟意图方面的准确率为0.93,高于预训练模型0.36的准确率。在情感分析中,微调模型的表现明显优于预训练模型0.65的准确率,情感准确率达到了0.94。
通过特定领域的微调,轻量级Flan-T5模型在分析与烟草和电子烟相关推文方面的有效性得到了显著提高,为跟踪公众对烟草和电子烟的讨论提供了高度准确的工具。领域专家参与数据集验证确保了生成的内容准确反映现实世界的讨论,从而提高了结果的质量和可靠性。这些发现可为烟草控制研究和公共政策的制定提供参考。