Department of Sociology/ICS, Utrecht University, Utrecht, The Netherlands.
Behav Res Methods. 2024 Apr;56(4):2782-2803. doi: 10.3758/s13428-024-02381-9. Epub 2024 Apr 4.
Short texts generated by individuals in online environments can provide social and behavioral scientists with rich insights into these individuals' internal states. Trained manual coders can reliably interpret expressions of such internal states in text. However, manual coding restricts the number of texts that can be analyzed, limiting our ability to extract insights from large-scale textual data. We evaluate the performance of several automatic text analysis methods in approximating trained human coders' evaluations across four coding tasks encompassing expressions of motives, norms, emotions, and stances. Our findings suggest that commonly used dictionaries, although they perform well at identifying infrequent categories, generate false positives far more often than the other methods. We show that large language models trained on manually coded data yield the highest performance across all case studies, although in some instances simpler methods perform almost as well. Additionally, we evaluate the effectiveness of cutting-edge generative language models such as GPT-4 in coding texts for internal states using only short instructions (so-called zero-shot classification). While promising, these models fall short of the performance of models trained on manually analyzed data. We discuss the strengths and weaknesses of the various models and explore the trade-offs between model complexity and performance in different applications. Our work informs social and behavioral scientists of the challenges associated with text mining of large textual datasets, while providing best-practice recommendations.
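The dictionary approach the abstract evaluates can be illustrated with a minimal sketch. This is not code from the study; the category names and word lists are invented for illustration, but the mechanism (flagging a category whenever any of its words appears) is what makes such methods prone to false positives on context-dependent language.

```python
# Illustrative sketch of a word-list (dictionary) coder of the kind the
# abstract compares against trained models. Categories and word lists
# here are hypothetical examples, not the study's actual dictionaries.
EMOTION_DICT = {
    "anger": {"furious", "outraged", "hate"},
    "joy": {"happy", "delighted", "love"},
}

def dictionary_code(text):
    """Return every category whose word list shares a token with the text."""
    tokens = set(text.lower().split())
    return {cat for cat, words in EMOTION_DICT.items() if tokens & words}

# A single matching word fires the category regardless of context,
# which is one source of the false positives the abstract reports:
print(dictionary_code("i would love a quiet evening"))  # fires "joy"
```

A trained classifier, by contrast, can weigh the surrounding context of a word before assigning a category, which is one reason the abstract finds models trained on manually coded data more accurate.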