Using Large Language Models to Support Content Analysis: A Case Study of ChatGPT for Adverse Event Detection.

Affiliations

Herbert Wertheim School of Public Health and Human Longevity Science, University of California San Diego, La Jolla, CA, United States.

Qualcomm Institute, University of California San Diego, La Jolla, CA, United States.

Publication information

J Med Internet Res. 2024 May 2;26:e52499. doi: 10.2196/52499.

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11099800/

Abstract

This study explores the potential of using large language models to assist content analysis by conducting a case study to identify adverse events (AEs) in social media posts. The case study compares ChatGPT's performance with human annotators' in detecting AEs associated with delta-8-tetrahydrocannabinol, a cannabis-derived product. Using the identical instructions given to human annotators, ChatGPT closely approximated human results, with a high degree of agreement noted: 94.4% (9436/10,000) for any AE detection (Fleiss κ=0.95) and 99.3% (9931/10,000) for serious AEs (κ=0.96). These findings suggest that ChatGPT has the potential to replicate human annotation accurately and efficiently. The study recognizes possible limitations, including concerns about the generalizability due to ChatGPT's training data, and prompts further research with different models, data sources, and content analysis tasks. The study highlights the promise of large language models for enhancing the efficiency of biomedical research.
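The two agreement statistics quoted in the abstract, percent agreement and Fleiss κ, can be illustrated with a minimal sketch. The labels below are made-up toy data rather than the study's annotations, and the function names `percent_agreement` and `fleiss_kappa` are hypothetical; the authors' actual analysis code is not shown in this record.

```python
from collections import Counter


def percent_agreement(labels_a, labels_b):
    """Share of items on which two annotators assigned the same label."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(labels_a, labels_b))
    return matches / len(labels_a)


def fleiss_kappa(ratings):
    """Fleiss' kappa for items that were each labeled by the same number of raters.

    `ratings` is a list with one entry per item; each entry is the list of
    labels the raters gave that item.
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])
    if n_raters < 2 or any(len(item) != n_raters for item in ratings):
        raise ValueError("every item needs the same number (>= 2) of ratings")

    category_totals = Counter()
    per_item_agreement = 0.0
    for item in ratings:
        counts = Counter(item)
        category_totals.update(counts)
        # Fraction of rater pairs that agree on this item.
        agreeing = sum(c * c for c in counts.values()) - n_raters
        per_item_agreement += agreeing / (n_raters * (n_raters - 1))

    observed = per_item_agreement / n_items
    total_ratings = n_items * n_raters
    # Chance agreement from the pooled category proportions.
    expected = sum((c / total_ratings) ** 2 for c in category_totals.values())
    if expected == 1.0:
        raise ValueError("kappa is undefined when only one category is used")
    return (observed - expected) / (1 - expected)


# Toy data (assumption): 1 = post reports an adverse event, 0 = it does not.
human_labels = [1, 1, 0, 0, 1, 0, 0, 0, 1, 0]
model_labels = [1, 1, 0, 0, 1, 0, 0, 1, 1, 0]

agreement = percent_agreement(human_labels, model_labels)
kappa = fleiss_kappa([list(pair) for pair in zip(human_labels, model_labels)])
print(f"agreement={agreement:.3f} kappa={kappa:.3f}")
```

Percent agreement counts matching labels only, while κ also subtracts the agreement expected by chance given how often each label is used, so the two figures can diverge when one label is rare.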
