使用自然语言处理的人工智能机器人识别和应对Reddit皮肤科论坛上的健康错误信息：设计与评估研究。

Identifying and Responding to Health Misinformation on Reddit Dermatology Forums With Artificially Intelligent Bots Using Natural Language Processing: Design and Evaluation Study.

作者信息

Sager Monique A, Kashyap Aditya M, Tamminga Mila, Ravoori Sadhana, Callison-Burch Christopher, Lipoff Jules B

机构信息

Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.

Department of Computer Science, University of Pennsylvania, Philadelphia, PA, United States.

出版信息

JMIR Dermatol. 2021 Sep 30;4(2):e20975. doi: 10.2196/20975.

DOI:10.2196/20975

PMID:37632809

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10334965/

Abstract

BACKGROUND

Reddit, the fifth most popular website in the United States, boasts a large and engaged user base on its dermatology forums where users crowdsource free medical opinions. Unfortunately, much of the advice provided is unvalidated and could lead to the provision of inappropriate care. Initial testing has revealed that artificially intelligent bots can detect misinformation regarding tanning and essential oils on Reddit dermatology forums and may be able to produce responses to posts containing misinformation.

OBJECTIVE

To analyze the ability of bots to find and respond to tanning and essential oil-related health misinformation on Reddit's dermatology forums in a controlled test environment.

METHODS

Using natural language processing techniques, we trained bots to target misinformation, using relevant keywords and to post prefabricated responses. By evaluating different model architectures across a held-out test set, we compared performances.

RESULTS

Our models yielded data test accuracies ranging 95%-100%, with a Bidirectional Encoder Representations from Transformers (BERT) fine-tuned model resulting in the highest level of test accuracy. Bots were then able to post corrective prefabricated responses to misinformation in a test environment.

CONCLUSIONS

Using a limited data set, bots accurately detected examples of health misinformation within Reddit dermatology forums. Given that these bots can then post prefabricated responses, this technique may allow for interception of misinformation. Providing correct information does not mean that users will be receptive or find such interventions persuasive. Further studies should investigate this strategy's effectiveness to inform future deployment of bots as a technique in combating health misinformation.

摘要

背景

Reddit是美国第五大最受欢迎的网站，其皮肤科论坛拥有庞大且活跃的用户群体，用户在该论坛上众包免费医疗意见。不幸的是，所提供的许多建议未经证实，可能会导致提供不适当的护理。初步测试表明，人工智能机器人能够在Reddit皮肤科论坛上检测到有关晒黑和精油的错误信息，并可能能够对包含错误信息的帖子做出回应。

目的

在受控测试环境中分析机器人在Reddit皮肤科论坛上查找并回应与晒黑和精油相关的健康错误信息的能力。

方法

我们使用自然语言处理技术，训练机器人使用相关关键词来针对错误信息，并发布预制回复。通过在一个留出的测试集上评估不同的模型架构，我们比较了性能。

结果

我们的模型在数据测试中的准确率在95%-100%之间，其中经过微调的双向编码器表征来自变压器（BERT）模型的测试准确率最高。然后，机器人能够在测试环境中对错误信息发布纠正性预制回复。

结论

使用有限的数据集，机器人能够准确检测Reddit皮肤科论坛内的健康错误信息示例。鉴于这些机器人随后可以发布预制回复，这种技术可能有助于拦截错误信息。提供正确信息并不意味着用户会接受或认为此类干预具有说服力。进一步的研究应调查这种策略的有效性，以便为未来将机器人作为一种对抗健康错误信息的技术进行部署提供参考。

相似文献

Identifying and Responding to Health Misinformation on Reddit Dermatology Forums With Artificially Intelligent Bots Using Natural Language Processing: Design and Evaluation Study.使用自然语言处理的人工智能机器人识别和应对Reddit皮肤科论坛上的健康错误信息：设计与评估研究。

JMIR Dermatol. 2021 Sep 30;4(2):e20975. doi: 10.2196/20975.

Natural language processing of Reddit data to evaluate dermatology patient experiences and therapeutics.利用 Reddit 数据进行自然语言处理以评估皮肤科患者的体验和治疗效果。

J Am Acad Dermatol. 2020 Sep;83(3):803-808. doi: 10.1016/j.jaad.2019.07.014. Epub 2019 Jul 12.

Enabling Early Health Care Intervention by Detecting Depression in Users of Web-Based Forums using Language Models: Longitudinal Analysis and Evaluation.通过语言模型检测基于网络论坛用户的抑郁症实现早期医疗干预：纵向分析与评估

JMIR AI. 2023 Mar 24;2:e41205. doi: 10.2196/41205.

Characterizing the Prevalence of Obesity Misinformation, Factual Content, Stigma, and Positivity on the Social Media Platform Reddit Between 2011 and 2019: Infodemiology Study. characterizing the prevalence of obesity misinformation, factual content, stigma, and positivity on the social media platform reddit between 2011 and 2019: infodemiology study.

J Med Internet Res. 2022 Dec 30;24(12):e36729. doi: 10.2196/36729.

Social Media and Dermatology During the COVID-19 Pandemic: Analyzing User-Submitted Posts Seeking Dermatologic Advice on Reddit.新冠疫情期间的社交媒体与皮肤病学：分析在Reddit上寻求皮肤科建议的用户提交帖子

Cureus. 2023 Jan 12;15(1):e33720. doi: 10.7759/cureus.33720. eCollection 2023 Jan.

Internet-Based Mental Health Survey Research: Navigating Internet Bots on Reddit.基于互联网的心理健康调查研究：应对Reddit上的网络机器人

Cyberpsychol Behav Soc Netw. 2023 Feb;26(2):73-79. doi: 10.1089/cyber.2022.0173. Epub 2023 Feb 1.

Exploring COVID-related relationship extraction: Contrasting data sources and analyzing misinformation.探索与新冠病毒相关的关系提取：对比数据源并分析错误信息。

Heliyon. 2024 Feb 28;10(5):e26973. doi: 10.1016/j.heliyon.2024.e26973. eCollection 2024 Mar 15.

Characterizing and Identifying the Prevalence of Web-Based Misinformation Relating to Medication for Opioid Use Disorder: Machine Learning Approach.描述和识别与阿片类药物使用障碍药物治疗相关的网络错误信息的流行情况：机器学习方法。

J Med Internet Res. 2021 Dec 22;23(12):e30753. doi: 10.2196/30753.

Fine-Tuning BERT Models to Classify Misinformation on Garlic and COVID-19 on Twitter.微调 BERT 模型以在 Twitter 上对大蒜和 COVID-19 相关的错误信息进行分类。

Int J Environ Res Public Health. 2022 Apr 22;19(9):5126. doi: 10.3390/ijerph19095126.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

引用本文的文献

A scoping review of natural language processing in addressing medically inaccurate information: Errors, misinformation, and hallucination.关于自然语言处理在处理医学错误信息方面的范围综述：错误、错误信息和幻觉。

J Biomed Inform. 2025 Jul 22:104866. doi: 10.1016/j.jbi.2025.104866.

Isotretinoin on TikTok: A Qualitative Analysis of Attitudes, Perspectives, and Knowledge Dissemination of User-Generated Content.异维A酸在TikTok上的情况：对用户生成内容的态度、观点及知识传播的定性分析

Cureus. 2025 Mar 8;17(3):e80244. doi: 10.7759/cureus.80244. eCollection 2025 Mar.

Reddit users' perspectives on radiofrequency ablation: A data analysis.Reddit用户对射频消融术的看法：一项数据分析。

Interv Pain Med. 2024 Dec 19;4(1):100535. doi: 10.1016/j.inpm.2024.100535. eCollection 2025 Mar.

Natural language processing in dermatology: A systematic literature review and state of the art.皮肤科自然语言处理：系统文献回顾与现状

J Eur Acad Dermatol Venereol. 2024 Dec;38(12):2225-2234. doi: 10.1111/jdv.20286. Epub 2024 Aug 16.

Portrayal of mental health effects of isotretinoin on TikTok.异维A酸对心理健康影响在TikTok上的呈现。

JAAD Int. 2023 Dec 1;14:90-91. doi: 10.1016/j.jdin.2023.10.008. eCollection 2024 Mar.

A quasi-experimental study examining the efficacy of multimodal bot screening tools and recommendations to preserve data integrity in online psychological research.一项准实验研究，旨在检验多模态 bot 筛查工具的效果，并提出建议以维护在线心理研究中的数据完整性。

Am Psychol. 2024 Oct;79(7):956-969. doi: 10.1037/amp0001183. Epub 2023 Jul 20.

Using Machine Learning Technology (Early Artificial Intelligence-Supported Response With Social Listening Platform) to Enhance Digital Social Understanding for the COVID-19 Infodemic: Development and Implementation Study.利用机器学习技术（借助社交倾听平台实现早期人工智能支持的响应）增强对 COVID-19 信息疫情的数字社会理解：开发与实施研究。

JMIR Infodemiology. 2023 Aug 21;3:e47317. doi: 10.2196/47317.

When Virtual Assistants Meet Teledermatology: Validation of a Virtual Assistant to Improve the Quality of Life of Psoriatic Patients.当虚拟助手遇见远程皮肤病学：验证一个虚拟助手以改善银屑病患者的生活质量。

Int J Environ Res Public Health. 2022 Nov 5;19(21):14527. doi: 10.3390/ijerph192114527.

Patient and Public Involvement in Dermatology Research: A Review.患者和公众参与皮肤科研究：综述。

Am J Clin Dermatol. 2022 May;23(3):319-329. doi: 10.1007/s40257-022-00680-5. Epub 2022 Mar 29.

The role of dermatologists in social media: exploring the benefits and risks.皮肤科医生在社交媒体中的作用：探索益处与风险。

Hautarzt. 2022 May;73(5):401-404. doi: 10.1007/s00105-022-04946-1. Epub 2022 Feb 8.

本文引用的文献

Natural language processing of Reddit data to evaluate dermatology patient experiences and therapeutics.利用 Reddit 数据进行自然语言处理以评估皮肤科患者的体验和治疗效果。

J Am Acad Dermatol. 2020 Sep;83(3):803-808. doi: 10.1016/j.jaad.2019.07.014. Epub 2019 Jul 12.

Communicating Research in an Era of Misinformation.在错误信息时代传播研究成果。

Am J Public Health. 2019 May;109(5):645. doi: 10.2105/AJPH.2019.305048.

The spread of low-credibility content by social bots.社交机器人传播低可信度内容。

Nat Commun. 2018 Nov 20;9(1):4787. doi: 10.1038/s41467-018-06930-7.

Weaponized Health Communication: Twitter Bots and Russian Trolls Amplify the Vaccine Debate.武器化的健康传播：推特机器人和俄罗斯水军放大疫苗辩论。

Am J Public Health. 2018 Oct;108(10):1378-1384. doi: 10.2105/AJPH.2018.304567. Epub 2018 Aug 23.

Misinformation and Its Correction: Continued Influence and Successful Debiasing.错误信息及其纠正：持续影响与成功去偏倚

Psychol Sci Public Interest. 2012 Dec;13(3):106-31. doi: 10.1177/1529100612451018.

The potential carcinogenic risk of tanning beds: clinical guidelines and patient safety advice.晒黑床的潜在致癌风险：临床指南和患者安全建议。

Cancer Manag Res. 2010 Oct 28;2:277-82. doi: 10.2147/CMR.S7403.

Tanning bed exposure increases the risk of malignant melanoma.使用晒黑床会增加患恶性黑色素瘤的风险。

Int J Dermatol. 2007 Dec;46(12):1253-7. doi: 10.1111/j.1365-4632.2007.03408.x.

Prepubertal gynecomastia linked to lavender and tea tree oils.青春期前男性乳房发育与薰衣草油和茶树油有关。

N Engl J Med. 2007 Feb 1;356(5):479-85. doi: 10.1056/NEJMoa064725.

Ingestion of tea tree oil (Melaleuca oil) by a 4-year-old boy.

Pediatr Emerg Care. 2003 Jun;19(3):169-71. doi: 10.1097/01.pec.0000081241.98249.7b.

Melaleuca oil poisoning in a 17-month-old.

Vet Hum Toxicol. 1995 Dec;37(6):557-8.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验