Sager Monique A, Kashyap Aditya M, Tamminga Mila, Ravoori Sadhana, Callison-Burch Christopher, Lipoff Jules B
Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.
Department of Computer Science, University of Pennsylvania, Philadelphia, PA, United States.
JMIR Dermatol. 2021 Sep 30;4(2):e20975. doi: 10.2196/20975.
Reddit, the fifth most popular website in the United States, boasts a large and engaged user base on its dermatology forums where users crowdsource free medical opinions. Unfortunately, much of the advice provided is unvalidated and could lead to the provision of inappropriate care. Initial testing has revealed that artificially intelligent bots can detect misinformation regarding tanning and essential oils on Reddit dermatology forums and may be able to produce responses to posts containing misinformation.
This study aimed to analyze the ability of bots to find and respond to tanning and essential oil-related health misinformation on Reddit's dermatology forums in a controlled test environment.
Using natural language processing techniques, we trained bots to target misinformation using relevant keywords and to post prefabricated responses. We compared the performance of different model architectures on a held-out test set.
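As a rough illustration of the keyword targeting and prefabricated-response steps, the following minimal Python sketch may help. It is not the authors' implementation: the keyword lists, the placeholder reply texts, the function names `match_topic` and `draft_reply`, and the `is_misinformation` callable (standing in for a trained classifier) are all assumptions introduced here for illustration.

```python
import re

# Hypothetical keyword lists; the study's actual keywords are not given here.
KEYWORDS = {
    "tanning": ["tanning bed", "base tan", "sunbed"],
    "essential_oils": ["essential oil", "tea tree oil", "lavender oil"],
}

# Placeholder prefabricated replies; a real bot would use clinician-reviewed text.
RESPONSES = {
    "tanning": "[prefabricated corrective reply about tanning]",
    "essential_oils": "[prefabricated corrective reply about essential oils]",
}


def match_topic(text):
    """Return the first topic whose keyword appears in the text, else None."""
    lowered = text.lower()
    for topic, phrases in KEYWORDS.items():
        for phrase in phrases:
            # Whole-phrase match, allowing a simple plural "s".
            pattern = r"\b" + re.escape(phrase) + r"s?\b"
            if re.search(pattern, lowered):
                return topic
    return None


def draft_reply(text, is_misinformation):
    """Return a prefabricated reply only if the post matches a keyword
    and the supplied classifier flags it as misinformation."""
    topic = match_topic(text)
    if topic is None or not is_misinformation(text):
        return None
    return RESPONSES[topic]
```

In this sketch the keyword match only narrows the candidate posts; the decision to reply is left to whatever classifier is passed in as `is_misinformation`, so different model architectures could be swapped in without changing the reply logic.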
Our models yielded test accuracies ranging from 95% to 100%, with a fine-tuned Bidirectional Encoder Representations from Transformers (BERT) model achieving the highest test accuracy. Bots were then able to post corrective prefabricated responses to misinformation in a test environment.
Using a limited data set, bots accurately detected examples of health misinformation within Reddit dermatology forums. Given that these bots can then post prefabricated responses, this technique may allow for interception of misinformation. Providing correct information does not mean that users will be receptive or find such interventions persuasive. Further studies should investigate this strategy's effectiveness to inform future deployment of bots as a technique in combating health misinformation.