Department of Interactive Media, School of Communication, Hong Kong Baptist University, Kowloon, Hong Kong.
Department of Media and Communication, City University of Hong Kong, Kowloon, Hong Kong.
Cyberpsychol Behav Soc Netw. 2023 Jul;26(7):527-534. doi: 10.1089/cyber.2022.0158. Epub 2023 May 3.
Artificial intelligence (AI) has been increasingly integrated into content moderation to detect and remove hate speech on social media. An online experiment (N = 478) was conducted to examine how moderation agents (AI vs. human vs. human-AI collaboration) and removal explanations (with vs. without) affect users' perceptions and acceptance of removal decisions for hate speech targeting social groups with certain characteristics, such as religion or sexual orientation. The results showed that individuals exhibited consistent levels of perceived trustworthiness and acceptance of removal decisions regardless of the type of moderation agent. When explanations for the content takedown were provided, removal decisions made jointly by humans and AI were perceived as more trustworthy than the same decisions made by humans alone, which in turn increased users' willingness to accept the verdict. However, this moderated mediation effect was significant only when Muslims, not homosexuals, were the target of the hate speech.