Laaksonen Salla-Maaria, Haapoja Jesse, Kinnunen Teemu, Nelimarkka Matti, Pöyhtäri Reeta
Centre for Consumer Society Research, University of Helsinki, Helsinki, Finland.
Department of Computer Science, Aalto University, Espoo, Finland.
Front Big Data. 2020 Feb 5;3:3. doi: 10.3389/fdata.2020.00003. eCollection 2020.
Hate speech has been identified as a pressing problem in society, and several automated approaches have been designed to detect and prevent it. This paper reports and reflects upon an action research setting consisting of multi-organizational collaboration conducted during the Finnish municipal elections in 2017, wherein a technical infrastructure was designed to automatically monitor candidates' social media updates for hate speech. The setting allowed us to engage in a twofold investigation. First, the collaboration offered a unique view for exploring how hate speech emerges as a technical problem. The project developed a reasonably well-performing algorithmic solution using supervised machine learning. We tested the performance of various feature extraction and machine learning methods and ended up using a combination of Bag-of-Words feature extraction with Support-Vector Machines. However, the automated approach required heavy simplification, such as using rudimentary scales for classifying hate speech and relying on word-based approaches, whereas in reality hate speech is a linguistic and social phenomenon with various tones and forms. Second, the action-research-oriented setting allowed us to observe affective responses, such as the hopes, dreams, and fears related to machine learning technology. Based on participatory observations, project artifacts and documents, interviews with project participants, and online reactions to the detection project, we identified participants' aspirations for effective automation as well as for the neutrality and objectivity an algorithmic system was expected to introduce. However, the participants expressed more critical views toward the system after the monitoring process. Our findings highlight how the powerful expectations related to technology can easily end up dominating a project dealing with a contested, topical social issue.
We conclude by discussing the problematic aspects of datafying hate and suggesting some practical implications for hate speech recognition.
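The classification setup reported in the abstract, Bag-of-Words features fed into a Support-Vector Machine, can be sketched in a few lines. This is a minimal illustrative sketch using scikit-learn, not the project's actual pipeline: the training texts, labels, and binary scale below are invented placeholders, not the Finnish election data.

```python
# Sketch of a Bag-of-Words + SVM text classifier, the combination the
# paper reports using. All training data here is hypothetical.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Placeholder examples with a rudimentary binary scale
# (0 = neutral, 1 = hateful) standing in for the project's annotations.
texts = [
    "have a nice day everyone",
    "looking forward to the debate tonight",
    "those people should be thrown out",
    "we do not want them in our country",
]
labels = [0, 0, 1, 1]

# CountVectorizer produces the Bag-of-Words term counts;
# LinearSVC is the linear Support-Vector Machine classifier.
model = make_pipeline(CountVectorizer(), LinearSVC())
model.fit(texts, labels)

print(model.predict(["they should all be thrown out"]))
```

Because a Bag-of-Words model sees only word counts, tone, irony, and context are invisible to it, which illustrates the simplification the authors discuss: the classifier can match hateful vocabulary but not the social and linguistic phenomenon itself.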