西班牙语中的希望言语检测：LGBT案例。

Hope speech detection in Spanish: The LGBT case.

作者信息

García-Baena Daniel, García-Cumbreras Miguel Ángel, Jiménez-Zafra Salud María, García-Díaz José Antonio, Valencia-García Rafael

机构信息

I.E.S. San Juan de la Cruz, Jaén, Spain.

Computer Science Department, SINAI Research Group, CEATIC, Universidad de Jaén, Jaén, Spain.

出版信息

Lang Resour Eval. 2023 Mar 17:1-28. doi: 10.1007/s10579-023-09638-3.

DOI:10.1007/s10579-023-09638-3

PMID:37360265

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10022560/

Abstract

In recent years, systems have been developed to monitor online content and remove abusive, offensive or hateful content. Comments in online social media have been analyzed to find and stop the spread of negativity using methods such as hate speech detection, identification of offensive language or detection of abusive language. We define hope speech as the type of speech that is able to relax a hostile environment and that helps, gives suggestions and inspires for good to a number of people when they are in times of illness, stress, loneliness or depression. Detecting it automatically, in order to give greater diffusion to positive comments, can have a very significant effect when it comes to fighting against sexual or racial discrimination or when we intend to foster less bellicose environments. In this article we perform a complete study on hope speech, analyzing existing solutions and available resources. In addition, we have generated a quality resource, SpanishHopeEDI, a new Spanish Twitter dataset on LGBT community, and we have conducted some experiments that can serve as a baseline for further research.

摘要

近年来，已开发出一些系统来监控在线内容并删除辱骂性、攻击性或仇恨性内容。人们通过分析在线社交媒体中的评论，运用诸如仇恨言论检测、冒犯性语言识别或辱骂性语言检测等方法，来发现并阻止负面信息的传播。我们将希望言论定义为这样一种言论类型：它能够缓和敌对氛围，在人们患病、承受压力、感到孤独或抑郁时，对许多人起到帮助、给予建议并激发其积极向上的作用。自动检测希望言论，以便让积极评论得到更广泛传播，在打击性别或种族歧视方面，或者在我们想要营造不那么好战的环境时，可能会产生非常显著的效果。在本文中，我们对希望言论进行了全面研究，分析了现有的解决方案和可用资源。此外，我们还生成了一个高质量资源，即SpanishHopeEDI，这是一个关于 LGBT 社区的全新西班牙语推特数据集，并且我们进行了一些实验，这些实验可为进一步研究提供基线。