• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

分析和学习不同类型骚扰的语言。

Analyzing and learning the language for different types of harassment.

机构信息

University of Wisconsin-Madison, Madison, Wisconsin, United States of America.

University of Dayton, Dayton, Ohio, United States of America.

出版信息

PLoS One. 2020 Mar 27;15(3):e0227330. doi: 10.1371/journal.pone.0227330. eCollection 2020.

DOI:10.1371/journal.pone.0227330
PMID:32218569
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7100939/
Abstract

THIS ARTICLE USES WORDS OR LANGUAGE THAT IS CONSIDERED PROFANE, VULGAR, OR OFFENSIVE BY SOME READERS. The presence of a significant amount of harassment in user-generated content and its negative impact calls for robust automatic detection approaches. This requires the identification of different types of harassment. Earlier work has classified harassing language in terms of hurtfulness, abusiveness, sentiment, and profanity. However, to identify and understand harassment more accurately, it is essential to determine the contextual type that captures the interrelated conditions in which harassing language occurs. In this paper we introduce the notion of contextual type in harassment by distinguishing between five contextual types: (i) sexual, (ii) racial, (iii) appearance-related, (iv) intellectual and (v) political. We utilize an annotated corpus from Twitter distinguishing these types of harassment. We study the context of each kind to shed light on the linguistic meaning, interpretation, and distribution, with results from two lines of investigation: an extensive linguistic analysis, and the statistical distribution of uni-grams. We then build type- aware classifiers to automate the identification of type-specific harassment. Our experiments demonstrate that these classifiers provide competitive accuracy for identifying and analyzing harassment on social media. We present extensive discussion and significant observations about the effectiveness of type-aware classifiers using a detailed comparison setup, providing insight into the role of type-dependent features.

摘要

本文使用了一些被某些读者认为是亵渎、粗俗或冒犯性的词语或语言。用户生成内容中存在大量骚扰,且其产生了负面影响,这呼吁我们采取强有力的自动检测方法。这需要识别不同类型的骚扰。早期的工作已经根据伤害性、辱骂性、情感和亵渎性等方面对骚扰语言进行了分类。然而,为了更准确地识别和理解骚扰,确定上下文类型以捕捉骚扰语言发生的相关条件至关重要。在本文中,我们通过区分以下五种上下文类型来引入骚扰中的上下文类型的概念:(i)性,(ii)种族,(iii)外貌相关,(iv)智力和(v)政治。我们利用来自 Twitter 的带注释语料库来区分这些类型的骚扰。我们研究了每种骚扰的上下文,以阐明其语言意义、解释和分布,这得益于两条研究路线的结果:广泛的语言分析和一元词的统计分布。然后,我们构建了基于类型的分类器,以实现对特定类型骚扰的自动识别。我们的实验表明,这些分类器在识别和分析社交媒体上的骚扰方面提供了有竞争力的准确性。我们通过详细的比较设置展示了关于基于类型的分类器有效性的广泛讨论和重要观察,深入了解了依赖类型的特征的作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/0a1e1978adec/pone.0227330.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/557ed0a3bb6e/pone.0227330.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/52ae83866b14/pone.0227330.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/9ba3905e3546/pone.0227330.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/2372ab77d9c3/pone.0227330.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/a339e7fd3b62/pone.0227330.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/0a1e1978adec/pone.0227330.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/557ed0a3bb6e/pone.0227330.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/52ae83866b14/pone.0227330.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/9ba3905e3546/pone.0227330.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/2372ab77d9c3/pone.0227330.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/a339e7fd3b62/pone.0227330.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f63/7100939/0a1e1978adec/pone.0227330.g006.jpg

相似文献

1
Analyzing and learning the language for different types of harassment.分析和学习不同类型骚扰的语言。
PLoS One. 2020 Mar 27;15(3):e0227330. doi: 10.1371/journal.pone.0227330. eCollection 2020.
2
Offensive language detection in low resource languages: A use case of Persian language.低资源语言中的攻击性语言检测:以波斯语为例。
PLoS One. 2024 Jun 21;19(6):e0304166. doi: 10.1371/journal.pone.0304166. eCollection 2024.
3
Sexual harassment of critical care nurses: a costly workplace issue.重症监护护士遭受的性骚扰:一个代价高昂的职场问题。
Am J Crit Care. 1994 Nov;3(6):409-15.
4
Sexual harassment of nurses and nursing students.护士及护理专业学生遭受的性骚扰。
J Adv Nurs. 2003 Jun;42(6):637-44. doi: 10.1046/j.1365-2648.2003.02667.x.
5
Online harassment: a toolkit for protecting yourself from abuse.网络骚扰:保护自己免受虐待的工具包。
Nature. 2022 Sep;609(7925):205-207. doi: 10.1038/d41586-022-02766-w.
6
Sexual harassment in the workplace: nurses' perceptions.职场中的性骚扰:护士的看法。
J Nurs Adm. 2000 Oct;30(10):497-503. doi: 10.1097/00005110-200010000-00008.
7
Screening for Harassment, Abuse, and Discrimination among Surgery Residents: An EAST Multicenter Trial.外科住院医师中骚扰、虐待和歧视的筛查:一项东部多中心试验。
Am Surg. 2019 May 1;85(5):456-461.
8
International Olympic Committee consensus statement: harassment and abuse (non-accidental violence) in sport.国际奥林匹克委员会共识声明:体育运动中的骚扰和虐待(非意外暴力)。
Br J Sports Med. 2016 Sep;50(17):1019-29. doi: 10.1136/bjsports-2016-096121. Epub 2016 Apr 26.
9
The #MeToo Movement in the United States: Text Analysis of Early Twitter Conversations.美国的#MeToo运动:早期推特对话的文本分析。
J Med Internet Res. 2019 Sep 3;21(9):e13837. doi: 10.2196/13837.
10
Harassment, a field study.骚扰,一项实地研究。
Nat Ecol Evol. 2017 Dec;1(12):1787-1788. doi: 10.1038/s41559-017-0404-3.

引用本文的文献

1
Correction: Analyzing and learning the language for different types of harassment.更正:分析和学习针对不同类型骚扰的语言。
PLoS One. 2020 Apr 29;15(4):e0232650. doi: 10.1371/journal.pone.0232650. eCollection 2020.

本文引用的文献

1
Automatic detection of cyberbullying in social media text.社交媒体文本中网络欺凌的自动检测。
PLoS One. 2018 Oct 8;13(10):e0203794. doi: 10.1371/journal.pone.0203794. eCollection 2018.
2
Appearance-related cyberbullying: a qualitative investigation of characteristics, content, reasons, and effects.外貌相关的网络欺凌:对特征、内容、原因及影响的定性调查
Body Image. 2014 Sep;11(4):527-33. doi: 10.1016/j.bodyim.2014.08.006. Epub 2014 Sep 3.
3
Effect size, confidence interval and statistical significance: a practical guide for biologists.
效应量、置信区间与统计显著性:生物学家实用指南
Biol Rev Camb Philos Soc. 2007 Nov;82(4):591-605. doi: 10.1111/j.1469-185X.2007.00027.x.