Suppr超能文献

人工智能与包容性:曾涉帮派的青年作为分析非结构化推特数据的领域专家

Artificial Intelligence and Inclusion: Formerly Gang-Involved Youth as Domain Experts for Analyzing Unstructured Twitter Data.

作者信息

Frey William R, Patton Desmond U, Gaskell Michael B, McGregor Kyle A

机构信息

Columbia University, New York City, NY, USA.

New York University Langone Health, New York City, NY, USA.

出版信息

Soc Sci Comput Rev. 2020 Feb;38(1):42-56. doi: 10.1177/0894439318788314. Epub 2018 Jul 18.

Abstract

Mining social media data for studying the human condition has created new and unique challenges. When analyzing social media data from marginalized communities, algorithms lack the ability to accurately interpret off-line context, which may lead to dangerous assumptions about and implications for marginalized communities. To combat this challenge, we hired formerly gang-involved young people as domain experts for contextualizing social media data in order to create inclusive, community-informed algorithms. Utilizing data from the Gang Intervention and Computer Science Project-a comprehensive analysis of Twitter data from gang-involved youth in Chicago-we describe the process of involving formerly gang-involved young people in developing a new part-of-speech tagger and content classifier for a prototype natural language processing system that detects aggression and loss in Twitter data. We argue that involving young people as domain experts leads to more robust understandings of context, including localized language, culture, and events. These insights could change how data scientists approach the development of corpora and algorithms that affect people in marginalized communities and who to involve in that process. We offer a contextually driven interdisciplinary approach between social work and data science that integrates domain insights into the training of qualitative annotators and the production of algorithms for positive social impact.

摘要

挖掘社交媒体数据以研究人类状况带来了新的独特挑战。在分析来自边缘化社区的社交媒体数据时,算法缺乏准确解读线下背景的能力,这可能导致对边缘化社区产生危险的假设和影响。为应对这一挑战,我们聘请曾涉帮派的年轻人作为领域专家,对社交媒体数据进行背景分析,以创建具有包容性、基于社区信息的算法。利用“帮派干预与计算机科学项目”的数据——对芝加哥涉帮派青年的推特数据进行的全面分析——我们描述了让曾涉帮派的年轻人参与为一个原型自然语言处理系统开发新的词性标注器和内容分类器的过程,该系统用于检测推特数据中的攻击性和失落情绪。我们认为,让年轻人作为领域专家参与进来能带来对背景更深入的理解,包括地方语言、文化和事件。这些见解可能会改变数据科学家开发语料库和算法的方式,以及在这个过程中涉及哪些人,而这些语料库和算法会影响边缘化社区的人群。我们提供了一种社会工作与数据科学之间基于背景驱动的跨学科方法,将领域见解整合到定性注释员的培训以及用于产生积极社会影响的算法生成中。

相似文献

6
Finding Street Gang Members on Twitter.在推特上寻找街头帮派成员。
Proc IEEE ACM Int Conf Adv Soc Netw Anal Min. 2016 Aug;206:685-692. doi: 10.1109/ASONAM.2016.7752311. Epub 2016 Nov 24.

引用本文的文献

4
Equitable Research PRAXIS: A Framework for Health Informatics Methods.公平研究实践:健康信息学方法框架。
Yearb Med Inform. 2022 Aug;31(1):307-316. doi: 10.1055/s-0042-1742542. Epub 2022 Dec 4.

本文引用的文献

1
Big data: survey, technologies, opportunities, and challenges.大数据:调查、技术、机遇与挑战。
ScientificWorldJournal. 2014;2014:712826. doi: 10.1155/2014/712826. Epub 2014 Jul 17.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验