用于自动识别来自推特的亲密伴侣暴力报告的自然语言模型。

Natural language model for automatic identification of Intimate Partner Violence reports from Twitter.

作者信息

Al-Garadi Mohammed Ali, Kim Sangmi, Guo Yuting, Warren Elise, Yang Yuan-Chi, Lakamana Sahithi, Sarker Abeed

机构信息

Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, GA, United States.

School of Nursing, Emory University, Atlanta, GA, United States.

出版信息

Array (N Y). 2022 Sep;15. doi: 10.1016/j.array.2022.100217. Epub 2022 Jul 20.

DOI:10.1016/j.array.2022.100217

PMID:37006948

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10065459/

Abstract

Intimate partner violence (IPV) is a preventable public health problem that affects millions of people worldwide. Approximately one in four women are estimated to be or have been victims of severe violence at some point in their lives, irrespective of age, ethnicity, and economic status. Victims often report IPV experiences on social media, and automatic detection of such reports via machine learning may enable improved surveillance and targeted distribution of support and/or interventions for those in need. However, no artificial intelligence systems for automatic detection currently exists, and we attempted to address this research gap. We collected posts from Twitter using a list of IPV-related keywords, manually reviewed subsets of retrieved posts, and prepared annotation guidelines to categorize tweets into IPV-report or non-IPV-report. We annotated 6,348 tweets in total, with the inter-annotator agreement (IAA) of 0.86 (Cohen's kappa) among 1,834 double-annotated tweets. The class distribution in the annotated dataset was highly imbalanced, with only 668 posts (~11%) labeled as IPV-report. We then developed an effective natural language processing model to identify IPV-reporting tweets automatically. The developed model achieved classification F-scores of 0.76 for the IPV-report class and 0.97 for the non-IPV-report class. We conducted post-classification analyses to determine the causes of system errors and to ensure that the system did not exhibit biases in its decision making, particularly with respect to race and gender. Our automatic model can be an essential component for a proactive social media-based intervention and support framework, while also aiding population-level surveillance and large-scale cohort studies.

摘要

亲密伴侣暴力（IPV）是一个可预防的公共卫生问题，影响着全球数百万人。据估计，约四分之一的女性在其生命中的某个时刻曾是严重暴力的受害者，无论年龄、种族和经济状况如何。受害者经常在社交媒体上报告亲密伴侣暴力经历，通过机器学习自动检测此类报告可能有助于改善监测，并为有需要的人提供有针对性的支持和/或干预。然而，目前尚不存在用于自动检测的人工智能系统，我们试图填补这一研究空白。我们使用与亲密伴侣暴力相关的关键词列表从推特上收集帖子，手动审查检索到的帖子子集，并制定注释指南，将推文分类为亲密伴侣暴力报告或非亲密伴侣暴力报告。我们总共注释了6348条推文，在1834条经过双重注释的推文中，注释者间一致性（IAA）为0.86（科恩kappa系数）。注释数据集中的类别分布高度不均衡，只有668条帖子（约11%）被标记为亲密伴侣暴力报告。然后，我们开发了一个有效的自然语言处理模型来自动识别报告亲密伴侣暴力的推文。所开发的模型对亲密伴侣暴力报告类别的分类F分数为0.76，对非亲密伴侣暴力报告类别的分类F分数为0.97。我们进行了分类后分析，以确定系统错误的原因，并确保系统在决策过程中没有表现出偏差，特别是在种族和性别方面。我们的自动模型可以成为基于社交媒体的主动干预和支持框架的重要组成部分，同时也有助于进行人群层面的监测和大规模队列研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d629/10065459/a4ad3f81d05a/nihms-1882589-f0001.jpg

相似文献

Natural language model for automatic identification of Intimate Partner Violence reports from Twitter.用于自动识别来自推特的亲密伴侣暴力报告的自然语言模型。

Array (N Y). 2022 Sep;15. doi: 10.1016/j.array.2022.100217. Epub 2022 Jul 20.

Automatic Detection of Intimate Partner Violence Victims from Social Media for Proactive Delivery of Support.通过社交媒体自动检测亲密伴侣暴力受害者，以便主动提供支持。

AMIA Jt Summits Transl Sci Proc. 2023 Jun 16;2023:254-260. eCollection 2023.

A natural language processing pipeline to advance the use of Twitter data for digital epidemiology of adverse pregnancy outcomes.一种自然语言处理流程，以促进将推特数据用于不良妊娠结局的数字流行病学研究。

J Biomed Inform. 2020;112S:100076. doi: 10.1016/j.yjbinx.2020.100076. Epub 2020 Aug 8.

Automatic Detection of Twitter Users Who Express Chronic Stress Experiences via Supervised Machine Learning and Natural Language Processing.基于监督机器学习和自然语言处理的 Twitter 用户慢性应激体验自动检测。

Comput Inform Nurs. 2023 Sep 1;41(9):717-724. doi: 10.1097/CIN.0000000000000985.

Discovering Cohorts of Pregnant Women From Social Media for Safety Surveillance and Analysis.从社交媒体中发现孕妇群体以进行安全监测与分析。

J Med Internet Res. 2017 Oct 30;19(10):e361. doi: 10.2196/jmir.8164.

"Leaving Was a Process, Not an Event": The Lived Experience of Dating and Domestic Violence in 140 Characters.“离开是一个过程，而不是一个事件”：在 140 个字符中体验约会和家庭暴力。

J Interpers Violence. 2021 Jun;36(11-12):NP6553-NP6580. doi: 10.1177/0886260518816325. Epub 2018 Dec 5.

Developing an Automatic System for Classifying Chatter About Health Services on Twitter: Case Study for Medicaid.开发一个自动系统来对 Twitter 上有关医疗服务的闲聊进行分类：以医疗补助计划为例。

J Med Internet Res. 2021 May 3;23(5):e26616. doi: 10.2196/26616.

Inj Prev. 2021 Apr;27(2):137-144. doi: 10.1136/injuryprev-2020-043704. Epub 2020 Aug 24.

An aspect-level sentiment analysis dataset for therapies on Twitter.一个用于推特上疗法的方面级情感分析数据集。

Data Brief. 2023 Sep 23;50:109618. doi: 10.1016/j.dib.2023.109618. eCollection 2023 Oct.

Training healthcare providers to respond to intimate partner violence against women.培训医疗保健提供者以应对针对妇女的亲密伴侣暴力。

Cochrane Database Syst Rev. 2021 May 31;5(5):CD012423. doi: 10.1002/14651858.CD012423.pub2.

引用本文的文献

The Utilization of Natural Language Processing for Analyzing Social Media Data in Nursing Research: A Scoping Review.自然语言处理在护理研究中分析社交媒体数据的应用：一项范围综述

J Nurs Manag. 2024 Dec 30;2024:2857497. doi: 10.1155/jonm/2857497. eCollection 2024.

Violence against women and girls research: Leveraging gains across disciplines.针对妇女和女童的暴力行为研究：利用各学科的成果。

Proc Natl Acad Sci U S A. 2025 Jan 28;122(4):e2404557122. doi: 10.1073/pnas.2404557122. Epub 2025 Jan 23.

Using Artificial Intelligence to Detect Risk of Family Violence: Protocol for a Systematic Review and Meta-Analysis.利用人工智能检测家庭暴力风险：系统评价与荟萃分析方案

JMIR Res Protoc. 2024 Dec 2;13:e54966. doi: 10.2196/54966.

Automatic Detection of Intimate Partner Violence Victims from Social Media for Proactive Delivery of Support.通过社交媒体自动检测亲密伴侣暴力受害者，以便主动提供支持。

AMIA Jt Summits Transl Sci Proc. 2023 Jun 16;2023:254-260. eCollection 2023.

Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media.用于社交媒体偏头痛报告的通用自然语言处理框架

AMIA Jt Summits Transl Sci Proc. 2023 Jun 16;2023:261-270. eCollection 2023.

本文引用的文献

Text classification models for the automatic detection of nonmedical prescription medication use from social media.社交媒体中非医疗处方药物使用的自动检测的文本分类模型。

BMC Med Inform Decis Mak. 2021 Jan 26;21(1):27. doi: 10.1186/s12911-021-01394-0.

The risk of racial bias while tracking influenza-related content on social media using machine learning.使用机器学习追踪社交媒体上与流感相关内容时存在种族偏见的风险。

J Am Med Inform Assoc. 2021 Mar 18;28(4):839-849. doi: 10.1093/jamia/ocaa326.

COVID-19 and the rise of intimate partner violence.新冠疫情与亲密伴侣暴力行为的增加

World Dev. 2021 Jan;137:105217. doi: 10.1016/j.worlddev.2020.105217. Epub 2020 Sep 29.

Effectiveness of ICT-based intimate partner violence interventions: a systematic review.基于信息通信技术的亲密伴侣暴力干预措施的有效性：系统评价。

BMC Public Health. 2020 Sep 7;20(1):1372. doi: 10.1186/s12889-020-09408-8.

Exacerbation of Physical Intimate Partner Violence during COVID-19 Pandemic.新冠疫情期间身体亲密伴侣暴力的加剧。

Radiology. 2021 Jan;298(1):E38-E45. doi: 10.1148/radiol.2020202866. Epub 2020 Aug 13.

The impact of the Covid-19 pandemic in the precipitation of intimate partner violence.Covid-19 大流行对亲密伴侣暴力事件发生的影响。

Int J Law Psychiatry. 2020 Jul-Aug;71:101606. doi: 10.1016/j.ijlp.2020.101606. Epub 2020 Jun 26.

Alarming trends in US domestic violence during the COVID-19 pandemic.新冠疫情期间美国家庭暴力的惊人趋势。

Am J Emerg Med. 2020 Dec;38(12):2753-2755. doi: 10.1016/j.ajem.2020.04.077. Epub 2020 Apr 28.

Social Media and Emergency Preparedness in Response to Novel Coronavirus.社交媒体与应对新型冠状病毒的应急准备

JAMA. 2020 May 26;323(20):2011-2012. doi: 10.1001/jama.2020.4469.

Meta-analysis and systematic review for the treatment of perpetrators of intimate partner violence.元分析和系统评价治疗亲密伴侣暴力的施害者。

Neurosci Biobehav Rev. 2019 Oct;105:220-230. doi: 10.1016/j.neubiorev.2019.08.006. Epub 2019 Aug 12.

Lifetime Economic Burden of Intimate Partner Violence Among U.S. Adults.美国成年人亲密伴侣暴力的终身经济负担。

Am J Prev Med. 2018 Oct;55(4):433-444. doi: 10.1016/j.amepre.2018.04.049. Epub 2018 Aug 22.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于自动识别来自推特的亲密伴侣暴力报告的自然语言模型。

Natural language model for automatic identification of Intimate Partner Violence reports from Twitter.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献