Suppr超能文献

探究在网络健康社区中遭受亲密伴侣暴力的女性所获得的支持和建议:文本挖掘方法。

Examining the Supports and Advice That Women With Intimate Partner Violence Experience Received in Online Health Communities: Text Mining Approach.

机构信息

Center for Smart Health, School of Nursing, The Hong Kong Polytechnic University, Hong Kong, China (Hong Kong).

Health and Community Systems, School of Nursing, University of Pittsburgh, Pittsburgh, PA, United States.

出版信息

J Med Internet Res. 2023 Oct 9;25:e48607. doi: 10.2196/48607.

Abstract

BACKGROUND

Intimate partner violence (IPV) is an underreported public health crisis primarily affecting women associated with severe health conditions and can lead to a high rate of homicide. Owing to the COVID-19 pandemic, more women with IPV experiences visited online health communities (OHCs) to seek help because of anonymity. However, little is known regarding whether their help requests were answered and whether the information provided was delivered in an appropriate manner. To understand the help-seeking information sought and given in OHCs, extraction of postings and linguistic features could be helpful to develop automated models to improve future help-seeking experiences.

OBJECTIVE

The objective of this study was to examine the types and patterns (ie, communication styles) of the advice offered by OHC members and whether the information received from women matched their expressed needs in their initial postings.

METHODS

We examined data from Reddit using data from subreddit community r/domesticviolence posts from November 14, 2020, through November 14, 2021, during the COVID-19 pandemic. We included posts from women aged ≥18 years who self-identified or described experiencing IPV and requested advice or help in this subreddit community. Posts from nonabused women and women aged <18 years, non-English posts, good news announcements, gratitude posts without any advice seeking, and posts related to advertisements were excluded. We developed a codebook and annotated the postings in an iterative manner. Initial posts were also quantified using Linguistic Inquiry and Word Count to categorize linguistic and posting features. Postings were then classified into 2 categories (ie, matched needs and unmatched needs) according to the types of help sought and received in OHCs to capture the help-seeking result. Nonparametric statistical analysis (ie, 2-tailed t test or Mann-Whitney U test) was used to compare the linguistic and posting features between matched and unmatched needs.

RESULTS

Overall, 250 postings were included, and 200 (80%) posting response comments matched with the type of help requested in initial postings, with legal advice and IPV knowledge achieving the highest matching rate. Overall, 17 linguistic or posting features were found to be significantly different between the 2 groups (ie, matched help and unmatched help). Positive title sentiment and linguistic features in postings containing health and wellness wordings were associated with unmatched needs postings, whereas the other 14 features were associated with postings with matched needs.

CONCLUSIONS

OHCs can extract the linguistic and posting features to understand the help-seeking result among women with IPV experiences. Features identified in this corpus reflected the differences found between the 2 groups. This is the first study that leveraged Linguistic Inquiry and Word Count to shed light on generating predictive features from unstructured text in OHCs, which could guide future algorithm development to detect help-seeking results within OHCs effectively.

摘要

背景

亲密伴侣暴力(IPV)是一种未被充分报道的公共卫生危机,主要影响与严重健康状况相关的女性,并可能导致高凶杀率。由于 COVID-19 大流行,更多遭受 IPV 的女性因匿名而选择访问在线健康社区(OHC)寻求帮助。然而,人们对她们的求助是否得到回应以及所提供的信息是否以适当的方式传递知之甚少。为了了解 OHC 中寻求和提供的帮助信息,可以提取帖子和语言特征,以帮助开发自动化模型来改善未来的求助体验。

目的

本研究旨在检查 OHC 成员提供的建议的类型和模式(即沟通方式),以及从女性那里收到的信息是否符合她们在最初帖子中表达的需求。

方法

我们使用了 Reddit 上的数据,数据来自于 2020 年 11 月 14 日至 2021 年 11 月 14 日期间 COVID-19 大流行期间的 r/domesticviolence 帖子子版块社区。我们纳入了年龄≥18 岁、自我认同或描述遭受 IPV 并在该子版块社区中寻求建议或帮助的女性的帖子。排除了未受虐待的女性和年龄<18 岁的女性、非英语帖子、好消息公告、没有任何寻求建议的感谢帖子以及与广告相关的帖子。我们制定了一个编码手册,并以迭代的方式对帖子进行注释。最初的帖子也使用语言查询和单词计数进行了量化,以对语言和帖子特征进行分类。然后,根据 OHC 中寻求和接收的帮助类型,将帖子分类为 2 类(即匹配需求和不匹配需求),以捕捉帮助寻求的结果。使用非参数统计分析(即双尾 t 检验或曼-惠特尼 U 检验)比较匹配和不匹配需求之间的语言和帖子特征。

结果

总体而言,纳入了 250 个帖子,其中 200 个(80%)帖子回复评论与初始帖子中请求的帮助类型相匹配,法律建议和 IPV 知识的匹配率最高。总体而言,发现 17 个语言或帖子特征在两组之间存在显著差异(即匹配的帮助和不匹配的帮助)。帖子标题情绪为正和包含健康与保健措辞的帖子语言特征与不匹配需求的帖子相关,而其他 14 个特征与匹配需求的帖子相关。

结论

OHC 可以提取语言和帖子特征来了解遭受 IPV 的女性的求助结果。该语料库中确定的特征反映了两组之间的差异。这是第一项利用语言查询和单词计数来揭示 OHC 中从非结构化文本生成预测特征的研究,这可以指导未来算法的开发,以便有效地在 OHC 中检测求助结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb40/10594147/92473a68973b/jmir_v25i1e48607_fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验