Detecting substance-related problems in narrative investigation summaries of child abuse and neglect using text mining and machine learning.

Author affiliations

Child and Adolescent Data Lab, University of Michigan, School of Social Work, 1080 S University Ave, Ann Arbor, MI, 48109, United States.

Indiana University School of Social Work, 902 West New York Street, Indianapolis, Indiana, 46202, United States.

Publication information

Child Abuse Negl. 2019 Dec;98:104180. doi: 10.1016/j.chiabu.2019.104180. Epub 2019 Sep 12.

DOI: 10.1016/j.chiabu.2019.104180
PMID: 31521909
Abstract

BACKGROUND

State child welfare agencies collect, store, and manage vast amounts of data. However, they often lack the right data, or the data they hold are problematic or difficult to use for informing strategies to improve services and system processes. Considerable resources are required to read and code the text data these agencies hold. Data science and text mining offer potentially efficient and cost-effective strategies for maximizing the value of these data.

OBJECTIVE

The current study tests the feasibility of using text mining for extracting information from unstructured text to better understand substance-related problems among families investigated for abuse or neglect.

METHOD

A state child welfare agency provided written summaries from investigations of child abuse and neglect. Expert human reviewers coded 2956 investigation summaries based on whether the caseworker observed a substance-related problem. These coded documents were used to develop, train, and validate computer models that could perform the coding on an automated basis.
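
The abstract does not say which model families or text features were used, so the following is only a minimal sketch of the general workflow it describes: fit a classifier on human-coded summaries, then score it against held-out human codes. It assumes a bag-of-words multinomial naive Bayes model as a stand-in, and every summary below is invented for illustration rather than taken from the study.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a summary and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesTextClassifier:
    """Multinomial naive Bayes over word counts with add-one smoothing.

    Hypothetical stand-in model; the study's actual models are not
    described in the abstract.
    """

    def fit(self, texts, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = set()
        for counts in self.word_counts.values():
            self.vocab.update(counts)
        return self

    def predict(self, text):
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, -math.inf
        for c in self.classes:
            # Log prior plus summed log likelihoods of the known tokens.
            score = math.log(self.doc_counts[c] / total_docs)
            denom = sum(self.word_counts[c].values()) + len(self.vocab)
            for tok in tokenize(text):
                if tok in self.vocab:  # ignore words never seen in training
                    score += math.log((self.word_counts[c][tok] + 1) / denom)
            if score > best_score:
                best_label, best_score = c, score
        return best_label


# Invented training summaries; label 1 = substance-related problem observed.
train_texts = [
    "caseworker observed empty alcohol bottles and mother appeared intoxicated",
    "father admitted to heroin use and drug paraphernalia was found in the home",
    "mother tested positive for methamphetamine during the investigation",
    "home was clean and children were appropriately dressed and fed",
    "no concerns noted and the children attend school regularly",
    "family has adequate food and housing and the allegation was unfounded",
]
train_labels = [1, 1, 1, 0, 0, 0]

# Invented held-out summaries paired with their human-assigned codes.
held_out = [
    ("parent appeared intoxicated and alcohol was found in the home", 1),
    ("children were fed and the home was clean", 0),
]

model = NaiveBayesTextClassifier().fit(train_texts, train_labels)
predictions = [model.predict(text) for text, _ in held_out]
accuracy = sum(
    pred == gold for pred, (_, gold) in zip(predictions, held_out)
) / len(held_out)
print(f"held-out accuracy: {accuracy:.2f}")
```

With a real corpus, the held-out set would be a random split of the human-coded documents that the model never sees during fitting, so the accuracy reflects performance on new summaries.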

RESULTS

A set of computer models achieved greater than 90% accuracy when judged against expert human reviewers. Fleiss' kappa estimates among computer models and expert human reviewers exceeded 0.80, indicating that expert human reviewer ratings are exchangeable with the computer models.
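
For readers unfamiliar with the agreement statistic reported here, the sketch below computes Fleiss' kappa from its standard definition: mean observed per-item agreement, corrected for the agreement expected by chance from the overall category proportions. The ratings matrix is invented for illustration (three hypothetical raters, which could be any mix of human reviewers and models) and does not reproduce the study's data.

```python
def fleiss_kappa(ratings):
    """Fleiss' kappa for a fixed number of raters per item.

    ratings: list of per-item lists, each holding one category label
    per rater (every item must have the same number of raters).
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])
    categories = sorted({label for item in ratings for label in item})

    # counts[i][j] = number of raters who put item i in category j
    counts = [[item.count(c) for c in categories] for item in ratings]

    # Observed agreement: share of agreeing rater pairs, averaged over items.
    p_items = [
        (sum(n * n for n in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_items) / n_items

    # Chance agreement from the overall proportion of each category.
    p_cats = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(len(categories))
    ]
    p_e = sum(p * p for p in p_cats)

    if p_e == 1:
        raise ValueError("kappa is undefined when only one category is used")
    return (p_bar - p_e) / (1 - p_e)


# Invented ratings: 5 summaries, 3 raters each, 1 = substance problem coded.
toy_ratings = [
    [1, 1, 1],
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
    [1, 1, 1],
]
kappa = fleiss_kappa(toy_ratings)
print(f"Fleiss' kappa: {kappa:.3f}")
```

A single disagreement in five items already pulls the toy value well below 1, which illustrates why kappa is a stricter summary than raw percent agreement.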

CONCLUSION

These results provide compelling evidence that text mining procedures can be a cost-effective and efficient solution for extracting meaningful insights from unstructured text data. Additional research is necessary to understand how to extract the actionable insights from these under-utilized stores of data in child welfare.

Similar articles

1. Detecting substance-related problems in narrative investigation summaries of child abuse and neglect using text mining and machine learning.
   Child Abuse Negl. 2019 Dec;98:104180. doi: 10.1016/j.chiabu.2019.104180. Epub 2019 Sep 12.
2. A text-based approach to measuring opioid-related risk among families involved in the child welfare system.
   Child Abuse Negl. 2022 Sep;131:105688. doi: 10.1016/j.chiabu.2022.105688. Epub 2022 Jun 7.
3. Can coders abstract child maltreatment variables from child welfare administrative data and case narratives for public health surveillance in Canada?
   Child Abuse Negl. 2019 Jun;92:77-84. doi: 10.1016/j.chiabu.2019.03.020. Epub 2019 Mar 29.
4. The use of narrative text for injury surveillance research: a systematic review.
   Accid Anal Prev. 2010 Mar;42(2):354-63. doi: 10.1016/j.aap.2009.09.020. Epub 2009 Oct 24.
5. Prevalence and context of firearms-related problems in child protective service investigations.
   Child Abuse Negl. 2020 Sep;107:104572. doi: 10.1016/j.chiabu.2020.104572. Epub 2020 Jun 5.
6. Domestic violence, parental substance misuse and the decision to substantiate child maltreatment.
   Child Abuse Negl. 2018 May;79:31-41. doi: 10.1016/j.chiabu.2018.01.030. Epub 2018 Feb 6.
7. Discriminative validity and clinical utility of an abuse-neglect interview for adolescents with conduct and substance use problems.
   Am J Psychiatry. 2003 Aug;160(8):1461-9. doi: 10.1176/appi.ajp.160.8.1461.
8. What Does Child Protective Services Investigate as Neglect? A Population-Based Study.
   Child Maltreat. 2024 Feb;29(1):96-105. doi: 10.1177/10775595221114144. Epub 2022 Jul 13.
9. Machine learning approaches to analysing textual injury surveillance data: a systematic review.
   Accid Anal Prev. 2015 Jun;79:41-9. doi: 10.1016/j.aap.2015.03.018. Epub 2015 Mar 19.
10. Construction accident narrative classification: An evaluation of text mining techniques.
    Accid Anal Prev. 2017 Nov;108:122-130. doi: 10.1016/j.aap.2017.08.026. Epub 2017 Sep 1.

Cited by

1. Leveraging AI to Investigate Child Maltreatment Text Narratives: Promising Benefits and Addressable Risks.
   JMIR Pediatr Parent. 2025 Jul 24;8:e73579. doi: 10.2196/73579.
2. Natural language processing-driven state machines to extract social factors from unstructured clinical documentation.
   JAMIA Open. 2023 Apr 18;6(2):ooad024. doi: 10.1093/jamiaopen/ooad024. eCollection 2023 Jul.
3. Harnessing Machine Learning in Tackling Domestic Violence-An Integrative Review.
   Int J Environ Res Public Health. 2023 Mar 12;20(6):4984. doi: 10.3390/ijerph20064984.
4. Extracting social determinants of health from electronic health records using natural language processing: a systematic review.
   J Am Med Inform Assoc. 2021 Nov 25;28(12):2716-2727. doi: 10.1093/jamia/ocab170.
5. Enabling remote learning system for virtual personalized preferences during COVID-19 pandemic.
   Multimed Tools Appl. 2021;80(24):33329-33355. doi: 10.1007/s11042-021-11414-w. Epub 2021 Aug 17.
6. Psychiatric Disorders Among Older Black Americans: Within- and Between-Group Differences.
   Innov Aging. 2020 Apr 15;4(3):igaa007. doi: 10.1093/geroni/igaa007. eCollection 2020.