• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自然语言处理模型能否从精神保健电子记录中提取和分类人际暴力实例:一项应用评估研究。

Can natural language processing models extract and classify instances of interpersonal violence in mental healthcare electronic records: an applied evaluative study.

机构信息

School of Medical Education, Guy's, King's and St Thomas' School of Medicine, London, UK

Section of Women's Mental Health, Department of Health Services and Population Research, King's College London, London, UK.

出版信息

BMJ Open. 2022 Feb 16;12(2):e052911. doi: 10.1136/bmjopen-2021-052911.

DOI:10.1136/bmjopen-2021-052911
PMID:35172999
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8852656/
Abstract

OBJECTIVE

This paper evaluates the application of a natural language processing (NLP) model for extracting clinical text referring to interpersonal violence using electronic health records (EHRs) from a large mental healthcare provider.

DESIGN

A multidisciplinary team iteratively developed guidelines for annotating clinical text referring to violence. Keywords were used to generate a dataset which was annotated (ie, classified as affirmed, negated or irrelevant) for: presence of violence, patient status (ie, as perpetrator, witness and/or victim of violence) and violence type (domestic, physical and/or sexual). An NLP approach using a pretrained transformer model, BioBERT (Bidirectional Encoder Representations from Transformers for Biomedical Text Mining) was fine-tuned on the annotated dataset and evaluated using 10-fold cross-validation.

SETTING

We used the Clinical Records Interactive Search (CRIS) database, comprising over 500 000 de-identified EHRs of patients within the South London and Maudsley NHS Foundation Trust, a specialist mental healthcare provider serving an urban catchment area.

PARTICIPANTS

Searches of CRIS were carried out based on 17 predefined keywords. Randomly selected text fragments were taken from the results for each keyword, amounting to 3771 text fragments from the records of 2832 patients.

OUTCOME MEASURES

We estimated precision, recall and F1 score for each NLP model. We examined sociodemographic and clinical variables in patients giving rise to the text data, and frequencies for each annotated violence characteristic.

RESULTS

Binary classification models were developed for six labels (violence presence, perpetrator, victim, domestic, physical and sexual). Among annotations affirmed for the presence of any violence, 78% (1724) referred to physical violence, 61% (1350) referred to patients as perpetrator and 33% (731) to domestic violence. NLP models' precision ranged from 89% (perpetrator) to 98% (sexual); recall ranged from 89% (victim, perpetrator) to 97% (sexual).

CONCLUSIONS

State of the art NLP models can extract and classify clinical text on violence from EHRs at acceptable levels of scale, efficiency and accuracy.

摘要

目的

本文评估了自然语言处理(NLP)模型在从大型精神保健服务提供商的电子健康记录(EHR)中提取临床文本中涉及人际暴力信息的应用。

设计

一个多学科团队迭代制定了用于注释涉及暴力的临床文本的指南。使用关键词生成了一个数据集,该数据集经过注释(即,分类为肯定、否定或无关),用于:暴力的存在、患者状态(即,作为暴力的实施者、目击者和/或受害者)和暴力类型(家庭、身体和/或性)。使用经过预训练的转换器模型 BioBERT(用于生物医学文本挖掘的双向编码器表示转换器)的 NLP 方法在注释数据集上进行了微调,并使用 10 折交叉验证进行了评估。

设置

我们使用了包含超过 500 000 名来自伦敦南部和莫兹利国民保健信托基金会患者的去识别 EHR 的临床记录交互搜索(CRIS)数据库,这是一家专门的精神保健服务提供商,服务于一个城市集水区。

参与者

根据 17 个预定义的关键词进行了 CRIS 搜索。从每个关键词的结果中随机选择文本片段,从 2832 名患者的记录中获得了 3771 个文本片段。

结果

为六个标签(暴力存在、实施者、受害者、家庭、身体和性)开发了二进制分类模型。在肯定存在任何暴力的注释中,78%(1724)涉及身体暴力,61%(1350)涉及患者作为实施者,33%(731)涉及家庭暴力。NLP 模型的精度范围从 89%(实施者)到 98%(性);召回率范围从 89%(受害者、实施者)到 97%(性)。

结论

最先进的 NLP 模型可以从 EHR 中以可接受的规模、效率和准确性提取和分类关于暴力的临床文本。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4a9/8852656/5186c7f9c019/bmjopen-2021-052911f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4a9/8852656/99763fd8e842/bmjopen-2021-052911f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4a9/8852656/5186c7f9c019/bmjopen-2021-052911f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4a9/8852656/99763fd8e842/bmjopen-2021-052911f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4a9/8852656/5186c7f9c019/bmjopen-2021-052911f02.jpg

相似文献

1
Can natural language processing models extract and classify instances of interpersonal violence in mental healthcare electronic records: an applied evaluative study.自然语言处理模型能否从精神保健电子记录中提取和分类人际暴力实例:一项应用评估研究。
BMJ Open. 2022 Feb 16;12(2):e052911. doi: 10.1136/bmjopen-2021-052911.
2
Enhancing suicidal behavior detection in EHRs: A multi-label NLP framework with transformer models and semantic retrieval-based annotation.增强电子健康记录中的自杀行为检测:一种基于变压器模型和语义检索注释的多标签自然语言处理框架。
J Biomed Inform. 2025 Jan;161:104755. doi: 10.1016/j.jbi.2024.104755. Epub 2024 Dec 2.
3
Text mining occupations from the mental health electronic health record: a natural language processing approach using records from the Clinical Record Interactive Search (CRIS) platform in south London, UK.从精神健康电子健康记录中挖掘文本职业信息:使用英国伦敦南部临床记录交互检索(CRIS)平台记录的自然语言处理方法。
BMJ Open. 2021 Mar 25;11(3):e042274. doi: 10.1136/bmjopen-2020-042274.
4
A Natural Language Processing Model for COVID-19 Detection Based on Dutch General Practice Electronic Health Records by Using Bidirectional Encoder Representations From Transformers: Development and Validation Study.基于荷兰全科电子健康记录的 COVID-19 检测自然语言处理模型:使用转换器的双向编码器表示进行开发和验证研究。
J Med Internet Res. 2023 Oct 4;25:e49944. doi: 10.2196/49944.
5
Natural language processing (NLP) to facilitate abstract review in medical research: the application of BioBERT to exploring the 20-year use of NLP in medical research.自然语言处理(NLP)在医学研究中的应用:BioBERT 在探索 20 年来 NLP 在医学研究中的应用。
Syst Rev. 2024 Apr 15;13(1):107. doi: 10.1186/s13643-024-02470-y.
6
Investigating online activity in UK adolescent mental health patients: a feasibility study using a natural language processing approach for electronic health records.利用自然语言处理方法研究英国青少年心理健康患者的在线活动:一项电子健康记录的可行性研究。
BMJ Open. 2023 May 25;13(5):e061640. doi: 10.1136/bmjopen-2022-061640.
7
Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project.利用自然语言处理技术从临床文本中提取严重精神疾病症状:临床记录交互式搜索综合数据提取(CRIS-CODE)项目。
BMJ Open. 2017 Jan 17;7(1):e012012. doi: 10.1136/bmjopen-2016-012012.
8
Development of a Corpus Annotated With Mentions of Pain in Mental Health Records: Natural Language Processing Approach.心理健康记录中提及疼痛的语料库开发:自然语言处理方法
JMIR Form Res. 2023 Jun 26;7:e45849. doi: 10.2196/45849.
9
Few-Shot Learning for Clinical Natural Language Processing Using Siamese Neural Networks: Algorithm Development and Validation Study.使用暹罗神经网络的临床自然语言处理少样本学习:算法开发与验证研究
JMIR AI. 2023 May 4;2:e44293. doi: 10.2196/44293.
10
A large language model-based generative natural language processing framework fine-tuned on clinical notes accurately extracts headache frequency from electronic health records.基于大型语言模型的生成式自然语言处理框架,在临床笔记上进行了微调,能够从电子健康记录中准确提取头痛频率。
Headache. 2024 Apr;64(4):400-409. doi: 10.1111/head.14702. Epub 2024 Mar 25.

引用本文的文献

1
Supervised Natural Language Processing Classification of Violent Death Narratives: Development and Assessment of a Compact Large Language Model.暴力死亡叙事的监督式自然语言处理分类:一个紧凑大语言模型的开发与评估
JMIR AI. 2025 Jun 19;4:e68212. doi: 10.2196/68212.
2
Artificial Intelligence in Psychiatry: A Review of Biological and Behavioral Data Analyses.精神病学中的人工智能:生物和行为数据分析综述
Diagnostics (Basel). 2025 Feb 11;15(4):434. doi: 10.3390/diagnostics15040434.
3
Using Artificial Intelligence to Detect Risk of Family Violence: Protocol for a Systematic Review and Meta-Analysis.

本文引用的文献

1
A natural language processing approach for identifying temporal disease onset information from mental healthcare text.一种从精神保健文本中识别疾病发病时间信息的自然语言处理方法。
Sci Rep. 2021 Jan 12;11(1):757. doi: 10.1038/s41598-020-80457-0.
2
The association between neighbourhood characteristics and physical victimisation in men and women with mental disorders.患有精神障碍的男性和女性的邻里特征与身体伤害之间的关联。
BJPsych Open. 2020 Jul 16;6(4):e73. doi: 10.1192/bjo.2020.52.
3
Efficient Reuse of Natural Language Processing Models for Phenotype-Mention Identification in Free-text Electronic Medical Records: A Phenotype Embedding Approach.
利用人工智能检测家庭暴力风险:系统评价与荟萃分析方案
JMIR Res Protoc. 2024 Dec 2;13:e54966. doi: 10.2196/54966.
4
Applying neural network algorithms to ascertain reported experiences of violence in routine mental healthcare records and distributions of reports by diagnosis.应用神经网络算法来确定常规精神卫生保健记录中所报告的暴力经历以及按诊断分类的报告分布情况。
Front Psychiatry. 2024 Sep 10;15:1181739. doi: 10.3389/fpsyt.2024.1181739. eCollection 2024.
5
A Systematic Literature Review of the Use of Computational Text Analysis Methods in Intimate Partner Violence Research.关于计算文本分析方法在亲密伴侣暴力研究中应用的系统文献综述
J Fam Violence. 2023 Mar 21:1-20. doi: 10.1007/s10896-023-00517-7.
6
Characterizing the Differences in Descriptions of Violence on Reddit During the COVID-19 Pandemic.描述 COVID-19 大流行期间 Reddit 上暴力行为的差异。
J Interpers Violence. 2023 Aug;38(15-16):9290-9314. doi: 10.1177/08862605231163885. Epub 2023 Mar 28.
7
Adverse outcomes associated with recorded victimization in mental health electronic records during the first UK COVID-19 lockdown.与英国首次 COVID-19 封锁期间心理健康电子记录中记录的受害相关的不良后果。
Soc Psychiatry Psychiatr Epidemiol. 2023 Mar;58(3):431-440. doi: 10.1007/s00127-022-02393-w. Epub 2022 Nov 24.
自然语言处理模型在自由文本电子病历中进行表型提及识别的高效复用:一种表型嵌入方法。
JMIR Med Inform. 2019 Dec 17;7(4):e14782. doi: 10.2196/14782.
4
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
5
Enhancing clinical concept extraction with contextual embeddings.利用上下文嵌入增强临床概念提取。
J Am Med Inform Assoc. 2019 Nov 1;26(11):1297-1304. doi: 10.1093/jamia/ocz096.
6
Psychiatric symptoms and risk of victimisation: a population-based study from Southeast London.精神症状与受害风险:来自伦敦东南部的一项基于人群的研究。
Epidemiol Psychiatr Sci. 2019 Apr;28(2):168-178. doi: 10.1017/S2045796018000537. Epub 2018 Sep 10.
7
CLAMP - a toolkit for efficiently building customized clinical natural language processing pipelines.CLAMP - 一个用于高效构建定制化临床自然语言处理管道的工具包。
J Am Med Inform Assoc. 2018 Mar 1;25(3):331-336. doi: 10.1093/jamia/ocx132.
8
Clinical information extraction applications: A literature review.临床信息提取应用:文献综述。
J Biomed Inform. 2018 Jan;77:34-49. doi: 10.1016/j.jbi.2017.11.011. Epub 2017 Nov 21.
9
Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review.用于捕获和标准化非结构化临床信息的自然语言处理系统:一项系统综述。
J Biomed Inform. 2017 Sep;73:14-29. doi: 10.1016/j.jbi.2017.07.012. Epub 2017 Jul 17.
10
Effect of implementation of a smoke-free policy on physical violence in a psychiatric inpatient setting: an interrupted time series analysis.无烟政策的实施对精神科住院环境中身体暴力的影响:一项中断时间序列分析。
Lancet Psychiatry. 2017 Jul;4(7):540-546. doi: 10.1016/S2215-0366(17)30209-2. Epub 2017 Jun 14.