Developing a Natural Language Processing tool to identify perinatal self-harm in electronic healthcare records.

Affiliations

Section of Women's Mental Health, Health Service and Population Research Department, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom.

South London and Maudsley NHS Foundation Trust, Bethlem Royal Hospital, Kent, London, United Kingdom.

Publication information

PLoS One. 2021 Aug 4;16(8):e0253809. doi: 10.1371/journal.pone.0253809. eCollection 2021.

DOI:10.1371/journal.pone.0253809
PMID:34347787
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8336818/
Abstract

BACKGROUND

Self-harm occurring within pregnancy and the postnatal year ("perinatal self-harm") is a clinically important yet under-researched topic. Current research likely under-estimates prevalence due to methodological limitations. Electronic healthcare records (EHRs) provide a source of clinically rich data on perinatal self-harm.

AIMS

(1) To create a Natural Language Processing (NLP) tool that can, with acceptable precision and recall, identify mentions of acts of perinatal self-harm within EHRs. (2) To use this tool to identify service-users who have self-harmed perinatally, based on their EHRs.

METHODS

We used the Clinical Record Interactive Search system to extract de-identified EHRs of secondary mental healthcare service-users at South London and Maudsley NHS Foundation Trust. We developed a tool that applied several layers of linguistic processing based on the spaCy NLP library for Python. We evaluated mention-level performance in the following domains: span, status, temporality and polarity. Evaluation was done against a manually coded reference standard. Mention-level performance was reported as precision, recall, F-score and Cohen's kappa for each domain. We also assessed performance at service-user level and explored whether a heuristic rule improved it. We report per-class statistics for service-user-level performance, as well as likelihood ratios and post-test probabilities.
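The mention-level metrics named above can be computed from two aligned lists of labels. The following is a minimal Python sketch, not the authors' evaluation code: the function names, the label values ("current", "historical") and the toy data are all illustrative assumptions, and only the standard library is used.

```python
from collections import Counter


def precision_recall_f1(gold, pred, positive):
    """Precision, recall and F-score for one class (the `positive` label)."""
    pairs = list(zip(gold, pred))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa: agreement between two label lists, corrected for chance."""
    n = len(labels_a)
    observed = sum(1 for a, b in zip(labels_a, labels_b) if a == b) / n
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    # Chance agreement: product of the two marginal proportions per label.
    expected = sum(
        count_a[label] * count_b[label] for label in set(count_a) | set(count_b)
    ) / (n * n)
    if expected == 1.0:
        return 1.0
    return (observed - expected) / (1 - expected)


# Toy temporality labels (hypothetical): reference standard vs. tool output.
gold = ["current", "current", "historical", "current", "historical", "historical"]
pred = ["current", "historical", "historical", "current", "current", "historical"]

precision, recall, f1 = precision_recall_f1(gold, pred, positive="current")
kappa = cohen_kappa(gold, pred)
print(precision, recall, f1, kappa)
```

Kappa subtracts the agreement expected by chance from the observed agreement, which is why it can be noticeably lower than the raw proportion of matching labels on the same data.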

RESULTS

Mention-level performance: micro-averaged F-score, precision and recall for span, polarity and temporality were all >0.8. Cohen's kappa was 0.68 for status, 0.62 for temporality and 0.91 for polarity. Service-user-level performance with the heuristic rule: F-score, precision and recall of the minority class 0.69; macro-averaged F-score 0.81; positive likelihood ratio (LR) 9.4 (4.8-19); post-test probability 69.0% (53-82%). Considering the task difficulty, the tool performs well, although temporality was the attribute with the lowest level of annotator agreement.
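A positive likelihood ratio and a post-test probability are linked through odds: post-test odds equal pre-test odds multiplied by the likelihood ratio. The short Python sketch below shows that conversion in both directions. The function names are illustrative, and the pre-test probability it back-calculates is not reported in the abstract; it is derived from the rounded figures (9.4 and 69.0%) purely as an illustration and may differ from the prevalence in the study sample.

```python
def post_test_probability(pre_test_probability, likelihood_ratio):
    """Convert a pre-test probability to a post-test probability via odds."""
    pre_odds = pre_test_probability / (1 - pre_test_probability)
    post_odds = pre_odds * likelihood_ratio
    return post_odds / (1 + post_odds)


def implied_pre_test_probability(post_test_prob, likelihood_ratio):
    """Invert the conversion: which pre-test probability gives this result?"""
    post_odds = post_test_prob / (1 - post_test_prob)
    pre_odds = post_odds / likelihood_ratio
    return pre_odds / (1 + pre_odds)


# Figures reported in the abstract (rounded): positive LR 9.4, post-test 69.0%.
implied = implied_pre_test_probability(0.69, 9.4)

# Round trip: feeding the implied value back in recovers the reported 69%.
round_trip = post_test_probability(implied, 9.4)
print(f"implied pre-test probability: {implied:.3f}")
print(f"round-trip post-test probability: {round_trip:.3f}")
```

Under these assumptions the implied pre-test probability comes out at roughly 0.19, so a positive result from the tool raises the probability of perinatal self-harm from about one in five to about two in three.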

CONCLUSIONS

It is feasible to develop an NLP tool that identifies, with acceptable validity, mentions of perinatal self-harm within EHRs, although with limitations regarding temporality. Using a heuristic rule, it can also function at a service-user-level.
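The abstract does not say what the heuristic rule is. As a purely hypothetical illustration of how mention-level output could be aggregated to service-user level, the Python sketch below flags a service-user once they have at least a threshold number of positive, current mentions. The `Mention` fields, the label values and the threshold of 2 are all assumptions made for this example, not the rule used in the study.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Mention:
    """One self-harm mention produced by a mention-level tool (hypothetical schema)."""
    service_user_id: str
    polarity: str      # assumed values: "positive" or "negated"
    temporality: str   # assumed values: "current" or "historical"


def flag_service_users(mentions, min_mentions=2):
    """Return IDs with at least `min_mentions` positive, current mentions."""
    counts = Counter(
        m.service_user_id
        for m in mentions
        if m.polarity == "positive" and m.temporality == "current"
    )
    return {user_id for user_id, n in counts.items() if n >= min_mentions}


# Toy data (hypothetical): three service-users with differing evidence.
mentions = [
    Mention("A", "positive", "current"),
    Mention("A", "positive", "current"),
    Mention("B", "positive", "current"),
    Mention("B", "negated", "current"),
    Mention("C", "positive", "historical"),
    Mention("C", "positive", "historical"),
]

flagged = flag_service_users(mentions)
print(sorted(flagged))
```

Requiring more than one qualifying mention trades recall for precision: a single misclassified mention no longer flags a service-user, but someone with only one documented act would be missed.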

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ead/8336818/f465888f6a38/pone.0253809.g001.jpg
