• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

患者在说话还是护士在说话?患者-护士录音中的自动说话人类型识别。

Is the patient speaking or the nurse? Automatic speaker type identification in patient-nurse audio recordings.

机构信息

School of Nursing, Columbia University, New York, New York, USA.

Center for Home Care Policy & Research, VNS Health, New York, New York, USA.

出版信息

J Am Med Inform Assoc. 2023 Sep 25;30(10):1673-1683. doi: 10.1093/jamia/ocad139.

DOI:10.1093/jamia/ocad139
PMID:37478477
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10531109/
Abstract

OBJECTIVES

Patient-clinician communication provides valuable explicit and implicit information that may indicate adverse medical conditions and outcomes. However, practical and analytical approaches for audio-recording and analyzing this data stream remain underexplored. This study aimed to 1) analyze patients' and nurses' speech in audio-recorded verbal communication, and 2) develop machine learning (ML) classifiers to effectively differentiate between patient and nurse language.

MATERIALS AND METHODS

Pilot studies were conducted at VNS Health, the largest not-for-profit home healthcare agency in the United States, to optimize audio-recording patient-nurse interactions. We recorded and transcribed 46 interactions, resulting in 3494 "utterances" that were annotated to identify the speaker. We employed natural language processing techniques to generate linguistic features and built various ML classifiers to distinguish between patient and nurse language at both individual and encounter levels.

RESULTS

A support vector machine classifier trained on selected linguistic features from term frequency-inverse document frequency, Linguistic Inquiry and Word Count, Word2Vec, and Medical Concepts in the Unified Medical Language System achieved the highest performance with an AUC-ROC = 99.01 ± 1.97 and an F1-score = 96.82 ± 4.1. The analysis revealed patients' tendency to use informal language and keywords related to "religion," "home," and "money," while nurses utilized more complex sentences focusing on health-related matters and medical issues and were more likely to ask questions.

CONCLUSION

The methods and analytical approach we developed to differentiate patient and nurse language is an important precursor for downstream tasks that aim to analyze patient speech to identify patients at risk of disease and negative health outcomes.

摘要

目的

医患沟通提供了有价值的显性和隐性信息,这些信息可能表明存在不良的医疗状况和结果。然而,用于音频记录和分析该数据流的实用和分析方法仍未得到充分探索。本研究旨在:1)分析音频记录的口头交流中患者和护士的言语;2)开发机器学习(ML)分类器,以有效区分患者和护士的语言。

材料和方法

在 VNS Health(美国最大的非营利性家庭医疗保健机构)进行了试点研究,以优化音频记录患者-护士互动。我们记录并转录了 46 次互动,产生了 3494 个“话语”,这些话语被标注以识别说话者。我们采用自然语言处理技术生成语言特征,并构建了各种 ML 分类器,以在个体和交互层面上区分患者和护士的语言。

结果

在基于词频-逆文档频率、语言探究和词汇计数、Word2Vec 和统一医学语言系统中的医学概念选择的语言特征上训练的支持向量机分类器表现最佳,AUC-ROC=99.01±1.97,F1 分数=96.82±4.1。分析显示患者倾向于使用非正式语言和与“宗教”、“家庭”和“金钱”相关的关键词,而护士则使用更复杂的句子,侧重于与健康相关的问题和医疗问题,并且更有可能提问。

结论

我们开发的区分患者和护士语言的方法和分析方法是分析患者言语以识别有疾病和不良健康结果风险的患者的下游任务的重要前提。

相似文献

1
Is the patient speaking or the nurse? Automatic speaker type identification in patient-nurse audio recordings.患者在说话还是护士在说话?患者-护士录音中的自动说话人类型识别。
J Am Med Inform Assoc. 2023 Sep 25;30(10):1673-1683. doi: 10.1093/jamia/ocad139.
2
Utilizing patient-nurse verbal communication in building risk identification models: the missing critical data stream in home healthcare.利用医患口头交流来构建风险识别模型:家庭医疗中缺失的关键数据流。
J Am Med Inform Assoc. 2024 Jan 18;31(2):435-444. doi: 10.1093/jamia/ocad195.
3
Does synthetic data augmentation improve the performances of machine learning classifiers for identifying health problems in patient-nurse verbal communications in home healthcare settings?在家庭医疗环境中,合成数据增强能否提高机器学习分类器在患者-护士言语交流中识别健康问题的性能?
J Nurs Scholarsh. 2025 Jan;57(1):47-58. doi: 10.1111/jnu.13004. Epub 2024 Jul 3.
4
Audio Recording Patient-Nurse Verbal Communications in Home Health Care Settings: Pilot Feasibility and Usability Study.家庭医疗环境中患者-护士言语交流的音频记录:初步可行性和可用性研究
JMIR Hum Factors. 2022 May 11;9(2):e35325. doi: 10.2196/35325.
5
Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach.基于机器学习的自然语言处理方法对临床笔记进行医学子域分类。
BMC Med Inform Decis Mak. 2017 Dec 1;17(1):155. doi: 10.1186/s12911-017-0556-8.
6
Performance of machine learning algorithms for dementia assessment: impacts of language tasks, recording media, and modalities.机器学习算法在痴呆评估中的性能:语言任务、记录媒体和模态的影响。
BMC Med Inform Decis Mak. 2023 Mar 3;23(1):45. doi: 10.1186/s12911-023-02122-6.
7
Linguistic dissection of nursing handoffs: Implications for patient safety in varied-acuity hospital settings.护理交接班的语言剖析:对不同病情严重程度医院环境中患者安全的影响。
J Clin Nurs. 2024 Aug;33(8):3077-3088. doi: 10.1111/jocn.17190. Epub 2024 Apr 25.
8
Detecting Recovery Problems Just in Time: Application of Automated Linguistic Analysis and Supervised Machine Learning to an Online Substance Abuse Forum.及时发现康复问题:自动语言分析和监督式机器学习在在线药物滥用论坛中的应用
J Med Internet Res. 2018 Jun 12;20(6):e10136. doi: 10.2196/10136.
9
ADscreen: A speech processing-based screening system for automatic identification of patients with Alzheimer's disease and related dementia.ADscreen:一种基于语音处理的筛查系统,用于自动识别阿尔茨海默病和相关痴呆患者。
Artif Intell Med. 2023 Sep;143:102624. doi: 10.1016/j.artmed.2023.102624. Epub 2023 Jul 17.
10
Do nurses document all discussions of patient problems and nursing interventions in the electronic health record? A pilot study in home healthcare.护士是否会在电子健康记录中记录所有关于患者问题和护理干预措施的讨论?一项家庭医疗保健的试点研究。
JAMIA Open. 2022 May 26;5(2):ooac034. doi: 10.1093/jamiaopen/ooac034. eCollection 2022 Jul.

引用本文的文献

1
Beyond electronic health record data: leveraging natural language processing and machine learning to uncover cognitive insights from patient-nurse verbal communications.超越电子健康记录数据:利用自然语言处理和机器学习从患者与护士的言语交流中挖掘认知见解。
J Am Med Inform Assoc. 2025 Feb 1;32(2):328-340. doi: 10.1093/jamia/ocae300.
2
Decoding disparities: evaluating automatic speech recognition system performance in transcribing Black and White patient verbal communication with nurses in home healthcare.解码差异:评估自动语音识别系统在转录家庭医疗保健中黑人和白人患者与护士的言语交流方面的性能。
JAMIA Open. 2024 Dec 10;7(4):ooae130. doi: 10.1093/jamiaopen/ooae130. eCollection 2024 Dec.
3
Utilizing patient-nurse verbal communication in building risk identification models: the missing critical data stream in home healthcare.利用医患口头交流来构建风险识别模型:家庭医疗中缺失的关键数据流。
J Am Med Inform Assoc. 2024 Jan 18;31(2):435-444. doi: 10.1093/jamia/ocad195.

本文引用的文献

1
Is Auto-generated Transcript of Patient-Nurse Communication Ready to Use for Identifying the Risk for Hospitalizations or Emergency Department Visits in Home Health Care? A Natural Language Processing Pilot Study.患者-护士沟通的自动生成转录本是否可用于识别家庭医疗保健中的住院或急诊就诊风险?一项自然语言处理试点研究。
AMIA Annu Symp Proc. 2023 Apr 29;2022:992-1001. eCollection 2022.
2
Use of Voice Recordings in the Consultation of Patients Seeking Genital Gender-Affirming Surgery: An Opportunity for Broader Application Throughout Surgery?语音记录在寻求生殖器性别确认手术患者会诊中的应用:这是在整个外科手术中更广泛应用的契机吗?
J Surg Res (Houst). 2022;5(4):618-625. doi: 10.26502/jsr.10020269. Epub 2022 Dec 9.
3
The language of healthcare worker emotional exhaustion: A linguistic analysis of longitudinal survey.医护人员情绪耗竭的语言:一项纵向调查的语言分析
Front Psychiatry. 2022 Dec 16;13:1044378. doi: 10.3389/fpsyt.2022.1044378. eCollection 2022.
4
Data-driven identification of heart failure disease states and progression pathways using electronic health records.基于电子健康记录的数据驱动方法识别心力衰竭疾病状态和进展途径。
Sci Rep. 2022 Oct 25;12(1):17871. doi: 10.1038/s41598-022-22398-4.
5
Analyzing patient experiences using natural language processing: development and validation of the artificial intelligence patient reported experience measure (AI-PREM).使用自然语言处理分析患者体验:人工智能患者报告体验测量工具(AI-PREM)的开发和验证。
BMC Med Inform Decis Mak. 2022 Jul 15;22(1):183. doi: 10.1186/s12911-022-01923-5.
6
Coding linguistic elements in clinical interactions: a step-by-step guide for analyzing communication form.临床互动中的编码语言要素:分析交流形式的分步指南。
BMC Med Res Methodol. 2022 Jul 11;22(1):191. doi: 10.1186/s12874-022-01647-0.
7
Do nurses document all discussions of patient problems and nursing interventions in the electronic health record? A pilot study in home healthcare.护士是否会在电子健康记录中记录所有关于患者问题和护理干预措施的讨论?一项家庭医疗保健的试点研究。
JAMIA Open. 2022 May 26;5(2):ooac034. doi: 10.1093/jamiaopen/ooac034. eCollection 2022 Jul.
8
Audio Recording Patient-Nurse Verbal Communications in Home Health Care Settings: Pilot Feasibility and Usability Study.家庭医疗环境中患者-护士言语交流的音频记录:初步可行性和可用性研究
JMIR Hum Factors. 2022 May 11;9(2):e35325. doi: 10.2196/35325.
9
Predicting Working Memory in Healthy Older Adults Using Real-Life Language and Social Context Information: A Machine Learning Approach.使用现实生活中的语言和社会背景信息预测健康老年人的工作记忆:一种机器学习方法。
JMIR Aging. 2022 Mar 8;5(1):e28333. doi: 10.2196/28333.
10
Implementation of a patient-collected audio recording audit & feedback quality improvement program to prevent contextual error: stakeholder perspective.实施患者采集音频记录审核和反馈质量改进计划以预防语境错误:利益相关者视角。
BMC Health Serv Res. 2021 Aug 30;21(1):891. doi: 10.1186/s12913-021-06921-3.