• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

语音病理学对自动说话人验证的影响:一项大规模研究。

The effect of speech pathology on automatic speaker verification: a large-scale study.

机构信息

Pattern Recognition Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91058, Erlangen, Germany.

Speech & Language Processing Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054, Erlangen, Germany.

出版信息

Sci Rep. 2023 Nov 22;13(1):20476. doi: 10.1038/s41598-023-47711-7.

DOI:10.1038/s41598-023-47711-7
PMID:37993490
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10665418/
Abstract

Navigating the challenges of data-driven speech processing, one of the primary hurdles is accessing reliable pathological speech data. While public datasets appear to offer solutions, they come with inherent risks of potential unintended exposure of patient health information via re-identification attacks. Using a comprehensive real-world pathological speech corpus, with over n[Formula: see text]3800 test subjects spanning various age groups and speech disorders, we employed a deep-learning-driven automatic speaker verification (ASV) approach. This resulted in a notable mean equal error rate (EER) of [Formula: see text], outstripping traditional benchmarks. Our comprehensive assessments demonstrate that pathological speech overall faces heightened privacy breach risks compared to healthy speech. Specifically, adults with dysphonia are at heightened re-identification risks, whereas conditions like dysarthria yield results comparable to those of healthy speakers. Crucially, speech intelligibility does not influence the ASV system's performance metrics. In pediatric cases, particularly those with cleft lip and palate, the recording environment plays a decisive role in re-identification. Merging data across pathological types led to a marked EER decrease, suggesting the potential benefits of pathological diversity in ASV, accompanied by a logarithmic boost in ASV effectiveness. In essence, this research sheds light on the dynamics between pathological speech and speaker verification, emphasizing its crucial role in safeguarding patient confidentiality in our increasingly digitized healthcare era.

摘要

在处理数据驱动的语音处理的挑战中,一个主要的障碍是获取可靠的病理性语音数据。虽然公共数据集似乎提供了解决方案,但它们存在通过重新识别攻击潜在地暴露患者健康信息的固有风险。我们使用了一个全面的真实世界病理性语音语料库,其中包含超过 n[公式:见文本]3800 名来自不同年龄组和语音障碍的测试对象,采用了深度学习驱动的自动说话人验证 (ASV) 方法。这导致了一个显著的平均等错误率 (EER) [公式:见文本],超过了传统基准。我们的全面评估表明,与健康语音相比,病理性语音总体上面临更高的隐私泄露风险。具体来说,患有发音障碍的成年人面临更高的重新识别风险,而像口吃这样的情况则产生与健康说话者相当的结果。至关重要的是,语音可懂度不会影响 ASV 系统的性能指标。在儿科病例中,特别是唇腭裂患者,录音环境在重新识别中起着决定性的作用。合并病理性语音数据类型会导致 EER 明显下降,这表明病理性语音多样性在 ASV 中的潜在益处,同时对数级提高 ASV 有效性。本质上,这项研究揭示了病理性语音与说话人验证之间的动态关系,强调了在我们日益数字化的医疗保健时代保护患者隐私的重要作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/02f01e2243b6/41598_2023_47711_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/067ca0f13400/41598_2023_47711_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/5f672d0ae9a6/41598_2023_47711_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/1a62b50cfd02/41598_2023_47711_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/f00c8b5ccf21/41598_2023_47711_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/e476d440e328/41598_2023_47711_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/caa6da922a43/41598_2023_47711_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/503e31fcb015/41598_2023_47711_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/02f01e2243b6/41598_2023_47711_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/067ca0f13400/41598_2023_47711_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/5f672d0ae9a6/41598_2023_47711_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/1a62b50cfd02/41598_2023_47711_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/f00c8b5ccf21/41598_2023_47711_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/e476d440e328/41598_2023_47711_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/caa6da922a43/41598_2023_47711_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/503e31fcb015/41598_2023_47711_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32df/10665418/02f01e2243b6/41598_2023_47711_Figb_HTML.jpg

相似文献

1
The effect of speech pathology on automatic speaker verification: a large-scale study.语音病理学对自动说话人验证的影响:一项大规模研究。
Sci Rep. 2023 Nov 22;13(1):20476. doi: 10.1038/s41598-023-47711-7.
2
Effect of the Visual Presentation of a Craniofacial Syndrome on Speech Intelligibility in Noise.颅面综合征的视觉呈现对噪声环境中言语可懂度的影响。
Cleft Palate Craniofac J. 2019 Sep;56(8):1038-1043. doi: 10.1177/1055665618825403. Epub 2019 Jan 22.
3
Toward Realigning Automatic Speaker Verification in the Era of COVID-19.面向新冠疫情时代的自动说话人验证技术的再调整。
Sensors (Basel). 2022 Mar 30;22(7):2638. doi: 10.3390/s22072638.
4
Evaluation of speech intelligibility for children with cleft lip and palate by means of automatic speech recognition.通过自动语音识别评估唇腭裂儿童的言语清晰度
Int J Pediatr Otorhinolaryngol. 2006 Oct;70(10):1741-7. doi: 10.1016/j.ijporl.2006.05.016. Epub 2006 Jun 30.
5
Assessing intelligibility in speakers with cleft palate: a critical review of the literature.评估腭裂患者的言语清晰度:文献综述
Cleft Palate Craniofac J. 2002 Jan;39(1):50-8. doi: 10.1597/1545-1569_2002_039_0050_aiiswc_2.0.co_2.
6
Addressing challenges in speaker anonymization to maintain utility while ensuring privacy of pathological speech.应对说话者匿名化中的挑战,在确保病理性语音隐私的同时保持其可用性。
Commun Med (Lond). 2024 Sep 25;4(1):182. doi: 10.1038/s43856-024-00609-5.
7
Untrained listeners' ratings of speech disorders in a group with cleft palate: a comparison with speech and language pathologists' ratings.未经训练的听众对腭裂患者语音障碍的评估:与言语语言病理学家评估的比较。
Int J Lang Commun Disord. 2009 Sep-Oct;44(5):656-74. doi: 10.1080/13682820802295203.
8
Parent questionnaire for screening early language development in children with cleft palate.用于筛查腭裂患儿早期语言发育的家长问卷。
Cleft Palate Craniofac J. 1995 Jan;32(1):7-13. doi: 10.1597/1545-1569_1995_032_0007_pqfsel_2.3.co_2.
9
The Impact of Real-Time Articulatory Information on Phonetic Transcription: Ultrasound-Aided Transcription in Cleft Lip and Palate Speech.实时发音信息对语音转录的影响:腭裂语音的超声辅助转录。
Folia Phoniatr Logop. 2020;72(2):120-130. doi: 10.1159/000499753. Epub 2019 May 24.
10
Reliability and validity of a computer-mediated, single-word intelligibility test: preliminary findings for children with repaired cleft lip and palate.计算机辅助单字清晰度测试的信度和效度:唇腭裂修复术后儿童的初步研究结果
Cleft Palate Craniofac J. 2011 Sep;48(5):538-49. doi: 10.1597/09-166. Epub 2010 Sep 8.

引用本文的文献

1
Addressing challenges in speaker anonymization to maintain utility while ensuring privacy of pathological speech.应对说话者匿名化中的挑战,在确保病理性语音隐私的同时保持其可用性。
Commun Med (Lond). 2024 Sep 25;4(1):182. doi: 10.1038/s43856-024-00609-5.
2
Reidentification of Participants in Shared Clinical Data Sets: Experimental Study.共享临床数据集参与者的重新识别:实验研究
JMIR AI. 2024 Mar 15;3:e52054. doi: 10.2196/52054.

本文引用的文献

1
Speech-based markers for posttraumatic stress disorder in US veterans.基于语音的美国退伍军人创伤后应激障碍标志物。
Depress Anxiety. 2019 Jul;36(7):607-616. doi: 10.1002/da.22890. Epub 2019 Apr 22.
2
Multimodal Assessment of Parkinson's Disease: A Deep Learning Approach.帕金森病的多模态评估:深度学习方法。
IEEE J Biomed Health Inform. 2019 Jul;23(4):1618-1630. doi: 10.1109/JBHI.2018.2866873. Epub 2018 Aug 23.
3
Biomarker definitions and their applications.生物标志物的定义及其应用。
Exp Biol Med (Maywood). 2018 Feb;243(3):213-221. doi: 10.1177/1535370217750088.
4
What are biomarkers?生物标志物是什么?
Curr Opin HIV AIDS. 2010 Nov;5(6):463-6. doi: 10.1097/COH.0b013e32833ed177.
5
Automatic speech recognition (ASR) and its use as a tool for assessment or therapy of voice, speech, and language disorders.自动语音识别(ASR)及其作为评估或治疗嗓音、言语和语言障碍的工具的应用。
Logoped Phoniatr Vocol. 2009;34(2):91-6. doi: 10.1080/14015430802657216.
6
The current understanding of cleft lip malformations.唇裂畸形的当前认识。
Facial Plast Surg. 2002 Aug;18(3):147-53. doi: 10.1055/s-2002-33061.
7
Different cleft conditions, facial appearance, and speech: relationship to psychological variables.不同的腭裂情况、面部外观与言语:与心理变量的关系。
Cleft Palate Craniofac J. 2001 Jan;38(1):68-75. doi: 10.1597/1545-1569_2001_038_0068_dccfaa_2.0.co_2.
8
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.
9
Characteristics of cleft palate speech.腭裂语音的特征。
Eur J Disord Commun. 1996;31(4):331-57. doi: 10.3109/13682829609031326.