Responsible development of clinical speech AI: Bridging the gap between clinical research and technology.

Author information

Berisha Visar, Liss Julie M

Affiliations

School of Electrical Computer and Energy Engineering and College of Health Solutions, Arizona State University, Tempe, AZ, USA.

College of Health Solutions, Arizona State University, Tempe, AZ, USA.

Publication information

NPJ Digit Med. 2024 Aug 9;7(1):208. doi: 10.1038/s41746-024-01199-1.

DOI:10.1038/s41746-024-01199-1
PMID:39122889
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11316053/
Abstract

This perspective article explores the challenges and potential of using speech as a biomarker in clinical settings, particularly when constrained by the small clinical datasets typically available in such contexts. We contend that by integrating insights from speech science and clinical research, we can reduce sample complexity in clinical speech AI models with the potential to decrease timelines to translation. Most existing models are based on high-dimensional feature representations trained with limited sample sizes and often do not leverage insights from speech science and clinical research. This approach can lead to overfitting, where the models perform exceptionally well on training data but fail to generalize to new, unseen data. Additionally, without incorporating theoretical knowledge, these models may lack interpretability and robustness, making them challenging to troubleshoot or improve post-deployment. We propose a framework for organizing health conditions based on their impact on speech and promote the use of speech analytics in diverse clinical contexts beyond cross-sectional classification. For high-stakes clinical use cases, we advocate for a focus on explainable and individually-validated measures and stress the importance of rigorous validation frameworks and ethical considerations for responsible deployment. Bridging the gap between AI research and clinical speech research presents new opportunities for more efficient translation of speech-based AI tools and advancement of scientific discoveries in this interdisciplinary space, particularly if limited to small or retrospective datasets.

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b681/11316053/9c071dde913e/41746_2024_1199_Fig1_HTML.jpg
Figure 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b681/11316053/5fd014b5aced/41746_2024_1199_Fig2_HTML.jpg
Figure 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b681/11316053/6a2e1ed89aa0/41746_2024_1199_Fig3_HTML.jpg
Figure 4: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b681/11316053/a5dd6e0c4333/41746_2024_1199_Fig4_HTML.jpg
