• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人工智能助力肌萎缩侧索硬化症患者的语音生成。

Artificial intelligence empowered voice generation for amyotrophic lateral sclerosis patients.

作者信息

Regondi Stefano, Donvito Giordana, Frontoni Emanuele, Kostovic Milutin, Minazzi Fabio, Bratières Sébastien, Filosto Massimiliano, Pugliese Raffaele

机构信息

NeMO Lab, ASST GOM Niguarda Cà Granda Hospital, Milan, Italy.

NEuroMuscular Omnicenter (NEMO), Fondazione Serena Onlus, Milan, Italy.

出版信息

Sci Rep. 2025 Jan 8;15(1):1361. doi: 10.1038/s41598-024-84728-y.

DOI:10.1038/s41598-024-84728-y
PMID:39779800
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11711320/
Abstract

Amyotrophic Lateral Sclerosis (ALS) is a neurodegenerative disease that can result in a progressive loss of speech due to bulbar dysfunction, which can have significant negative impact on the patient's mental well-being. Alternative Augmentative Communication (AAC) strategies based on synthetic voices have been shown to assist patients in maintaining communication and improving their Quality of Life (QoL). However, such synthetic voices are often perceived as impersonal and fail to capture the unique voice and identity of the patient. To tackle this issue, combining voice banking (VB) and artificial intelligence (AI) has emerged as a more natural communication strategy, enabling individuals to preserve their voice for use with AAC devices as needed. This involves recording speech samples to generate a synthetic voice closely resembling the individual's own. Despite the increasing interest in VB, there's a lack of clear strategies for its effective implementation in rapidly progressing diseases like ALS. Additionally, the perceptual quality of VB on patients with preserved speech, especially when offered early in the disease, remains poorly understood. In light of these challenges, this study aims to assess the effectiveness and the perceptual impact of AI-generated voices on ALS patients with preserved speech, utilizing a personalized voice synthesis system based on machine learning. The AI-generated patient-specific voice is achieved through voice recording, followed by fine-tuning using a Generative Adversarial Network for Efficient and High Fidelity Speech Synthesis (HiFi-GAN), resulting in a model capable of producing speech highly similar to the patient's own voice, with exceptional expressive and audio quality. By addressing these aspects, this study intends to offer valuable insights into the potential benefits and challenges of combining VB with AI voices to enhance communication support for ALS patients.

摘要

肌萎缩侧索硬化症(ALS)是一种神经退行性疾病,由于延髓功能障碍可导致渐进性言语丧失,这会对患者的心理健康产生重大负面影响。基于合成语音的替代性辅助沟通(AAC)策略已被证明有助于患者保持沟通并提高生活质量(QoL)。然而,这种合成语音通常被认为缺乏人情味,无法捕捉患者独特的声音和个性。为了解决这个问题,将语音库(VB)与人工智能(AI)相结合已成为一种更自然的沟通策略,使个人能够根据需要保存自己的声音以供AAC设备使用。这包括录制语音样本以生成与个人自己的语音非常相似的合成语音。尽管对VB的兴趣日益增加,但在ALS等快速进展的疾病中,缺乏明确的有效实施策略。此外,对于仍有言语能力的患者,尤其是在疾病早期提供VB时,其感知质量仍知之甚少。鉴于这些挑战,本研究旨在利用基于机器学习的个性化语音合成系统,评估人工智能生成的语音对仍有言语能力的ALS患者的有效性和感知影响。通过语音录制,然后使用用于高效和高保真语音合成的生成对抗网络(HiFi-GAN)进行微调,从而生成特定于患者的人工智能语音,得到一个能够产生与患者自己的语音高度相似、具有出色表现力和音频质量的模型。通过解决这些方面的问题,本研究旨在为将VB与人工智能语音相结合以增强对ALS患者的沟通支持的潜在益处和挑战提供有价值的见解。

相似文献

1
Artificial intelligence empowered voice generation for amyotrophic lateral sclerosis patients.人工智能助力肌萎缩侧索硬化症患者的语音生成。
Sci Rep. 2025 Jan 8;15(1):1361. doi: 10.1038/s41598-024-84728-y.
2
A large-scale comparison of two voice synthesis techniques on intelligibility, naturalness, preferences, and attitudes toward voices banked by individuals with amyotrophic lateral sclerosis.两种语音合成技术在可懂度、自然度、偏好以及对由肌萎缩侧索硬化症患者存储的语音库的态度方面的大规模比较。
Augment Altern Commun. 2024 Mar;40(1):31-45. doi: 10.1080/07434618.2023.2262032. Epub 2023 Oct 4.
3
Voice banking for people living with motor neurone disease: Views and expectations.运动神经元病患者的语音库:观点与期望
Int J Lang Commun Disord. 2021 Jan;56(1):116-129. doi: 10.1111/1460-6984.12588. Epub 2020 Dec 22.
4
Voice Conversion for Persons with Amyotrophic Lateral Sclerosis.肌萎缩侧索硬化症患者的语音转换。
IEEE J Biomed Health Inform. 2020 Oct;24(10):2942-2949. doi: 10.1109/JBHI.2019.2961844. Epub 2019 Dec 25.
5
Augmentative and alternative communication improves quality of life in the early stages of amyotrophic lateral sclerosis.辅助和替代沟通改善肌萎缩侧索硬化症早期阶段的生活质量。
Funct Neurol. 2019 Jan-Mar;34(1):35-43.
6
A recent survey of augmentative and alternative communication use and service delivery experiences of people with amyotrophic lateral sclerosis in the United States.美国肌萎缩侧索硬化症患者使用辅助和替代性沟通方式及相关服务的最新调查。
Disabil Rehabil Assist Technol. 2024 May;19(4):1121-1134. doi: 10.1080/17483107.2022.2149866. Epub 2022 Nov 30.
7
Do you like my voice? Stakeholder perspectives about the acceptability of synthetic child voices in three South African languages.你喜欢我的声音吗?利益相关者对三种南非语言中合成儿童声音可接受性的看法。
Int J Lang Commun Disord. 2025 Jan-Feb;60(1):e13152. doi: 10.1111/1460-6984.13152.
8
Improving care for amyotrophic lateral sclerosis with artificial intelligence and affective computing.利用人工智能与情感计算改善肌萎缩侧索硬化症的护理
J Neurol Sci. 2025 Jan 15;468:123328. doi: 10.1016/j.jns.2024.123328. Epub 2024 Nov 25.
9
The BCH message banking process™, voice banking, and double-dipping™.BCH 信息银行处理™、语音银行和双重取款™。
Augment Altern Commun. 2021 Dec;37(4):241-250. doi: 10.1080/07434618.2021.2021554. Epub 2022 Jan 8.
10
Speech deterioration in amyotrophic lateral sclerosis (ALS) after manifestation of bulbar symptoms.肌萎缩侧索硬化症(ALS)延髓症状出现后言语功能恶化。
Int J Lang Commun Disord. 2018 Mar;53(2):385-392. doi: 10.1111/1460-6984.12357. Epub 2017 Nov 21.

引用本文的文献

1
Management of Dysarthria in Amyotrophic Lateral Sclerosis.肌萎缩侧索硬化症构音障碍的管理
Cells. 2025 Jul 9;14(14):1048. doi: 10.3390/cells14141048.

本文引用的文献

1
What do people affected by amyotrophic lateral sclerosis want from health communications? Evidence from the ALS Talk Project.肌萎缩侧索硬化症患者对健康传播有何需求?来自 ALS 对话项目的证据。
Muscle Nerve. 2023 Sep;68(3):286-295. doi: 10.1002/mus.27935. Epub 2023 Jul 18.
2
A real-time voice cloning system with multiple algorithms for speech quality improvement.一种具有多种算法的实时语音克隆系统,可改善语音质量。
PLoS One. 2023 Apr 3;18(4):e0283440. doi: 10.1371/journal.pone.0283440. eCollection 2023.
3
Acoustic Measures of Dysphonia in Amyotrophic Lateral Sclerosis.
肌萎缩侧索硬化症的嗓音声学测量。
J Speech Lang Hear Res. 2023 Mar 7;66(3):872-887. doi: 10.1044/2022_JSLHR-22-00363. Epub 2023 Feb 20.
4
DIA-TTS: Deep-Inherited Attention-Based Text-to-Speech Synthesizer.DIA-TTS:基于深度继承注意力的文本到语音合成器。
Entropy (Basel). 2022 Dec 26;25(1):41. doi: 10.3390/e25010041.
5
A recent survey of augmentative and alternative communication use and service delivery experiences of people with amyotrophic lateral sclerosis in the United States.美国肌萎缩侧索硬化症患者使用辅助和替代性沟通方式及相关服务的最新调查。
Disabil Rehabil Assist Technol. 2024 May;19(4):1121-1134. doi: 10.1080/17483107.2022.2149866. Epub 2022 Nov 30.
6
Amyotrophic lateral sclerosis.肌萎缩性侧索硬化症。
Lancet. 2022 Oct 15;400(10360):1363-1380. doi: 10.1016/S0140-6736(22)01272-7. Epub 2022 Sep 15.
7
A qualitative evidence synthesis of the experiences and perspectives of communicating using augmentative and alternative communication (AAC).使用增强和替代沟通(AAC)进行交流的体验和观点的定性证据综合。
Disabil Rehabil Assist Technol. 2024 Jul;19(5):1802-1816. doi: 10.1080/17483107.2022.2105961. Epub 2022 Aug 26.
8
Voice banking for people living with motor neurone disease: Views and expectations.运动神经元病患者的语音库:观点与期望
Int J Lang Commun Disord. 2021 Jan;56(1):116-129. doi: 10.1111/1460-6984.12588. Epub 2020 Dec 22.
9
ALS: Management Problems.肌萎缩侧索硬化症:管理问题
Neurol Clin. 2020 Aug;38(3):565-575. doi: 10.1016/j.ncl.2020.03.013. Epub 2020 Jun 11.
10
Augmentative and Alternative Communication (AAC) Advances: A Review of Configurations for Individuals with a Speech Disability.辅助沟通技术(AAC)的进展:对言语残疾者配置的综述。
Sensors (Basel). 2019 Apr 22;19(8):1911. doi: 10.3390/s19081911.