• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
ACOUSTICALLY-DRIVEN PHONEME REMOVAL THAT PRESERVES VOCAL AFFECT CUES.保留语音情感线索的声学驱动音素去除
Proc IEEE Int Conf Acoust Speech Signal Process. 2023 Jun;2023. doi: 10.1109/icassp49357.2023.10095942. Epub 2023 May 5.
2
Developmental changes in sensitivity to vocal paralanguage.对语音副语言敏感性的发育变化。
Dev Sci. 2000 May;3(2):148-162. doi: 10.1111/1467-7687.00108. Epub 2001 Dec 25.
3
Does the recording medium influence phonetic transcription of cleft palate speech?录音媒介会影响腭裂语音的语音转录吗?
Int J Lang Commun Disord. 2017 Jul;52(4):440-449. doi: 10.1111/1460-6984.12282. Epub 2016 Sep 13.
4
Talker identification across source mechanisms: experiments with laryngeal and electrolarynx speech.跨声源机制的说话者识别:喉音和电子喉语音实验
J Speech Lang Hear Res. 2014 Oct;57(5):1651-65. doi: 10.1044/2014_JSLHR-S-13-0161.
5
How the voice persuades.声音如何说服人。
J Pers Soc Psychol. 2020 Apr;118(4):661-682. doi: 10.1037/pspi0000193. Epub 2019 Jun 13.
6
Contributions of the glottal source and vocal tract cues to emotional vowel perception in the valence-arousal space.声门源和声道线索对效价唤醒空间中元音情感感知的贡献。
J Acoust Soc Am. 2018 Aug;144(2):908. doi: 10.1121/1.5051323.
7
Is there an ironic tone of voice?是否有一种讽刺的语气?
Lang Speech. 2005;48(Pt 3):257-77. doi: 10.1177/00238309050480030101.
8
Neural Correlates of Phonetic Learning in Postlingually Deafened Cochlear Implant Listeners.语后聋人工耳蜗植入者语音学习的神经关联
Ear Hear. 2016 Sep-Oct;37(5):514-28. doi: 10.1097/AUD.0000000000000287.
9
Now You Hear Me, Later You Don't: The Immediacy of Linguistic Computation and the Representation of Speech.现在你听到我,之后你听不到:语言计算的即时性和言语的表示。
Psychol Sci. 2021 Mar;32(3):410-423. doi: 10.1177/0956797620968787. Epub 2021 Feb 22.
10
Immediate effects of straw phonation in air or water on the laryngeal function and configuration of female speech-language pathology students visualised with strobovideolaryngoscopy: A randomised controlled trial.声门杓状软骨拨动术即时效应的研究:一项随机对照试验
Int J Lang Commun Disord. 2023 May;58(3):944-958. doi: 10.1111/1460-6984.12838. Epub 2023 Feb 1.

本文引用的文献

1
Validating a psychoacoustic model of voice quality.验证一种语音质量的心理声学模型。
J Acoust Soc Am. 2021 Jan;149(1):457. doi: 10.1121/10.0003331.
2
Editorial: Models and Theories of Speech Production.社论:言语产生的模型与理论
Front Psychol. 2020 Jun 19;11:1238. doi: 10.3389/fpsyg.2020.01238. eCollection 2020.
3
A Moan of Pleasure Should Be Breathy: The Effect of Voice Quality on the Meaning of Human Nonverbal Vocalizations.呻吟应该是有气声的:音质对人类非言语发声含义的影响。
Phonetica. 2020;77(5):327-349. doi: 10.1159/000504855. Epub 2020 Jan 21.
4
How the voice persuades.声音如何说服人。
J Pers Soc Psychol. 2020 Apr;118(4):661-682. doi: 10.1037/pspi0000193. Epub 2019 Jun 13.
5
Electroglottography - An Update.声带电图 - 最新进展。
J Voice. 2020 Jul;34(4):503-526. doi: 10.1016/j.jvoice.2018.12.014. Epub 2019 Mar 11.
6
Mechanics of human voice production and control.人类发声与控制的机制。
J Acoust Soc Am. 2016 Oct;140(4):2614. doi: 10.1121/1.4964509.
7
Influence on spectral energy distribution of emotional expression.情绪表达对光谱能量分布的影响。
J Voice. 2013 Jan;27(1):129.e1-129.e10. doi: 10.1016/j.jvoice.2012.08.008. Epub 2012 Nov 15.
8
Auditory processing in autism spectrum disorder: a review.自闭症谱系障碍中的听觉处理:综述。
Neurosci Biobehav Rev. 2012 Feb;36(2):836-54. doi: 10.1016/j.neubiorev.2011.11.008. Epub 2011 Dec 6.
9
A resource of validated affective and neutral sentences to assess identification of emotion in spoken language after a brain injury.一个经过验证的情感和中性句子资源,用于评估脑损伤后口语中情绪的识别。
Brain Inj. 2011;25(2):206-20. doi: 10.3109/02699052.2010.536197. Epub 2010 Nov 30.
10
Major depression is associated with impaired processing of emotion in music as well as in facial and vocal stimuli.重度抑郁症与情绪在音乐、面部和声音刺激方面的处理能力受损有关。
J Affect Disord. 2011 Feb;128(3):243-51. doi: 10.1016/j.jad.2010.06.039.

保留语音情感线索的声学驱动音素去除

ACOUSTICALLY-DRIVEN PHONEME REMOVAL THAT PRESERVES VOCAL AFFECT CUES.

作者信息

Noufi Camille, Berger Jonathan, Frank Michael, Parker Karen, Bowling Daniel L

机构信息

Stanford University, Center for Computer Research in Music and Acoustics, Stanford, CA, USA.

Stanford School of Medicine, Department of Psychiatry and Behavioral Sciences, Stanford, CA, USA.

出版信息

Proc IEEE Int Conf Acoust Speech Signal Process. 2023 Jun;2023. doi: 10.1109/icassp49357.2023.10095942. Epub 2023 May 5.

DOI:10.1109/icassp49357.2023.10095942
PMID:37701064
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10495117/
Abstract

In this paper, we propose a method for removing linguistic information from speech for the purpose of isolating paralinguistic indicators of affect. The immediate utility of this method lies in clinical tests of sensitivity to vocal affect that are not confounded by language, which is impaired in a variety of clinical populations. The method is based on simultaneous recordings of speech audio and electroglotto-graphic (EGG) signals. The speech audio signal is used to estimate the average vocal tract filter response and amplitude envelop. The EGG signal supplies a direct correlate of voice source activity that is mostly independent of phonetic articulation. These signals are used to create a third signal designed to capture as much paralinguistic information from the vocal production system as possible-maximizing the retention of bioacoustic cues to affect-while eliminating phonetic cues to verbal meaning. To evaluate the success of this method, we studied the perception of corresponding speech audio and transformed EGG signals in an affect rating experiment with online listeners. The results show a high degree of similarity in the perceived affect of matched signals, indicating that our method is effective.

摘要

在本文中,我们提出了一种从语音中去除语言信息的方法,目的是分离情感的副语言指标。该方法的直接效用在于对声音情感敏感性的临床测试,这些测试不会受到语言的干扰,而语言在各种临床人群中都存在受损情况。该方法基于语音音频和电声门图(EGG)信号的同步记录。语音音频信号用于估计平均声道滤波器响应和幅度包络。EGG信号提供了与声源活动直接相关的信息,该信息大多独立于语音发音。这些信号用于创建第三个信号,旨在从发声系统中尽可能多地捕获副语言信息——最大限度地保留影响情感的生物声学线索,同时消除语音意义的语音线索。为了评估该方法的成功性,我们在一项有在线听众参与的情感评级实验中研究了对相应语音音频和变换后的EGG信号的感知。结果表明,匹配信号在感知情感方面具有高度相似性,表明我们的方法是有效的。