Science & Technology of Music and Sound (STMS), UMR 9912 (CNRS/IRCAM/UPMC), 1 place Stravinsky, 75004, Paris, France.
Inserm U 1127, CNRS UMR 7225, Sorbonne Universités UPMC Univ Paris 06 UMR S 1127, Institut du Cerveau et de la Moelle épinière (ICM), Social and Affective Neuroscience (SAN) Laboratory, Paris, France.
Behav Res Methods. 2018 Feb;50(1):323-343. doi: 10.3758/s13428-017-0873-y.
We present an open-source software platform that transforms the emotional cues expressed by speech signals using audio effects such as pitch shifting, inflection, vibrato, and filtering. The emotional transformations can be applied to any audio file, but can also run in real time on live microphone input, with less than 20-ms latency. We anticipate that this tool will be useful for the study of emotions in psychology and neuroscience, because it enables a high level of control over the acoustic and emotional content of experimental stimuli in a variety of laboratory situations, including real-time social situations. We present here the results of a series of validation experiments aiming to position the tool against several methodological requirements: that transformed emotions be recognized at above-chance levels, remain valid across several languages (French, English, Swedish, and Japanese), and sound comparably natural to unmodified speech.
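Pitch-modulation effects such as the vibrato mentioned above are commonly implemented as a sinusoidally modulated delay line read with fractional-sample interpolation. The sketch below is a minimal, generic illustration of that technique in Python, not the platform's actual implementation; the function name and default parameters (a ~6-Hz modulation rate is a typical vibrato value) are assumptions for the example.

```python
import math

def vibrato(samples, sr, rate_hz=6.0, depth_ms=0.5):
    """Apply a simple vibrato: read the signal through a delay line
    whose length oscillates sinusoidally, using linear interpolation
    for fractional read positions. Illustrative sketch only."""
    depth = depth_ms * 1e-3 * sr  # modulation depth in samples
    out = []
    for n in range(len(samples)):
        # instantaneous delay oscillates between 0 and 2*depth samples
        d = depth * (1.0 + math.sin(2.0 * math.pi * rate_hz * n / sr))
        pos = n - d
        if pos < 0:
            pos = 0.0  # clamp during the first few samples
        i = int(pos)
        frac = pos - i
        j = min(i + 1, len(samples) - 1)
        # linear interpolation between the two nearest samples
        out.append((1.0 - frac) * samples[i] + frac * samples[j])
    return out
```

Because the read position advances at a periodically varying rate, the instantaneous pitch of the output rises and falls around the original, which is the defining acoustic signature of vibrato.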