• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

“自身声音”的听觉特征。

Auditory traits of "own voice".

机构信息

Department of Life Sciences, The University of Tokyo, Tokyo, Japan.

出版信息

PLoS One. 2018 Jun 26;13(6):e0199443. doi: 10.1371/journal.pone.0199443. eCollection 2018.

DOI:10.1371/journal.pone.0199443
PMID:29944698
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6019673/
Abstract

People perceive their recorded voice differently from their actively spoken voice. The uncanny valley theory proposes that as an object approaches humanlike characteristics, there is an increase in the sense of familiarity; however, eventually a point is reached where the object becomes strangely similar and makes us feel uneasy. The feeling of discomfort experienced when people hear their recorded voice may correspond to the floor of the proposed uncanny valley. To overcome the feeling of eeriness of own-voice recordings, previous studies have suggested equalization of the recorded voice with various types of filters, such as step, bandpass, and low-pass, yet the effectiveness of these filters has not been evaluated. To address this, the aim of experiment 1 was to identify what type of voice recording was the most representative of one's own voice. The voice recordings were presented in five different conditions: unadjusted recorded voice, step filtered voice, bandpass filtered voice, low-pass filtered voice, and a voice for which the participants freely adjusted the parameters. We found large individual differences in the most representative own-voice filter. In order to consider roles of sense of agency, experiment 2 investigated if lip-synching would influence the rating of own voice. The result suggested lip-synching did not affect own voice ratings. In experiment 3, based on the assumption that the voices used in previous experiments corresponded to continuous representations of non-own voice to own voice, the existence of an uncanny valley was examined. Familiarity, eeriness, and the sense of own voice were rated. The result did not support the existence of an uncanny valley. Taken together, the experiments led us to the following conclusions: there is no general filter that can represent own voice for everyone, sense of agency has no effect on own voice rating, and the uncanny valley does not exist for own voice, specifically.

摘要

人们对自己录制的声音和主动说出的声音有不同的感知。“恐怖谷理论”提出,随着一个物体越来越接近人类的特征,熟悉感会增加;然而,最终会达到一个点,这个物体变得非常相似,让我们感到不安。人们听到自己录制的声音时感到的不适可能与恐怖谷的谷底相对应。为了克服对自己录音的怪异感,之前的研究提出了用各种类型的滤波器(如阶跃、带通和低通滤波器)对录音进行均衡,然而这些滤波器的效果尚未得到评估。为了解决这个问题,实验 1 的目的是确定哪种录音最能代表自己的声音。录音以五种不同的条件呈现:未经调整的录音、阶跃滤波的录音、带通滤波的录音、低通滤波的录音以及参与者自由调整参数的录音。我们发现,最能代表自己声音的录音滤波器存在很大的个体差异。为了考虑主体感的作用,实验 2 调查了口型同步是否会影响对自己声音的评价。结果表明,口型同步不会影响对自己声音的评价。在实验 3 中,基于之前的实验中使用的声音对应于非自己声音到自己声音的连续表示的假设,检验了恐怖谷的存在。对熟悉度、怪异感和自我声音的感知进行了评价。结果不支持恐怖谷的存在。总的来说,这些实验得出了以下结论:对于每个人来说,没有一个通用的滤波器可以代表自己的声音,主体感对自己声音的评价没有影响,而且特定的恐怖谷并不存在于自己的声音中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/c99afad7d70f/pone.0199443.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/11a6be3ffdcd/pone.0199443.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/5c1c39b362c9/pone.0199443.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/0e403c2c3efd/pone.0199443.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/a9aaadf804ad/pone.0199443.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/babf478c2966/pone.0199443.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/eddd544e92f6/pone.0199443.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/d785599dd658/pone.0199443.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/c99afad7d70f/pone.0199443.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/11a6be3ffdcd/pone.0199443.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/5c1c39b362c9/pone.0199443.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/0e403c2c3efd/pone.0199443.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/a9aaadf804ad/pone.0199443.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/babf478c2966/pone.0199443.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/eddd544e92f6/pone.0199443.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/d785599dd658/pone.0199443.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aae1/6019673/c99afad7d70f/pone.0199443.g008.jpg

相似文献

1
Auditory traits of "own voice".“自身声音”的听觉特征。
PLoS One. 2018 Jun 26;13(6):e0199443. doi: 10.1371/journal.pone.0199443. eCollection 2018.
2
The Human Takes It All: Humanlike Synthesized Voices Are Perceived as Less Eerie and More Likable. Evidence From a Subjective Ratings Study.人类主导一切:类人合成语音被认为不那么怪异且更讨人喜欢。来自一项主观评分研究的证据。
Front Neurorobot. 2020 Dec 16;14:593732. doi: 10.3389/fnbot.2020.593732. eCollection 2020.
3
Neural representations of own-voice in the human auditory cortex.人类听觉皮层中自身声音的神经表示。
Sci Rep. 2021 Jan 12;11(1):591. doi: 10.1038/s41598-020-80095-6.
4
I like my voice better: self-enhancement bias in perceptions of voice attractiveness.我更喜欢自己的声音:对声音吸引力感知中的自我提升偏差。
Perception. 2013;42(9):941-9. doi: 10.1068/p7526.
5
A review of empirical evidence on different uncanny valley hypotheses: support for perceptual mismatch as one road to the valley of eeriness.关于不同恐怖谷假说的实证证据综述:支持感知不匹配是通往怪异之谷的一条途径。
Front Psychol. 2015 Apr 10;6:390. doi: 10.3389/fpsyg.2015.00390. eCollection 2015.
6
I Hear My Voice; Therefore I Spoke: The Sense of Agency Over Speech Is Enhanced by Hearing One's Own Voice.我听见我的声音;因此我发声:对言语的自主感通过听见自己的声音而增强。
Psychol Sci. 2022 Aug;33(8):1226-1239. doi: 10.1177/09567976211068880. Epub 2022 Jul 5.
7
The perception of humanness from the movements of synthetic agents.从合成智能体的动作中感知人性。
Perception. 2011;40(6):695-704. doi: 10.1068/p6900.
8
Feeling robots and human zombies: mind perception and the uncanny valley.感受机器人和人类僵尸:心智知觉与恐怖谷
Cognition. 2012 Oct;125(1):125-30. doi: 10.1016/j.cognition.2012.06.007. Epub 2012 Jul 9.
9
A reappraisal of the uncanny valley: categorical perception or frequency-based sensitization?对诡异谷现象的再评价:范畴性感知还是基于频率的敏感化?
Front Psychol. 2015 Jan 21;5:1488. doi: 10.3389/fpsyg.2014.01488. eCollection 2014.
10
A mismatch in the human realism of face and voice produces an uncanny valley.面部与声音在人类真实感上的不匹配会产生恐怖谷效应。
Iperception. 2011;2(1):10-2. doi: 10.1068/i0415. Epub 2011 Mar 1.

引用本文的文献

1
The fundamental frequencies of our own voice.我们自己声音的基频。
R Soc Open Sci. 2025 Feb 19;12(2):241081. doi: 10.1098/rsos.241081. eCollection 2025 Feb.
2
Listen to yourself! Prioritization of self-associated and own voice cues.倾听你自己!自我关联和自身语音线索的优先级。
Br J Psychol. 2025 Feb;116(1):131-148. doi: 10.1111/bjop.12741. Epub 2024 Oct 3.
3
Neural Effects of One's Own Voice on Self-Talk for Emotion Regulation.自身声音对用于情绪调节的自我对话的神经影响。

本文引用的文献

1
Voice-only communication enhances empathic accuracy.仅语音交流就能提高共情准确性。
Am Psychol. 2017 Oct;72(7):644-654. doi: 10.1037/amp0000147.
2
DAVID: An open-source platform for real-time transformation of infra-segmental emotional cues in running speech.DAVID:用于实时转换口语中隐段情绪线索的开源平台。
Behav Res Methods. 2018 Feb;50(1):323-343. doi: 10.3758/s13428-017-0873-y.
3
Covert digital manipulation of vocal emotion alter speakers' emotional states in a congruent direction.对声音情感进行隐蔽的数字操纵会使说话者的情绪状态朝着一致的方向改变。
Brain Sci. 2024 Jun 26;14(7):637. doi: 10.3390/brainsci14070637.
4
Bone conduction facilitates self-other voice discrimination.骨传导有助于区分自我与他人的声音。
R Soc Open Sci. 2023 Feb 15;10(2):221561. doi: 10.1098/rsos.221561. eCollection 2023 Feb.
5
Expectancy changes the self-monitoring of voice identity.期望改变了自我监测的声音特征。
Eur J Neurosci. 2021 Apr;53(8):2681-2695. doi: 10.1111/ejn.15162. Epub 2021 Mar 26.
6
Neural representations of own-voice in the human auditory cortex.人类听觉皮层中自身声音的神经表示。
Sci Rep. 2021 Jan 12;11(1):591. doi: 10.1038/s41598-020-80095-6.
Proc Natl Acad Sci U S A. 2016 Jan 26;113(4):948-53. doi: 10.1073/pnas.1506552113. Epub 2016 Jan 11.
4
Cartilage conduction hearing.软骨传导听力
J Acoust Soc Am. 2014 Apr;135(4):1959-66. doi: 10.1121/1.4868372.
5
The timbre of the voice as perceived by the singer him-/herself.歌手自身所感知到的嗓音音色。
Logoped Phoniatr Vocol. 2014 Apr;39(1):1-10. doi: 10.3109/14015439.2013.775334. Epub 2013 Mar 19.
6
The potential link between sense of agency and output monitoring over speech.言语产生中主体感和输出监控之间的潜在联系。
Conscious Cogn. 2013 Mar;22(1):360-74. doi: 10.1016/j.concog.2012.07.010. Epub 2012 Aug 19.
7
Human voice perception.人类语音感知。
Curr Biol. 2011 Feb 22;21(4):R143-5. doi: 10.1016/j.cub.2010.12.033.
8
A sawtooth waveform inspired pitch estimator for speech and music.一种用于语音和音乐的锯齿波激励基音估计器。
J Acoust Soc Am. 2008 Sep;124(3):1638-52. doi: 10.1121/1.2951592.
9
Estimating bone conduction transfer functions using otoacoustic emissions.利用耳声发射估计骨传导传递函数。
J Acoust Soc Am. 2003 Aug;114(2):907-18. doi: 10.1121/1.1582436.
10
Toward a better understanding of the perception of self-produced speech.为了更好地理解对自我产生言语的感知。
J Commun Disord. 2003 Jan-Feb;36(1):1-11. doi: 10.1016/s0021-9924(02)00132-6.