Speaker-dependent characteristics of the nasals.

Amino Kanae, Arai Takayuki

Faculty of Science and Technology, Department of Electrical and Electronics Engineering, Sophia University, Tokyo, Japan.

Forensic Sci Int. 2009 Mar 10;185(1-3):21-8. doi: 10.1016/j.forsciint.2008.11.018. Epub 2009 Jan 21.

Investigating human speaker identification reveals the indexical cues to speaker identity, and may in turn point to acoustic parameters that are effective for forensic speaker recognition. Speaker individuality is known to interact with the phonological or linguistic information contained in speech signals. As evidence, the accuracy of perceptual speaker identification (PSI) depends on which types of sounds are presented to listeners. In a series of previous experiments, we investigated which sounds are effective for PSI and found that stimuli containing a nasal were the most effective. In the present study, we conducted another PSI experiment to examine the reproducibility of this nasal effectiveness and to assess the effect of the following vowel. Coronal nasals proved effective regardless of the speaker set or the following vowel, and stimuli containing a nasal performed significantly better than those without one. In the second part of the paper, we present the results of an acoustical analysis of the stimuli. The contours of the energy transitions varied in shape among speakers for all three types of analysis targets (nasals, stops, and fricatives), although the inter-speaker difference in the energy slopes during consonant articulation was significant particularly for nasal sounds. We also examined the effects of sampling frequency and speech codec, and found that the speaker-dependent shapes of these energy contours were maintained as long as the speech materials were uncompressed. The contours of the nasals appeared to be stable within a speaker compared with other types of sounds.
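The abstract does not specify how the energy contours or their slopes were computed. As an illustration only, the following sketch shows one common way to obtain a short-time log-energy contour and a least-squares slope from a waveform. The function names, the 20 ms frame, the 5 ms hop, and the Hann window are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def energy_contour_db(signal, sample_rate, frame_ms=20.0, hop_ms=5.0):
    """Return a short-time log-energy contour (dB) of a 1-D signal.

    Frame length, hop size and the Hann window are illustrative
    assumptions, not the settings used in the study.
    """
    signal = np.asarray(signal, dtype=float)
    frame = int(round(sample_rate * frame_ms / 1000.0))
    hop = int(round(sample_rate * hop_ms / 1000.0))
    if len(signal) < frame:
        raise ValueError("signal is shorter than one analysis frame")
    n_frames = 1 + (len(signal) - frame) // hop
    window = np.hanning(frame)
    energies = np.empty(n_frames)
    for i in range(n_frames):
        segment = signal[i * hop : i * hop + frame] * window
        energies[i] = np.sum(segment ** 2)
    # Small floor avoids log(0) on silent frames.
    return 10.0 * np.log10(energies + 1e-12)


def energy_slope(contour_db, hop_ms=5.0):
    """Least-squares slope of the contour in dB per millisecond."""
    times_ms = np.arange(len(contour_db)) * hop_ms
    slope, _intercept = np.polyfit(times_ms, contour_db, 1)
    return slope


# Synthetic stand-in for a consonant-to-vowel transition:
# a 200 Hz tone whose amplitude rises linearly over 0.2 s.
sample_rate = 16000
t = np.arange(int(0.2 * sample_rate)) / sample_rate
rising = np.linspace(0.1, 1.0, t.size) * np.sin(2 * np.pi * 200.0 * t)

contour = energy_contour_db(rising, sample_rate)
print("frames:", len(contour))
print("slope (dB/ms):", energy_slope(contour))
```

Comparing such slopes across speakers for the same consonant, and across repetitions by the same speaker, would be one way to look at the inter-speaker and within-speaker variation the abstract describes.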


Similar articles

Speaker-dependent characteristics of the nasals.

Forensic Sci Int. 2009 Mar 10;185(1-3):21-8. doi: 10.1016/j.forsciint.2008.11.018. Epub 2009 Jan 21.

Attentional influences on functional mapping of speech sounds in human auditory cortex.

BMC Neurosci. 2004 Jul 21;5:24. doi: 10.1186/1471-2202-5-24.

An acoustical and perceptual study of vowels produced by alaryngeal speakers of Cantonese.

Folia Phoniatr Logop. 2009;61(2):97-104. doi: 10.1159/000209272. Epub 2009 Mar 20.

Speaker normalization using cortical strip maps: a neural model for steady-state vowel categorization.

J Acoust Soc Am. 2008 Dec;124(6):3918-36. doi: 10.1121/1.2997478.

Influence of speaker gender on listener judgments of tracheoesophageal speech.

J Voice. 2008 Jan;22(1):43-57. doi: 10.1016/j.jvoice.2006.08.008. Epub 2006 Oct 18.

The relative contributions of speaking fundamental frequency and formant frequencies to gender identification based on isolated vowels.

J Voice. 2005 Dec;19(4):544-54. doi: 10.1016/j.jvoice.2004.10.006.

Voice aftereffects of adaptation to speaker identity.

Hear Res. 2010 Sep 1;268(1-2):38-45. doi: 10.1016/j.heares.2010.04.011. Epub 2010 Apr 27.

[The voice as an anthropologic marker system, its constitutional correlates and characteristics].

Anthropol Anz. 1988 Jun;46(2):185-93.

Automatic source speaker selection for voice conversion.

J Acoust Soc Am. 2009 Jan;125(1):480-91. doi: 10.1121/1.3027445.

J Speech Lang Hear Res. 2005 Aug;48(4):753-65. doi: 10.1044/1092-4388(2005/052).

Cited by

Identifying Voice Individuality Unaffected by Age-Related Voice Changes during Adolescence.

Sensors (Basel). 2022 Feb 17;22(4):1542. doi: 10.3390/s22041542.

The processing of intimately familiar and unfamiliar voices: Specific neural responses of speaker recognition and identification.

PLoS One. 2021 Apr 16;16(4):e0250214. doi: 10.1371/journal.pone.0250214. eCollection 2021.
