Suppr超能文献

鼻音的说话者依赖特征。

Speaker-dependent characteristics of the nasals.

作者信息

Amino Kanae, Arai Takayuki

机构信息

Faculty of Science and Technology, Department of Electrical and Electronics Engineering, Sophia University, Tokyo, Japan.

出版信息

Forensic Sci Int. 2009 Mar 10;185(1-3):21-8. doi: 10.1016/j.forsciint.2008.11.018. Epub 2009 Jan 21.

Abstract

Investigation on human speaker identification enables us to know the indexical cues to speakers, and it may consequently lead to the effective acoustical parameters that can be used for forensic speaker recognition. It is known that speaker individuality interacts with the phonological or linguistic information contained in speech signals. As proof, the accuracy of perceptual speaker identification (PSI) performances depends on what types of sounds are presented to the listeners. In a series of our previous experiments, we have been investigating the effective sounds for PSI, and the stimuli containing a nasal were found to be the ones. In this present study, we conducted another PSI experiment in order to examine the reproducibility of the nasal effectiveness, and to see the effects of the following vowels. Coronal nasals were shown to be effective despite the different speaker set or the following vowels, and the stimuli containing a nasal were significantly better than those without it. In the second part of this paper, we introduce the results of the acoustical analysis of the stimuli. The contours of the energy transitions showed variations in shape among speakers for all three types of the analysis targets; nasals, stops, and fricatives, although the inter-speaker difference in the energy slopes for the consonant articulation was significant especially in nasal sounds. We also examined the effects of the sampling frequencies and the speech codecs, and found that the speaker-dependent shapes of these energy contours were maintained as long as the speech materials were uncompressed. The contours of the nasals appeared to be stable within a speaker, compared to other types of sounds.

摘要

对人类说话者识别的研究使我们能够了解说话者的索引线索,进而可能得出可用于法医说话者识别的有效声学参数。众所周知,说话者的个体特征与语音信号中包含的音系或语言信息相互作用。作为证据,感知说话者识别(PSI)性能的准确性取决于向听众呈现的声音类型。在我们之前的一系列实验中,我们一直在研究PSI的有效声音,发现包含鼻音的刺激是有效的。在本研究中,我们进行了另一项PSI实验,以检验鼻音有效性的可重复性,并观察后续元音的影响。尽管说话者集合或后续元音不同,但冠状鼻音被证明是有效的,并且包含鼻音的刺激明显优于不包含鼻音的刺激。在本文的第二部分,我们介绍了刺激的声学分析结果。对于所有三种分析目标——鼻音、塞音和擦音,能量转换的轮廓在不同说话者之间呈现出形状变化,尽管辅音发音的能量斜率在说话者之间的差异尤其在鼻音中很显著。我们还研究了采样频率和语音编解码器的影响,发现只要语音材料未被压缩,这些能量轮廓的说话者依赖形状就会保持。与其他类型的声音相比,鼻音的轮廓在一个说话者内部似乎是稳定的。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验