Suppr超能文献

连续语音信号中的音段时长:初步结果。

Segmental durations in connected speech signals: preliminary results.

作者信息

Crystal T H, House A S

出版信息

J Acoust Soc Am. 1982 Sep;72(3):705-16. doi: 10.1121/1.388251.

Abstract

The data base, methods for a study of the durations of phonetic units in connected speech, and some preliminary results are described. From readings of two scripts by many talkers, two sets of seven talkers each were selected, based on total reading time, to form a fast group and a slow group of talkers. Using computer graphics and digital playback procedures, the recordings were segmented into breath groups and pauses, and the first four sentences in each script were segmented into phones. The hold and release (that is, plosion and/or frication) portions of stops were identified and measured; less than 50% of the stops included releases. To establish the usefulness of the data base, the first-order statistics of the phonetic segments were determined, and a variety of durational characteristics were compared to existing reports. Analysis of number of breath groups, phonation time, and pause characterized the difference between so-called average fast and average slow talkers; however, no script-independent measure of these variables was found which would accurately predict the classification of individual talkers. The mean durations of various phonetic categories showed essentially the same percentage change when the fast and slow talkers were compared. Preliminary analyses of contextual influences on durations showed some expected changes, and also indicated that certain traditional predictions may not hold for informal connected speech. Gamma functions were fitted to the distributions of durations of various gross categories.

摘要

本文描述了数据库、连读语音中语音单位时长的研究方法以及一些初步结果。从许多说话者对两份文稿的朗读中,根据总朗读时间,每组挑选出七名说话者,分别组成快速组和慢速组。利用计算机图形技术和数字回放程序,将录音分割成呼吸组和停顿,并且将每份文稿中的前四个句子分割成音素。确定并测量了塞音的持阻和除阻(即爆破和/或摩擦)部分;不到50%的塞音包含除阻部分。为了确定数据库的实用性,确定了语音片段的一阶统计量,并将各种时长特征与现有报告进行了比较。对呼吸组数量、发声时间和停顿的分析表征了所谓平均语速快和平均语速慢的说话者之间的差异;然而,未发现这些变量的与文稿无关的测量方法能够准确预测个体说话者的分类。比较快速组和慢速组说话者时,各种语音类别的平均时长呈现出基本相同的百分比变化。对时长的语境影响的初步分析显示了一些预期的变化,并且还表明某些传统预测可能不适用于非正式连读语音。用伽马函数拟合了各种总体类别的时长分布。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验