Prathosh A P, Ramakrishnan A G, Ananthapadmanabha T V
Department of Electrical Engineering, Indian Institute of Science, Bangalore 560012, India
Voice and Speech Systems, Malleshwaram, Bangalore 560003, India
J Acoust Soc Am. 2014 Aug;136(2):EL122-8. doi: 10.1121/1.4885768.
This paper proposes an automatic acoustic-phonetic method for estimating voice-onset time of stops. This method requires neither transcription of the utterance nor training of a classifier. It makes use of the plosion index for the automatic detection of burst onsets of stops. Having detected the burst onset, the onset of the voicing following the burst is detected using the epochal information and a temporal measure named the maximum weighted inner product. For validation, several experiments are carried out on the entire TIMIT database and two of the CMU Arctic corpora. The performance of the proposed method compares well with three state-of-the-art techniques.
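As a rough illustration of the two quantities the abstract names, the sketch below computes a plosion-index-style ratio at a candidate sample and converts a detected burst onset and voicing onset into a voice-onset time. The exact definition of the plosion index, its parameters, and the maximum weighted inner product are not given in this abstract. The function names, the `gap` and `window` parameters, and the ratio form used here are therefore assumptions for illustration, not the authors' implementation. The voicing-onset detector is not sketched at all; its output is taken as a given sample index.

```python
def plosion_index(signal, n0, gap, window):
    """Assumed, simplified form: |signal[n0]| divided by the mean absolute
    amplitude of `window` samples that end `gap` samples before n0."""
    start = n0 - gap - window
    if start < 0 or window <= 0:
        raise ValueError("not enough samples before n0")
    prior = signal[start:n0 - gap]
    mean_abs = sum(abs(x) for x in prior) / len(prior)
    # A silent closure interval gives an unbounded ratio.
    return abs(signal[n0]) / mean_abs if mean_abs > 0 else float("inf")


def voice_onset_time_ms(burst_onset, voicing_onset, sample_rate):
    """Voice-onset time in milliseconds, from two sample indices."""
    return 1000.0 * (voicing_onset - burst_onset) / sample_rate


# Toy signal: low-level closure noise followed by a sharp burst sample.
toy_signal = [0.01] * 100 + [0.5] + [0.2] * 50
toy_pi = plosion_index(toy_signal, n0=100, gap=10, window=50)

# Hypothetical onsets (in samples) at a 16 kHz sampling rate.
toy_vot = voice_onset_time_ms(burst_onset=1600, voicing_onset=2400,
                              sample_rate=16000)
```

A large ratio at a sample marks an abrupt rise above the preceding low-energy interval, which is the kind of cue a burst-onset detector can threshold. The voice-onset time is then simply the interval between the burst onset and the voicing onset.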