美式英语/r/的声学建模

Acoustic modeling of American English /r/.

作者信息

Espy-Wilson C Y, Boyce S E, Jackson M, Narayanan S, Alwan A

机构信息

Electrical and Computer Engineering Department, Boston University, Massachusetts 02215, USA.

出版信息

J Acoust Soc Am. 2000 Jul;108(1):343-56. doi: 10.1121/1.429469.

DOI:10.1121/1.429469

PMID:10923897

Abstract

Recent advances in physiological data collection methods have made it possible to test the accuracy of predictions against speaker-specific vocal tracts and acoustic patterns. Vocal tract dimensions for /r/ derived via magnetic-resonance imaging (MRI) for two speakers of American English [Alwan, Narayanan, and Haker, J. Acoust. Soc. Am. 101, 1078-1089 (1997)] were used to construct models of the acoustics of /r/. Because previous models have not sufficiently accounted for the very low F3 characteristic of /r/, the aim was to match formant frequencies predicted by the models to the full range of formant frequency values produced by the speakers in recordings of real words containing /r/. In one set of experiments, area functions derived from MRI data were used to argue that the Perturbation Theory of tube acoustics cannot adequately account for /r/, primarily because predicted locations did not match speakers' actual constriction locations. Different models of the acoustics of /r/ were tested using the Maeda computer simulation program [Maeda, Speech Commun. 1, 199-299 (1982)]; the supralingual vocal-tract dimensions reported in Alwan et al. were found to be adequate at predicting only the highest of attested F3 values. By using (1) a recently developed adaptation of the Maeda model that incorporates the sublingual space as a side branch from the front cavity, and by including (2) the sublingual space as an increment to the dimensions of the front cavity, the mid-to-low values of the speakers' F3 range were matched. Finally, a simple tube model with dimensions derived from MRI data was developed to account for cavity affiliations. This confirmed F3 as a front cavity resonance, and variations in F1, F2, and F4 as arising from mid- and back-cavity geometries. Possible trading relations for F3 lowering based on different acoustic mechanisms for extending the front cavity are also proposed.

摘要

生理数据收集方法的最新进展使得针对特定说话者的声道和声学模式来测试预测准确性成为可能。通过磁共振成像（MRI）得出的两位美式英语发音者发/r/音时的声道尺寸[阿尔万、纳拉亚南和哈克，《美国声学学会杂志》101, 1078 - 1089（1997）]被用于构建/r/音的声学模型。由于先前的模型没有充分考虑/r/音非常低的第三共振峰（F3）特征，目标是使模型预测的共振峰频率与发音者在包含/r/音的真实单词录音中产生的共振峰频率值的全范围相匹配。在一组实验中，从MRI数据得出的面积函数被用于论证管声学微扰理论不能充分解释/r/音，主要是因为预测位置与发音者实际的收缩位置不匹配。使用前田计算机模拟程序[前田，《语音通信》1, 199 - 299（1982）]测试了不同的/r/音声学模型；发现阿尔万等人报告的舌上声道尺寸仅能充分预测已证实的最高F3值。通过使用（1）前田模型的一种最近开发的变体，该变体将舌下空间纳入作为前腔的一个侧支，并且通过将（2）舌下空间作为前腔尺寸的一个增量，发音者F3范围的中低值得以匹配。最后，开发了一个具有从MRI数据得出的尺寸的简单管模型来解释腔的归属关系。这证实了F3是前腔共振，并且第一共振峰（F1）、第二共振峰（F2）和第四共振峰（F4）的变化是由中腔和后腔的几何形状引起的。还提出了基于扩展前腔的不同声学机制使F3降低的可能权衡关系。

相似文献

Acoustic modeling of American English /r/.

J Acoust Soc Am. 2000 Jul;108(1):343-56. doi: 10.1121/1.429469.

A magnetic resonance imaging-based articulatory and acoustic study of "retroflex" and "bunched" American English /r/.

J Acoust Soc Am. 2008 Jun;123(6):4466-81. doi: 10.1121/1.2902168.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.

Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

The Black Book of Psychotropic Dosing and Monitoring.

Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.

Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.

Sexual Harassment and Prevention Training

Non-speech oral motor treatment for children with developmental speech sound disorders.

Cochrane Database Syst Rev. 2015 Mar 25;2015(3):CD009383. doi: 10.1002/14651858.CD009383.pub2.

引用本文的文献

Tongue Root Configuration of the Apicoalveolar Trill /r/: An Ultrasound Imaging Study.

Commun Disord Q. 2023 May;44(3):143-151. doi: 10.1177/15257401221111531. Epub 2022 Jul 20.

What R Mandarin Chinese /ɹ/s? - acoustic and articulatory features of Mandarin Chinese rhotics.

Phonetica. 2024 Sep 16;81(5):509-552. doi: 10.1515/phon-2023-0023. Print 2024 Oct 28.

Artificial Intelligence-Assisted Speech Therapy for /ɹ/: A Single-Case Experimental Study.

Am J Speech Lang Pathol. 2024 Sep 18;33(5):2461-2486. doi: 10.1044/2024_AJSLP-23-00448. Epub 2024 Aug 22.

Evaluating acoustic representations and normalization for rhoticity classification in children with speech sound disorders.

JASA Express Lett. 2024 Feb 1;4(2). doi: 10.1121/10.0024632.

Relating Acoustic Measures to Listener Ratings of Children's Productions of Word-Initial /ɹ/ and /w/.

J Speech Lang Hear Res. 2023 Sep 13;66(9):3413-3427. doi: 10.1044/2023_JSLHR-22-00713. Epub 2023 Aug 17.

Classification of accurate and misarticulated /r/ for ultrasound biofeedback using tongue part displacement trajectories.

Clin Linguist Phon. 2023 Feb 1;37(2):196-222. doi: 10.1080/02699206.2022.2039777. Epub 2022 Mar 7.

Telepractice Treatment of Residual Rhotic Errors Using App-Based Biofeedback: A Pilot Study.

Lang Speech Hear Serv Sch. 2022 Apr 11;53(2):256-274. doi: 10.1044/2021_LSHSS-21-00084. Epub 2022 Jan 20.

Comparing Biofeedback Types for Children With Residual /ɹ/ Errors in American English: A Single-Case Randomization Design.

Am J Speech Lang Pathol. 2021 Jul 14;30(4):1819-1845. doi: 10.1044/2021_AJSLP-20-00216. Epub 2021 Jul 7.

Does Early Phonetic Differentiation Predict Later Phonetic Development? Evidence From a Longitudinal Study of /ɹ/ Development in Preschool Children.

J Speech Lang Hear Res. 2021 Jul 16;64(7):2417-2437. doi: 10.1044/2021_JSLHR-20-00555. Epub 2021 May 31.

Evaluation of a Wireless Tongue Tracking System on the Identification of Phoneme Landmarks.

IEEE Trans Biomed Eng. 2021 Apr;68(4):1190-1197. doi: 10.1109/TBME.2020.3023284. Epub 2021 Mar 22.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

美式英语/r/的声学建模

Acoustic modeling of American English /r/.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献