Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique.

Authors

Atal B S, Chang J J, Mathews M V, Tukey J W

Publication information

J Acoust Soc Am. 1978 May;63(5):1535-53. doi: 10.1121/1.381848.

DOI:10.1121/1.381848
PMID:690333
Abstract

We present numerical methods for studying the relationship between the shape of the vocal tract and its acoustic output. For a stationary vocal tract, the articulatory-acoustic relationship can be represented as a multidimensional function of a multidimensional argument: y=f(x), where x, y are vectors describing the vocal-tract shape and the resulting acoustic output, respectively. Assuming that y may be computed for any x, we develop a procedure for inverting f(x). Inversion by computer sorting consists of computing y for many values of x and sorting the resulting (y,x) pairs into a convenient order according to y; x for a given y is then obtained by looking up y in the sorted data. Application of this method for determining parameters of an articulatory model corresponding to a given set of formant frequencies is presented. A method is also described for finding articulatory regions (fibers) which map into a single point in the acoustic space. The local nature of f(x) is determined by linearization in a small neighborhood. Larger regions are explored by extending the linear neighborhoods in small steps. This method was applied for the study of compensatory articulation. Sounds produced by various articulations along a fiber were synthesized and were compared by informal listening tests. These tests show that, in many cases of interest, a given sound could be produced by many different vocal-tract shapes.
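
The sorting-based inversion described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: `toy_forward` is a hypothetical stand-in for f(x) and not the authors' articulatory model, and quantizing y into bins of width `BIN_WIDTH` is just one possible "convenient order" for sorting, not necessarily the one used in the paper.

```python
import bisect


def toy_forward(x):
    """Hypothetical stand-in for f(x); not an articulatory model."""
    a, b = x
    return (a + b, a * b)


def quantize(y, bin_width):
    """Map an acoustic vector y to an integer bin key that can be sorted and matched."""
    return tuple(round(v / bin_width) for v in y)


def build_sorted_table(forward, x_samples, bin_width):
    """Compute y for every sampled x and sort the (key(y), x) pairs by key."""
    pairs = sorted((quantize(forward(x), bin_width), x) for x in x_samples)
    keys = [key for key, _ in pairs]
    return keys, pairs


def lookup(keys, pairs, y, bin_width):
    """Return every sampled x whose output falls in the same bin as y."""
    key = quantize(y, bin_width)
    lo = bisect.bisect_left(keys, key)
    hi = bisect.bisect_right(keys, key)
    return [x for _, x in pairs[lo:hi]]


BIN_WIDTH = 0.05  # assumed resolution of the acoustic bins
grid = [(i / 10, j / 10) for i in range(11) for j in range(11)]  # sampled x values
keys, pairs = build_sorted_table(toy_forward, grid, BIN_WIDTH)

target = toy_forward((0.2, 0.7))
matches = lookup(keys, pairs, target, BIN_WIDTH)
print(matches)
```

Because the toy map is symmetric in its two arguments, the lookup returns more than one x for the same y. This is a small-scale analogue of the abstract's conclusion that a given sound can be produced by many different vocal-tract shapes.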


Similar articles

1
Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique.
J Acoust Soc Am. 1978 May;63(5):1535-53. doi: 10.1121/1.381848.
2
Incorporation of phonetic constraints in acoustic-to-articulatory inversion.
J Acoust Soc Am. 2008 Apr;123(4):2310-23. doi: 10.1121/1.2885747.
3
Vocal tract normalization for midsagittal articulatory recovery with analysis-by-synthesis.
J Acoust Soc Am. 1999 Aug;106(2):1090-105. doi: 10.1121/1.427117.
4
Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion.
J Acoust Soc Am. 2005 Jul;118(1):444-60. doi: 10.1121/1.1921448.
5
Acquisition of vowel articulation in childhood investigated by acoustic-to-articulatory inversion.
Infant Behav Dev. 2017 Feb;46:178-193. doi: 10.1016/j.infbeh.2017.01.007. Epub 2017 Feb 20.
6
Articulatory measurement and synthesis. Methods and preliminary results.
Phonetica. 1979;36(4-5):294-301. doi: 10.1159/000259967.
7
Acoustic measurements of articulator motions.
Phonetica. 1979;36(4-5):302-13. doi: 10.1159/000259968.
8
A model of speech production based on the acoustic relativity of the vocal tract.
J Acoust Soc Am. 2019 Oct;146(4):2522. doi: 10.1121/1.5127756.
9
A modeling investigation of articulatory variability and acoustic stability during American English /r/ production.
J Acoust Soc Am. 2005 May;117(5):3196-212. doi: 10.1121/1.1893271.
10
An auditory-feedback-based neural network model of speech production that is robust to developmental changes in the size and shape of the articulatory system.
J Speech Lang Hear Res. 2000 Jun;43(3):721-36. doi: 10.1044/jslhr.4303.721.

Articles citing this paper

1
A practical guide to calculating vocal tract length and scale-invariant formant patterns.
Behav Res Methods. 2024 Sep;56(6):5588-5604. doi: 10.3758/s13428-023-02288-x. Epub 2023 Dec 29.
2
Sigma-Lognormal Modeling of Speech.
Cognit Comput. 2021;13(2):488-503. doi: 10.1007/s12559-020-09803-8. Epub 2021 Feb 7.
3
Which way to the dawn of speech?: Reanalyzing half a century of debates and data in light of speech science.
Sci Adv. 2019 Dec 11;5(12):eaaw3916. doi: 10.1126/sciadv.aaw3916. eCollection 2019 Dec.
4
A thirteenth-century theory of speech.
J Acoust Soc Am. 2019 Aug;146(2):937. doi: 10.1121/1.5119126.
5
Formant Space Reconstruction From Brain Activity in Frontal and Temporal Regions Coding for Heard Vowels.
Front Hum Neurosci. 2019 Feb 8;13:32. doi: 10.3389/fnhum.2019.00032. eCollection 2019.
6
Variability of articulator positions and formants across nine English vowels.
J Phon. 2018 May;68:1-14. doi: 10.1016/j.wocn.2018.01.003. Epub 2018 Feb 23.
7
Silent Speech Recognition as an Alternative Communication Device for Persons with Laryngectomy.
IEEE/ACM Trans Audio Speech Lang Process. 2017 Dec;25(12):2386-2398. doi: 10.1109/TASLP.2017.2740000. Epub 2017 Nov 28.
8
Human Sensorimotor Cortex Control of Directly Measured Vocal Tract Movements during Vowel Production.
J Neurosci. 2018 Mar 21;38(12):2955-2966. doi: 10.1523/JNEUROSCI.2382-17.2018. Epub 2018 Feb 8.
9
What drives the perceptual change resulting from speech motor adaptation? Evaluation of hypotheses in a Bayesian modeling framework.
PLoS Comput Biol. 2018 Jan 22;14(1):e1005942. doi: 10.1371/journal.pcbi.1005942. eCollection 2018 Jan.
10
Advances in real-time magnetic resonance imaging of the vocal tract for speech science and technology research.
APSIPA Trans Signal Inf Process. 2016;5. doi: 10.1017/ATSIP.2016.5. Epub 2016 Mar 31.