• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Using automatic alignment to analyze endangered language data: testing the viability of untrained alignment.使用自动对齐分析濒危语言数据:测试未训练对齐的可行性。
J Acoust Soc Am. 2013 Sep;134(3):2235-46. doi: 10.1121/1.4816491.
2
Estimation of voice-onset time in continuous speech using temporal measures.使用时间测量方法估计连续语音中的语音起始时间。
J Acoust Soc Am. 2014 Aug;136(2):EL122-8. doi: 10.1121/1.4885768.
3
Automatic measurement of voice onset time using discriminative structured prediction.基于判别结构预测的语音起始时间自动测量。
J Acoust Soc Am. 2012 Dec;132(6):3965-79. doi: 10.1121/1.4763995.
4
Automatic alignment for New Englishes: Applying state-of-the-art aligners to Trinidadian English.新英语的自动对齐:将最先进的对齐工具应用于特立尼达英语。
J Acoust Soc Am. 2020 Apr;147(4):2283. doi: 10.1121/10.0001069.
5
Syllabic reduction in Mandarin and English speech.汉语和英语语音中的音节缩减
J Acoust Soc Am. 2014 Jun;135(6):EL270-6. doi: 10.1121/1.4874357.
6
Reliability of human-supervised formant-trajectory measurement for forensic voice comparison.人类监督的共振峰轨迹测量在法医语音比较中的可靠性。
J Acoust Soc Am. 2013 Jan;133(1):EL54-60. doi: 10.1121/1.4773223.
7
Emotional speech acoustic model for Malay: iterative versus isolated unit training.马来语情感语音声学模型:迭代与孤立单元训练。
J Acoust Soc Am. 2013 Oct;134(4):3057-66. doi: 10.1121/1.4818741.
8
Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model.基于全极点模型,通过估计听觉频谱和倒谱系数,对连续语音进行自动语音病理学检测。
J Voice. 2016 Nov;30(6):757.e7-757.e19. doi: 10.1016/j.jvoice.2015.08.010. Epub 2015 Oct 27.
9
Target-locus scaling for modeling formant transitions in vowel + consonant + vowel utterances.用于模拟元音+辅音+元音发音中元音共振峰过渡的目标轨迹缩放。
J Acoust Soc Am. 2017 Mar;141(3):EL192. doi: 10.1121/1.4976139.
10
A cross-dialectal acoustic study of Saterland Frisian vowels.萨特兰弗里斯兰语元音的跨方言声学研究。
J Acoust Soc Am. 2017 Apr;141(4):2893. doi: 10.1121/1.4980855.

引用本文的文献

1
The Mason-Alberta Phonetic Segmenter: a forced alignment system based on deep neural networks and interpolation.梅森-阿尔伯塔音标分段器:一种基于深度神经网络和插值的强制对齐系统。
Phonetica. 2024 Sep 5;81(5):451-508. doi: 10.1515/phon-2024-0015. Print 2024 Oct 28.
2
Extreme stop allophony in Mixtec spontaneous speech: data, word prosody, and modelling.米斯特克语自然话语中的极端塞音音变:数据、单词韵律及建模
J Phon. 2022 May;92. doi: 10.1016/j.wocn.2022.101147. Epub 2022 Apr 20.
3
Improvement of Acoustic Models Fused with Lip Visual Information for Low-Resource Speech.融合唇动视觉信息的低资源语音声学模型改进
Sensors (Basel). 2023 Feb 12;23(4):2071. doi: 10.3390/s23042071.
4
Computational Modelling of Tone Perception Based on Direct Processing of Contours.基于轮廓直接处理的音调感知计算模型
Brain Sci. 2022 Mar 2;12(3):337. doi: 10.3390/brainsci12030337.
5
Advances in Completely Automated Vowel Analysis for Sociophonetics: Using End-to-End Speech Recognition Systems With DARLA.社会语音学中全自动化元音分析的进展:使用带有DARLA的端到端语音识别系统
Front Artif Intell. 2021 Sep 24;4:662097. doi: 10.3389/frai.2021.662097. eCollection 2021.

本文引用的文献

1
Nouns and verbs in Chintang: children's usage and surrounding adult speech.《昌都方言中的名词和动词:儿童使用情况以及周围成人言语》
J Child Lang. 2012 Mar;39(2):284-321. doi: 10.1017/S0305000911000080. Epub 2011 Aug 22.
2
Speaker-Independent Phoneme Alignment Using Transition-Dependent States.使用依赖于过渡的状态进行与说话者无关的音素对齐
Speech Commun. 2009 Apr;51(4):352-368. doi: 10.1016/j.specom.2008.11.003.
3
Perception of coarticulatory nasalization by speakers of English and Thai: evidence for partial compensation.以英语和泰语为母语者对协同发音鼻化的感知:部分补偿的证据。
J Acoust Soc Am. 1999 Nov;106(5):2868-87. doi: 10.1121/1.428111.
4
Detection of target phonemes in spontaneous and read speech.在自发语音和朗读语音中检测目标音素。
Lang Speech. 1988 Apr-Jun;31 ( Pt 2):135-56. doi: 10.1177/002383098803100203.
5
Information for Mandarin tones in the amplitude contour and in brief segments.关于汉语声调在幅度轮廓及简短片段中的信息。
Phonetica. 1992;49(1):25-47. doi: 10.1159/000261901.
6
On the role of perception in shaping phonological assimilation rules.论感知在塑造音位同化规则中的作用。
Lang Speech. 1992 Jan-Jun;35 ( Pt 1-2):59-72. doi: 10.1177/002383099203500206.

使用自动对齐分析濒危语言数据:测试未训练对齐的可行性。

Using automatic alignment to analyze endangered language data: testing the viability of untrained alignment.

机构信息

Haskins Laboratories, 300 George Street, New Haven, Connecticut 06511, USA.

出版信息

J Acoust Soc Am. 2013 Sep;134(3):2235-46. doi: 10.1121/1.4816491.

DOI:10.1121/1.4816491
PMID:23967953
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5392066/
Abstract

While efforts to document endangered languages have steadily increased, the phonetic analysis of endangered language data remains a challenge. The transcription of large documentation corpora is, by itself, a tremendous feat. Yet, the process of segmentation remains a bottleneck for research with data of this kind. This paper examines whether a speech processing tool, forced alignment, can facilitate the segmentation task for small data sets, even when the target language differs from the training language. The authors also examined whether a phone set with contextualization outperforms a more general one. The accuracy of two forced aligners trained on English (hmalign and p2fa) was assessed using corpus data from Yoloxóchitl Mixtec. Overall, agreement performance was relatively good, with accuracy at 70.9% within 30 ms for hmalign and 65.7% within 30 ms for p2fa. Segmental and tonal categories influenced accuracy as well. For instance, additional stop allophones in hmalign's phone set aided alignment accuracy. Agreement differences between aligners also corresponded closely with the types of data on which the aligners were trained. Overall, using existing alignment systems was found to have potential for making phonetic analysis of small corpora more efficient, with more allophonic phone sets providing better agreement than general ones.

摘要

虽然记录濒危语言的努力一直在稳步增加,但对濒危语言数据的语音分析仍然是一个挑战。对大型文献语料库进行转写本身就是一项艰巨的任务。然而,对于这种类型的数据,分割过程仍然是研究的瓶颈。本文探讨了语音处理工具强制对齐是否可以促进小数据集的分割任务,即使目标语言与训练语言不同。作者还研究了具有语境化的音位集是否优于更通用的音位集。作者使用来自 Yoloxóchitl Mixtec 的语料库数据评估了针对英语的两种强制对齐器(hmalign 和 p2fa)的准确性。总体而言,协议性能相对较好,hmalign 的准确率为 70.9%,在 30ms 内,p2fa 的准确率为 65.7%。分段和声调类别也会影响准确性。例如,hmalign 音位集中的额外停止音位有助于提高对齐准确性。对齐器之间的协议差异也与对齐器所训练的数据类型密切相关。总的来说,使用现有的对齐系统有可能提高对小语料库的语音分析效率,使用更多的变音位集比使用通用音位集可以获得更好的协议。