• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于米斯特克语语音资源的开发。

On the development of speech resources for the Mixtec language.

作者信息

Caballero-Morales Santiago-Omar

机构信息

Technological University of the Mixteca, Road to Acatlima K.m. 2.5, 69000 Huajuapan de León, OAX, Mexico.

出版信息

ScientificWorldJournal. 2013 Apr 16;2013:170649. doi: 10.1155/2013/170649. Print 2013.

DOI:10.1155/2013/170649
PMID:23710134
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3654258/
Abstract

The Mixtec language is one of the main native languages in Mexico. In general, due to urbanization, discrimination, and limited attempts to promote the culture, the native languages are disappearing. Most of the information available about the Mixtec language is in written form as in dictionaries which, although including examples about how to pronounce the Mixtec words, are not as reliable as listening to the correct pronunciation from a native speaker. Formal acoustic resources, as speech corpora, are almost non-existent for the Mixtec, and no speech technologies are known to have been developed for it. This paper presents the development of the following resources for the Mixtec language: (1) a speech database of traditional narratives of the Mixtec culture spoken by a native speaker (labelled at the phonetic and orthographic levels by means of spectral analysis) and (2) a native speaker-adaptive automatic speech recognition (ASR) system (trained with the speech database) integrated with a Mixtec-to-Spanish/Spanish-to-Mixtec text translator. The speech database, although small and limited to a single variant, was reliable enough to build the multiuser speech application which presented a mean recognition/translation performance up to 94.36% in experiments with non-native speakers (the target users).

摘要

米斯特克语是墨西哥主要的本土语言之一。总体而言,由于城市化、歧视以及推广该文化的尝试有限,本土语言正在逐渐消失。关于米斯特克语的现有信息大多以书面形式呈现,如字典中的内容,尽管字典包含了米斯特克语单词发音的示例,但不如从母语者那里听到正确发音可靠。对于米斯特克语来说,像语音语料库这样的正式声学资源几乎不存在,也没有已知的针对该语言开发的语音技术。本文介绍了为米斯特克语开发的以下资源:(1) 一个由母语者讲述的米斯特克文化传统故事的语音数据库(通过频谱分析在语音和正字法层面进行标注),以及 (2) 一个与米斯特克语到西班牙语/西班牙语到米斯特克语文本翻译器集成的母语者自适应自动语音识别(ASR)系统(使用语音数据库进行训练)。该语音数据库虽然规模较小且仅限于单一变体,但足够可靠以构建多用户语音应用程序,在针对非母语者(目标用户)的实验中,该应用程序的平均识别/翻译性能高达94.36%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/a1bbbb33c07e/TSWJ2013-170649.020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/9844f767b82d/TSWJ2013-170649.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/c0edc9947336/TSWJ2013-170649.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/b1c404995d81/TSWJ2013-170649.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/856d131ad702/TSWJ2013-170649.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/f94d9faa9207/TSWJ2013-170649.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/d582224e8054/TSWJ2013-170649.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/7c3aa4b84c61/TSWJ2013-170649.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/32303716b31c/TSWJ2013-170649.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/3229ade26314/TSWJ2013-170649.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/4e9f3d824e62/TSWJ2013-170649.010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/10851caaa9dd/TSWJ2013-170649.011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/b3504a2b369b/TSWJ2013-170649.012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/2d382d0a7874/TSWJ2013-170649.013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/7c31bb9e03ac/TSWJ2013-170649.014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/07ebd143f003/TSWJ2013-170649.015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/8d39e86df13a/TSWJ2013-170649.016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/5bb34575ae7b/TSWJ2013-170649.017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/e54ad3d2412a/TSWJ2013-170649.018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/6568ff00d193/TSWJ2013-170649.019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/a1bbbb33c07e/TSWJ2013-170649.020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/9844f767b82d/TSWJ2013-170649.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/c0edc9947336/TSWJ2013-170649.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/b1c404995d81/TSWJ2013-170649.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/856d131ad702/TSWJ2013-170649.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/f94d9faa9207/TSWJ2013-170649.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/d582224e8054/TSWJ2013-170649.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/7c3aa4b84c61/TSWJ2013-170649.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/32303716b31c/TSWJ2013-170649.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/3229ade26314/TSWJ2013-170649.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/4e9f3d824e62/TSWJ2013-170649.010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/10851caaa9dd/TSWJ2013-170649.011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/b3504a2b369b/TSWJ2013-170649.012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/2d382d0a7874/TSWJ2013-170649.013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/7c31bb9e03ac/TSWJ2013-170649.014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/07ebd143f003/TSWJ2013-170649.015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/8d39e86df13a/TSWJ2013-170649.016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/5bb34575ae7b/TSWJ2013-170649.017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/e54ad3d2412a/TSWJ2013-170649.018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/6568ff00d193/TSWJ2013-170649.019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8933/3654258/a1bbbb33c07e/TSWJ2013-170649.020.jpg

相似文献

1
On the development of speech resources for the Mixtec language.关于米斯特克语语音资源的开发。
ScientificWorldJournal. 2013 Apr 16;2013:170649. doi: 10.1155/2013/170649. Print 2013.
2
Traditional knowledge and use of wild mushrooms by Mixtecs or Ñuu savi, the people of the rain, from Southeastern Mexico.墨西哥东南部的米斯特克人或Ñuu savi(意为“雨之民”)对野生蘑菇的传统认知与利用。
J Ethnobiol Ethnomed. 2016 Sep 5;12(1):35. doi: 10.1186/s13002-016-0108-9.
3
Deep learning applications in telerehabilitation speech therapy scenarios.深度学习在远程康复语音治疗场景中的应用。
Comput Biol Med. 2022 Sep;148:105864. doi: 10.1016/j.compbiomed.2022.105864. Epub 2022 Jul 12.
4
Automatic speech recognition (ASR) and its use as a tool for assessment or therapy of voice, speech, and language disorders.自动语音识别(ASR)及其作为评估或治疗嗓音、言语和语言障碍的工具的应用。
Logoped Phoniatr Vocol. 2009;34(2):91-6. doi: 10.1080/14015430802657216.
5
Improvement of Acoustic Models Fused with Lip Visual Information for Low-Resource Speech.融合唇动视觉信息的低资源语音声学模型改进
Sensors (Basel). 2023 Feb 12;23(4):2071. doi: 10.3390/s23042071.
6
A reduced ambiguity lexical system.一个减少歧义的词汇系统。
Biomed Sci Instrum. 2004;40:424-8.
7
Pronunciation difficulty, temporal regularity, and the speech-to-song illusion.发音难度、时间规律性与言语-歌曲错觉。
Front Psychol. 2015 Jan 29;6:48. doi: 10.3389/fpsyg.2015.00048. eCollection 2015.
8
Retrospective Analysis of Clinical Performance of an Estonian Speech Recognition System for Radiology: Effects of Different Acoustic and Language Models.回顾性分析爱沙尼亚语音识别系统在放射学中的临床性能:不同声学和语言模型的影响。
J Digit Imaging. 2018 Oct;31(5):615-621. doi: 10.1007/s10278-018-0085-8.
9
Non-native acoustic modeling for mispronunciation verification based on language adversarial representation learning.基于语言对抗表示学习的非母语发音验证的声学建模。
Neural Netw. 2021 Oct;142:597-607. doi: 10.1016/j.neunet.2021.07.017. Epub 2021 Jul 17.
10
Linguistic disparities in cross-language automatic speech recognition transfer from Arabic to Tashlhiyt.从阿拉伯语到塔什利特语的跨语言自动语音识别中的语言差异
Sci Rep. 2024 Jan 3;14(1):313. doi: 10.1038/s41598-023-50516-3.

引用本文的文献

1
Identifying and Addressing Genetic Counseling Challenges among Indigenous People of Oaxaca-One Center's Experience with Two Immigrant Farmworker Families in the Central Valley of California.识别并应对瓦哈卡州原住民中的遗传咨询挑战——加利福尼亚中央谷地一个中心对两个移民农场工人家庭的经验
J Genet Couns. 2018 Aug;27(4):996-1004. doi: 10.1007/s10897-018-0221-0. Epub 2018 Feb 3.

本文引用的文献

1
Automatic speech recognition and training for severely dysarthric users of assistive technology: the STARDUST project.针对严重构音障碍的辅助技术用户的自动语音识别与训练:星尘项目。
Clin Linguist Phon. 2006 Apr-May;20(2-3):149-56. doi: 10.1080/02699200400026884.