• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

跨越方言语料库构建中的转录障碍:以荷兰南方方言语料库为例

Clearing the Transcription Hurdle in Dialect Corpus Building: The Corpus of Southern Dutch Dialects as Case Study.

作者信息

Ghyselen Anne-Sophie, Breitbarth Anne, Farasyn Melissa, Van Keymeulen Jacques, van Hessen Arjan

机构信息

Department of Linguistics, Ghent University, Ghent, Belgium.

Variaties VZW, Umbrella Organisation for Dialects and Oral Heritage, Brussels, Belgium.

出版信息

Front Artif Intell. 2020 Apr 15;3:10. doi: 10.3389/frai.2020.00010. eCollection 2020.

DOI:10.3389/frai.2020.00010
PMID:33733130
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7861295/
Abstract

This paper discusses how the transcription hurdle in dialect corpus building can be cleared. While corpus analysis has strongly gained in popularity in linguistic research, dialect corpora are still relatively scarce. This scarcity can be attributed to several factors, one of which is the challenging nature of transcribing dialects, given a lack of both orthographic norms for many dialects and speech technological tools trained on dialect data. This paper addresses the questions (i) how dialects can be transcribed efficiently and (ii) whether speech technological tools can lighten the transcription work. These questions are tackled using the Southern Dutch dialects (SDDs) as case study, for which the usefulness of automatic speech recognition (ASR), respeaking, and forced alignment is considered. Tests with these tools indicate that dialects still constitute a major speech technological challenge. In the case of the SDDs, the decision was made to use speech technology only for the word-level segmentation of the audio files, as the transcription itself could not be sped up by ASR tools. The discussion does however indicate that the usefulness of ASR and other related tools for a dialect corpus project is strongly determined by the sound quality of the dialect recordings, the availability of statistical dialect-specific models, the degree of linguistic differentiation between the dialects and the standard language, and the goals the transcripts have to serve.

摘要

本文探讨了如何消除方言语料库建设中的转录障碍。虽然语料库分析在语言学研究中越来越受欢迎,但方言语料库仍然相对较少。这种稀缺性可归因于几个因素,其中之一是方言转录具有挑战性,因为许多方言缺乏拼写规范,且缺乏基于方言数据训练的语音技术工具。本文探讨了以下问题:(i)如何高效转录方言;(ii)语音技术工具是否能减轻转录工作。本文以荷兰南方方言(SDDs)为案例研究来解决这些问题,研究了自动语音识别(ASR)、重新朗读和强制对齐的实用性。使用这些工具进行的测试表明,方言仍然构成主要的语音技术挑战。就荷兰南方方言而言,由于ASR工具无法加快转录速度,因此决定仅将语音技术用于音频文件的词级分割。然而,讨论表明,ASR和其他相关工具对方言语料库项目的实用性很大程度上取决于方言录音的音质、特定方言统计模型的可用性、方言与标准语言之间的语言差异程度以及转录文本的用途。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57aa/7861295/c619b5928892/frai-03-00010-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57aa/7861295/e3e7a1b9c828/frai-03-00010-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57aa/7861295/a283c4dfee0d/frai-03-00010-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57aa/7861295/19c2c18a3e78/frai-03-00010-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57aa/7861295/0c93780eddd8/frai-03-00010-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57aa/7861295/c619b5928892/frai-03-00010-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57aa/7861295/e3e7a1b9c828/frai-03-00010-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57aa/7861295/a283c4dfee0d/frai-03-00010-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57aa/7861295/19c2c18a3e78/frai-03-00010-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57aa/7861295/0c93780eddd8/frai-03-00010-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57aa/7861295/c619b5928892/frai-03-00010-g0005.jpg

相似文献

1
Clearing the Transcription Hurdle in Dialect Corpus Building: The Corpus of Southern Dutch Dialects as Case Study.跨越方言语料库构建中的转录障碍:以荷兰南方方言语料库为例
Front Artif Intell. 2020 Apr 15;3:10. doi: 10.3389/frai.2020.00010. eCollection 2020.
2
Advances in Completely Automated Vowel Analysis for Sociophonetics: Using End-to-End Speech Recognition Systems With DARLA.社会语音学中全自动化元音分析的进展:使用带有DARLA的端到端语音识别系统
Front Artif Intell. 2021 Sep 24;4:662097. doi: 10.3389/frai.2021.662097. eCollection 2021.
3
Consonants, vowels and tones across Vietnamese dialects.越南各地方言中的辅音、元音和声调。
Int J Speech Lang Pathol. 2016 Apr;18(2):122-34. doi: 10.3109/17549507.2015.1101162. Epub 2016 Feb 6.
4
The Nationwide Speech Project: A new corpus of American English dialects.全国性言语项目:美国英语方言的一个新语料库。
Speech Commun. 2006 Jun 1;48(6):633-644. doi: 10.1016/j.specom.2005.09.010.
5
Dataset for the recognition of Kurdish sound dialects.库尔德语音方言识别数据集。
Data Brief. 2024 Feb 22;53:110231. doi: 10.1016/j.dib.2024.110231. eCollection 2024 Apr.
6
Quantification of the effects of Mandarin dialect differences on the use of norm-referenced speech perception tests.量化汉语方言差异对常模参照言语感知测试使用的影响。
Int J Audiol. 2015 Jul;54(7):461-6. doi: 10.3109/14992027.2014.1001075. Epub 2015 Feb 26.
7
The voice as a material clue: a new forensic Algerian Corpus.作为物质线索的声音:一个新的阿尔及利亚法医语料库。
Multimed Tools Appl. 2023 Mar 15:1-19. doi: 10.1007/s11042-023-14412-2.
8
Considerations of dialect on the identification of speech sound disorder in Vietnamese-speaking children.考虑越南语儿童言语障碍识别中的方言因素。
Int J Lang Commun Disord. 2024 Nov-Dec;59(6):2208-2216. doi: 10.1111/1460-6984.12992. Epub 2023 Dec 17.
9
Hate speech detection with ADHAR: a multi-dialectal hate speech corpus in Arabic.使用ADHAR进行仇恨言论检测:一个阿拉伯语多方言仇恨言论语料库。
Front Artif Intell. 2024 May 30;7:1391472. doi: 10.3389/frai.2024.1391472. eCollection 2024.
10
Some acoustic cues for the perceptual categorization of American English regional dialects.一些用于美国英语地域方言感知分类的声学线索。
J Phon. 2004 Jan 1;32(1):111-140. doi: 10.1016/s0095-4470(03)00009-3.

引用本文的文献

1
Integrating Textual Queries with AI-Based Object Detection: A Compositional Prompt-Guided Approach.将文本查询与基于人工智能的目标检测相结合:一种组合式提示引导方法。
Sensors (Basel). 2025 Apr 3;25(7):2258. doi: 10.3390/s25072258.