吉甘特-KTTS数据集：致力于构建一个用于库尔德语语音合成系统的大型数据集。

Gigant-KTTS dataset: Towards building an extensive gigant dataset for Kurdish text-to-speech systems.

作者信息

Ahmad Hawraz A, Rashid Tarik A

机构信息

Department of Software and Informatics Engineering, Salahaddin University-Erbil, Erbil, KR, Iraq.

Department of Computer Science and Engineering, University of Kurdistan Hewler, Erbil, KR, Iraq.

出版信息

Data Brief. 2024 Jul 14;55:110753. doi: 10.1016/j.dib.2024.110753. eCollection 2024 Aug.

DOI:10.1016/j.dib.2024.110753

PMID:39149720

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11324836/

Abstract

Today, speech synthesis is a part of our daily lives in computers all around the world. Central Kurdish Speech Corpus Construction is a speech corpus that is a primary data source for developing a speech system. There are still two main issues that prevent them from achieving the best possible performance, the lack of efficiency in training and analysis, and the difficulty in modelling. The biggest obstacle against text-to-speech in the Kurdish language is that there is a lack of text and speech recognition tools compounded by the fact that around 30 million people speak the Kurdish language in different countries. To address this issue, this corpus introduced a large vocabulary of Kurdish Text-to-Speech Dataset (KTTS, Gigant), including a pronunciation lexicon and speech corpus for the Central Kurdish dialect. A variety of subjects is comprised to record these sentences. The sentences are recorded in a voice recording studio by a Kurdish man who is a dubber. The goal of the speech corpus is to create a collection of sentences that accurately reflect the real data about the Central Kurdish dialect. A combination of audio and visual sources is used to record the 6,078 sentences of 12 document topics. They were recorded in a controlled environment using microphones that were not noisy. The total record duration is 13.63 h. The recorded sentences are in the ".wav" format.

摘要

如今，语音合成已成为全球计算机日常生活的一部分。库尔德语中部语音语料库建设是一个语音语料库，是开发语音系统的主要数据源。仍然存在两个主要问题阻碍它们实现最佳性能，即训练和分析效率低下以及建模困难。库尔德语语音合成面临的最大障碍是缺乏文本和语音识别工具，再加上不同国家约有3000万人讲库尔德语。为了解决这个问题，该语料库引入了一个庞大的库尔德语语音合成数据集（KTTS，Gigant）词汇表，包括库尔德语中部方言的发音词典和语音语料库。录制这些句子涵盖了各种主题。这些句子由一名库尔德配音演员在录音室录制。语音语料库的目标是创建一组能够准确反映库尔德语中部方言真实数据的句子。音频和视觉源相结合用于录制12个文档主题的6078个句子。它们在使用无噪音麦克风的受控环境中录制。总录制时长为13.63小时。录制的句子为“.wav”格式。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f75/11324836/4c8f1d12c235/gr1.jpg

相似文献

Gigant-KTTS dataset: Towards building an extensive gigant dataset for Kurdish text-to-speech systems.吉甘特-KTTS数据集：致力于构建一个用于库尔德语语音合成系统的大型数据集。

Data Brief. 2024 Jul 14;55:110753. doi: 10.1016/j.dib.2024.110753. eCollection 2024 Aug.

Dataset for the recognition of Kurdish sound dialects.库尔德语音方言识别数据集。

Data Brief. 2024 Feb 22;53:110231. doi: 10.1016/j.dib.2024.110231. eCollection 2024 Apr.

Development of Hausa dataset a baseline for speech recognition.豪萨语数据集的开发——语音识别的一个基线。

Data Brief. 2022 Jan 10;40:107820. doi: 10.1016/j.dib.2022.107820. eCollection 2022 Feb.

In the heart of Swahili: An exploration of data collection methods and corpus curation for natural language processing.在斯瓦希里语的核心地带：自然语言处理中数据收集方法与语料库构建的探索

Data Brief. 2024 Jul 17;55:110751. doi: 10.1016/j.dib.2024.110751. eCollection 2024 Aug.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Non-native listeners' recognition of high-variability speech using PRESTO.非母语听众使用PRESTO对高变异性语音的识别。

J Am Acad Audiol. 2014 Oct;25(9):869-92. doi: 10.3766/jaaa.25.9.9.

Database description: Russian fricatives recorded in 198 real speech sentences from 59 speakers.数据库描述：从59位说话者的198个真实语音句子中记录的俄语擦音。

Data Brief. 2023 May 11;48:109205. doi: 10.1016/j.dib.2023.109205. eCollection 2023 Jun.

Dataset of British English speech recordings for psychoacoustics and speech processing research: The clarity speech corpus.用于心理声学和语音处理研究的英式英语语音录音数据集：清晰度语音语料库。

Data Brief. 2022 Feb 15;41:107951. doi: 10.1016/j.dib.2022.107951. eCollection 2022 Apr.

Clearing the Transcription Hurdle in Dialect Corpus Building: The Corpus of Southern Dutch Dialects as Case Study.跨越方言语料库构建中的转录障碍：以荷兰南方方言语料库为例

Front Artif Intell. 2020 Apr 15;3:10. doi: 10.3389/frai.2020.00010. eCollection 2020.

Hate speech detection in the Arabic language: corpus design, construction, and evaluation.阿拉伯语中的仇恨言论检测：语料库设计、构建与评估。

Front Artif Intell. 2024 Feb 20;7:1345445. doi: 10.3389/frai.2024.1345445. eCollection 2024.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

吉甘特-KTTS数据集：致力于构建一个用于库尔德语语音合成系统的大型数据集。

Gigant-KTTS dataset: Towards building an extensive gigant dataset for Kurdish text-to-speech systems.

作者信息

机构信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献