一个乌克兰语真实与合成语音数据集。

A Dataset of Real and Synthetic Speech in Ukrainian.

作者信息

Lipianina-Honcharenko Khrystyna, Bohuta Hennadii, Ivaniush Adam, Soia Mariana

机构信息

West Ukrainian National University, Ternopil, Ukraine.

出版信息

Sci Data. 2025 May 6;12(1):745. doi: 10.1038/s41597-025-05084-8.

DOI:10.1038/s41597-025-05084-8

PMID:40328782

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12056226/

Abstract

This work is dedicated to the analysis and evaluation of the DRSSU dataset: A Dataset of Real and Synthetic Speech in Ukrainian, created to support research in the field of natural language processing and speech recognition. The dataset contains a unique collection of audio recordings that include both real and synthesized Ukrainian speech, providing unprecedented opportunities for improving machine learning algorithms aimed at speech recognition and analysis. The main focus of the research is on identifying statistically significant differences between generated and real speech, which is of great importance for the further development of automatic speech recognition systems. The analysis demonstrates potential applications of the dataset in a wide range of areas, from combating misinformation to supporting linguistic diversity and cultural heritage. The work emphasizes the importance of innovation in the field of NLP and speech processing, with a special focus on the development of technologies adapted to the Ukrainian language.

摘要

这项工作致力于对DRSSU数据集进行分析和评估：这是一个乌克兰语真实与合成语音数据集，旨在支持自然语言处理和语音识别领域的研究。该数据集包含一组独特的音频记录，其中既有真实的乌克兰语语音，也有合成的乌克兰语语音，为改进针对语音识别和分析的机器学习算法提供了前所未有的机会。研究的主要重点是识别生成语音和真实语音之间具有统计显著性的差异，这对自动语音识别系统的进一步发展至关重要。分析表明该数据集在广泛领域具有潜在应用，从打击错误信息到支持语言多样性和文化遗产。这项工作强调了自然语言处理和语音处理领域创新的重要性，特别关注适用于乌克兰语的技术发展。