HiACC: Hinglish adult & children code-switched corpus.

Author information

Singh Shruti, Singh Muskaan, Kadyan Virender

Affiliations

SoCS, University of Petroleum and Energy Studies, Dehradun, Uttarakhand, India.

SCIES, Ulster University, Northland Road, Londonderry, UK.

Publication information

Data Brief. 2025 Jul 17;62:111886. doi: 10.1016/j.dib.2025.111886. eCollection 2025 Oct.

DOI:10.1016/j.dib.2025.111886
PMID:40778380
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12329218/
Abstract

Code-switching, the frequent alternation between two or more languages within a single utterance, is widespread among bilingual and multilingual speakers. In India, more than 250 million people are estimated to engage in code-switched communication, especially blending English with Hindi (Hinglish), making it one of the largest bilingual populations globally. This poses a challenge for developing accurate and robust Automatic Speech Recognition (ASR) systems. Existing ASR models, typically trained on monolingual corpora, struggle with code-switched input because large, balanced, and representative datasets are lacking, particularly for diverse age groups. Recent evaluations have shown that ASR models experience a relative increase in Word Error Rate (WER) of 30-50% when exposed to code-switched speech compared to monolingual input. To address this resource gap, we introduce a benchmark Hinglish speech corpus to improve ASR performance in resource-constrained settings. While several monolingual Hindi and English corpora exist, publicly available code-switched datasets remain scarce, and none to date includes children's speech. Our corpus fills this gap by providing the first code-switched Hinglish speech dataset with recordings from both adults and children. It comprises 3,318 audio segments from adult participants and 1,858 segments from children, covering 5.24 hours of read and spontaneous speech. The transcriptions include detailed annotations and code-switching tags to assist in linguistic and computational analysis. The corpus is publicly available at https://zenodo.org/records/15551669, offering segmented audio and aligned transcripts for open research. We also present baseline ASR experiments, which show that standard models trained on monolingual data underperform by approximately 42% WER on our test set, highlighting the complexity of the task. To our knowledge, this is the first publicly available resource on code-switched Hinglish speech encompassing both adult and child speakers, designed to catalyse progress in this challenging yet important area of speech recognition.
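The abstract reports results as Word Error Rate (WER) and as a relative increase in WER. As a minimal sketch of how those two quantities are conventionally computed (not the authors' evaluation code), the Python below derives WER from a word-level edit distance and the relative increase from two WER values. The function names, the sample Hinglish sentence, and the 0.20 / 0.28 figures are illustrative assumptions and are not taken from the corpus or the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def relative_increase(baseline_wer: float, new_wer: float) -> float:
    """Relative change of new_wer with respect to baseline_wer."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (new_wer - baseline_wer) / baseline_wer


if __name__ == "__main__":
    # Made-up code-switched sentence, for illustration only.
    reference = "mujhe kal office jaana hai"
    hypothesis = "mujhe call office jana hai"
    # Two substituted words out of five reference words.
    print(f"WER: {word_error_rate(reference, hypothesis):.2f}")

    # Hypothetical monolingual vs. code-switched WER values.
    print(f"Relative increase: {relative_increase(0.20, 0.28):.0%}")
```

Because WER is normalised by the reference length, it can exceed 1.0 when the hypothesis contains many insertions. A relative increase of 30-50% means the code-switched WER is 1.3 to 1.5 times the monolingual WER, not 30-50 percentage points higher.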


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4faf/12329218/feecc33884e7/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4faf/12329218/fc21e527422d/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4faf/12329218/116a28ae1f39/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4faf/12329218/9acacd34b1cd/gr4a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4faf/12329218/0a6e21dac00b/gr5.jpg
