A dataset for classifying phrases and sentences into statements, questions, or exclamations based on sound pitch.

Author information

Abdulrahman Ayub Othman, Othman Shanga Ismail, Yasin Gazo Badran, Ali Meer Salam

Affiliation

Department of Computer Science, College of Science, University of Halabja, Kurdistan Region, F.R., Halabja, Iraq.

Publication information

Data Brief. 2025 Jun 24;61:111826. doi: 10.1016/j.dib.2025.111826. eCollection 2025 Aug.

DOI:10.1016/j.dib.2025.111826
PMID:40677266
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12269433/
Abstract

Speech is the most fundamental and sophisticated channel of human communication, and breakthroughs in Natural Language Processing (NLP) have substantially raised the quality of human-computer interaction. In particular, a new wave of deep learning methods has significantly advanced human speech recognition by capturing fine-grained acoustic cues, including pitch, an acoustic feature that can be critical to understanding communicative intent. Pitch variation is particularly important for prosodic classification tasks (i.e., distinguishing statements, questions, and exclamations), which are crucial in tonal and low-resource languages such as Kurdish, where intonation carries significant semantic information. This paper presents the Statements, Questions, or Exclamations Based on Sound Pitch (SQEBSP) dataset, which contains 12,660 professionally recorded speech audio clips from 431 native Kurdish speakers who reside in the Kurdistan Region of Iraq. Each speaker articulated 10 new phrases in each of three prosodic categories: statements, questions, and exclamations. All utterances were digitized at 16 kHz and then manually checked for correct pitch-based classification. The dataset represents all three classes equally, with about 4200 samples per class, and includes metadata such as speaker gender, age group, and sentence identifiers. The original audio files, alongside resources such as Mel-Frequency Cepstral Coefficients (MFCCs) and waveform visualizations, can be found on Mendeley Data. The dataset offers significant advantages for developing and testing pitch-based speech classification algorithms, and it furthers work on pronunciation modelling for low-resource languages. Furthermore, it aids in developing speech technologies that are sensitive to dialects.
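To make the idea of pitch-based classification concrete, the following is a minimal sketch and not the authors' method. It estimates the fundamental frequency (F0) of short frames by autocorrelation and labels the contour by its direction. The 16 kHz sampling rate comes from the abstract; the function names, the 75-400 Hz search range, the 640-sample frame length, the 10 Hz tolerance, and the synthetic sine frames standing in for real recordings are all assumptions made for illustration.

```python
import math

SAMPLE_RATE = 16_000  # Hz, the sampling rate stated in the abstract


def synth_tone(freq_hz, n_samples, sample_rate=SAMPLE_RATE):
    """Return a pure sine tone as a list of floats (a stand-in for a voiced frame)."""
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate) for i in range(n_samples)]


def estimate_f0(frame, sample_rate=SAMPLE_RATE, fmin=75.0, fmax=400.0):
    """Estimate F0 (Hz) of one frame as the lag that maximizes its autocorrelation.

    The autocorrelation is left unnormalized, so longer lags see fewer
    overlapping samples; this mildly favors the shortest matching period.
    The fmin/fmax search range is an assumed range for speech.
    """
    min_lag = int(sample_rate / fmax)
    max_lag = int(sample_rate / fmin)
    if len(frame) <= max_lag:
        raise ValueError("frame is too short for the requested fmin")
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        score = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag


def contour_direction(f0_values, tolerance_hz=10.0):
    """Label an F0 contour 'rising', 'falling', or 'flat' from its endpoints."""
    delta = f0_values[-1] - f0_values[0]
    if delta > tolerance_hz:
        return "rising"
    if delta < -tolerance_hz:
        return "falling"
    return "flat"


if __name__ == "__main__":
    # Three 40 ms frames whose pitch steps upward (hypothetical values).
    frames = [synth_tone(f, 640) for f in (160.0, 200.0, 250.0)]
    contour = [estimate_f0(frame) for frame in frames]
    print(contour, contour_direction(contour))
```

The contour direction is only one crude cue. A classifier trained on a dataset like this one would more plausibly combine the full F0 track with spectral features such as the MFCCs mentioned in the abstract, and how contour shapes map to statements, questions, or exclamations in Kurdish is not something this sketch assumes.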

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2c3/12269433/c875b67cd6b3/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2c3/12269433/d962113b5fdd/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2c3/12269433/7cbbb9b0d5b0/gr2.jpg
