Robust fundamental frequency-detection algorithm unaffected by the presence of hoarseness in human voice.

Authors

Kitayama Itsuki, Hosokawa Kiyohito, Iwaki Shinobu, Yoshida Misao, Miyauchi Akira, Kishikawa Toshihiro, Tanaka Hidenori, Tsuda Takeshi, Sato Takashi, Takenaka Yukinori, Ogawa Makoto, Inohara Hidenori

Affiliations

Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine, Osaka 565-0871, Japan.

Department of Otorhinolaryngology, Osaka International Medical & Science Center, Osaka 543-0035, Japan.

Publication information

J Acoust Soc Am. 2024 Dec 1;156(6):4217-4228. doi: 10.1121/10.0034624.

DOI:10.1121/10.0034624
PMID:39718355
Abstract

The fundamental frequency (fo) is pivotal for quantifying vocal-fold characteristics. However, the accuracy of fo estimation in hoarse voices is notably low, and no definitive algorithm for fo estimation has been previously established. In this study, we introduce an algorithm named "Spectral-based fo Estimator Emphasized by Domination and Sequence" (SFEEDS), which enhances the spectrum method, and compare it with conventional estimation methods. We analyzed 454 voice samples and used conventional methods and SFEEDS to calculate fo. The ground truth of fo was determined as the lowest frequency within the most dominant harmonic complex observed on the spectrogram. Subsequently, we assessed the concordance between each fo-estimation method and the fo ground truth. We also examined how the accuracy of these methods varied when analyzing speech with hoarseness. Regardless of hoarseness, fo-estimation accuracy was significantly higher with SFEEDS than with conventional methods. Moreover, whereas the accuracy of the conventional methods degraded in samples with roughness, the SFEEDS algorithm was robust and significantly reduced subharmonic errors. The SFEEDS fo-estimation algorithm accurately estimated the fo of both normal and hoarse voices.
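The concordance check described in the abstract can be illustrated with a minimal sketch. This is not the SFEEDS algorithm and not the authors' evaluation procedure: the function names, the 5% tolerance, and the maximum ratio of 4 are all hypothetical choices made for illustration. It labels an fo estimate as a match, a subharmonic error (estimate near the true fo divided by an integer), a harmonic error (estimate near an integer multiple), or other.

```python
def classify_fo_estimate(estimate_hz, truth_hz, tolerance=0.05, max_ratio=4):
    """Label an fo estimate relative to a ground-truth fo.

    tolerance (relative deviation) and max_ratio are hypothetical
    defaults for illustration; the paper's criteria are not given here.
    """
    if estimate_hz <= 0 or truth_hz <= 0:
        return "invalid"
    allowed = tolerance * truth_hz
    if abs(estimate_hz - truth_hz) <= allowed:
        return "match"
    for k in range(2, max_ratio + 1):
        # Subharmonic error: the estimate sits near truth / k.
        if abs(estimate_hz * k - truth_hz) <= allowed:
            return "subharmonic"
        # Harmonic error: the estimate sits near truth * k.
        if abs(estimate_hz / k - truth_hz) <= allowed:
            return "harmonic"
    return "other"


def concordance_rate(estimates_hz, truths_hz, tolerance=0.05):
    """Fraction of paired estimates labelled "match"."""
    labels = [
        classify_fo_estimate(e, t, tolerance)
        for e, t in zip(estimates_hz, truths_hz)
    ]
    return labels.count("match") / len(labels) if labels else 0.0


if __name__ == "__main__":
    truths = [200.0, 200.0, 200.0, 200.0]
    estimates = [200.0, 100.0, 205.0, 150.0]
    for e, t in zip(estimates, truths):
        print(e, t, classify_fo_estimate(e, t))
    print("concordance:", concordance_rate(estimates, truths))
```

Counting the "subharmonic" labels separately for each method, and separately for rough and non-rough samples, would be one way to express the kind of comparison the abstract reports.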

Similar articles

1
Robust fundamental frequency-detection algorithm unaffected by the presence of hoarseness in human voice.
J Acoust Soc Am. 2024 Dec 1;156(6):4217-4228. doi: 10.1121/10.0034624.
2
An Innovative Voice Analyzer "VA" Smart Phone Program for Quantitative Analysis of Voice Quality.
J Voice. 2019 Sep;33(5):642-648. doi: 10.1016/j.jvoice.2018.01.026. Epub 2018 May 22.
3
Perturbation and hoarseness: a pilot study of six children's voices.
J Voice. 1996 Sep;10(3):252-61. doi: 10.1016/s0892-1997(96)80006-3.
4
Harmonic-intensity analysis of normal and hoarse voices.
J Acoust Soc Am. 1984 Dec;76(6):1648-51. doi: 10.1121/1.391611.
5
A case report in changes in phonatory physiology following voice therapy: application of high-speed imaging.
J Voice. 2012 Nov;26(6):734-41. doi: 10.1016/j.jvoice.2012.01.001. Epub 2012 Jun 19.
6
Pitch Strength as an Outcome Measure for Treatment of Dysphonia.
J Voice. 2017 Nov;31(6):691-696. doi: 10.1016/j.jvoice.2017.01.016. Epub 2017 Mar 17.
7
Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach.
J Voice. 2019 Sep;33(5):634-641. doi: 10.1016/j.jvoice.2018.02.003. Epub 2018 Mar 19.
8
Spectral moment analysis of unilateral vocal fold paralysis.
J Voice. 2011 May;25(3):330-6. doi: 10.1016/j.jvoice.2010.03.006. Epub 2010 Sep 2.
9
New Evidence That Nonlinear Source-Filter Coupling Affects Harmonic Intensity and fo Stability During Instances of Harmonics Crossing Formants.
J Voice. 2017 Mar;31(2):149-156. doi: 10.1016/j.jvoice.2016.04.010. Epub 2016 Aug 5.
10
Objective voice and speech analysis of persons with chronic hoarseness by prosodic analysis of speech samples.
Logoped Phoniatr Vocol. 2016 Oct;41(3):106-16. doi: 10.3109/14015439.2015.1019563. Epub 2015 May 27.

Cited by

1
A multivariate model incorporating subharmonic measurements for evaluating vocal roughness.
NPJ Digit Med. 2025 May 20;8(1):295. doi: 10.1038/s41746-025-01702-2.