• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
ROBUST DETECTION OF VOICED SEGMENTS IN SAMPLES OF EVERYDAY CONVERSATIONS USING UNSUPERVISED HMMS.使用无监督隐马尔可夫模型对日常对话样本中的浊音段进行稳健检测。
SLT Workshop Spok Lang Technol. 2012 Dec;2012:438-442. doi: 10.1109/slt.2012.6424264. Epub 2013 Feb 1.
2
Robust and Accurate Features for Detecting and Diagnosing Autism Spectrum Disorders.用于检测和诊断自闭症谱系障碍的强大且准确的特征
Interspeech. 2013 Aug;2013:191-194.
3
Significance of voiced and unvoiced speech segments for the detection of common cold.有声和无声语音片段对感冒检测的意义。
Signal Image Video Process. 2023;17(5):1785-1792. doi: 10.1007/s11760-022-02389-8. Epub 2022 Nov 15.
4
Symptom Expression Across Voiced Speech Sounds in Adductor Laryngeal Dystonia.内收型喉肌张力障碍中浊语音的症状表现
J Voice. 2025 Mar;39(2):567.e23-567.e30. doi: 10.1016/j.jvoice.2022.10.002. Epub 2022 Nov 21.
5
Quantifying Voice Characteristics for Detecting Autism.量化用于检测自闭症的语音特征。
Front Psychol. 2021 Sep 7;12:665096. doi: 10.3389/fpsyg.2021.665096. eCollection 2021.
6
Waveform Amplitude and Temporal Symmetric/Asymmetric Characteristics of Phoneme and Syllable Segments in the W-1 Spondaic Words Recorded by Four Speakers.由四位说话者记录的W-1双音节词中音素和音节片段的波形幅度及时间对称/不对称特征
J Am Acad Audiol. 2021 Jul;32(7):445-463. doi: 10.1055/s-0041-1730959. Epub 2021 Nov 30.
7
The cricothyroid muscle in voicing control.发声控制中的环甲肌。
J Acoust Soc Am. 1989 Mar;85(3):1314-21. doi: 10.1121/1.397462.
8
INFERRING SOCIAL CONTEXTS FROM AUDIO RECORDINGS USING DEEP NEURAL NETWORKS.使用深度神经网络从音频记录中推断社会背景
IEEE Int Workshop Mach Learn Signal Process. 2014 Sep;2014. doi: 10.1109/MLSP.2014.6958853. Epub 2014 Nov 20.
9
Influence of musical training on understanding voiced and whispered speech in noise.音乐训练对噪声中浊音和低语语音理解的影响。
PLoS One. 2014 Jan 28;9(1):e86980. doi: 10.1371/journal.pone.0086980. eCollection 2014.
10
Investigating the role of harmonic cancellation in speech-on-speech masking.研究谐波抵消在语音掩蔽中的作用。
Hear Res. 2022 Dec;426:108562. doi: 10.1016/j.heares.2022.108562. Epub 2022 Jun 17.

引用本文的文献

1
AUTOMATIC MEASUREMENT OF AFFECTIVE VALENCE AND AROUSAL IN SPEECH.语音中情感效价和唤醒度的自动测量
Proc IEEE Int Conf Acoust Speech Signal Process. 2014 May;2014:965-969. doi: 10.1109/ICASSP.2014.6853740. Epub 2014 Jul 14.
2
Robust and Accurate Features for Detecting and Diagnosing Autism Spectrum Disorders.用于检测和诊断自闭症谱系障碍的强大且准确的特征
Interspeech. 2013 Aug;2013:191-194.
3
INFERRING CLINICAL DEPRESSION FROM SPEECH AND SPOKEN UTTERANCES.从语音和话语中推断临床抑郁症
IEEE Int Workshop Mach Learn Signal Process. 2014 Sep;2014. doi: 10.1109/mlsp.2014.6958856. Epub 2014 Nov 20.

本文引用的文献

1
Personality in its natural habitat: manifestations and implicit folk theories of personality in daily life.自然情境下的人格:日常生活中人格的表现及内隐民间理论
J Pers Soc Psychol. 2006 May;90(5):862-77. doi: 10.1037/0022-3514.90.5.862.

使用无监督隐马尔可夫模型对日常对话样本中的浊音段进行稳健检测。

ROBUST DETECTION OF VOICED SEGMENTS IN SAMPLES OF EVERYDAY CONVERSATIONS USING UNSUPERVISED HMMS.

作者信息

Asgari Meysam, Shafran Izhak, Bayestehtashk Alireza

机构信息

Center for Spoken Language Understanding, OHSU, Portland, OR, USA.

出版信息

SLT Workshop Spok Lang Technol. 2012 Dec;2012:438-442. doi: 10.1109/slt.2012.6424264. Epub 2013 Feb 1.

DOI:10.1109/slt.2012.6424264
PMID:33644784
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7909075/
Abstract

We investigate methods for detecting voiced segments in everyday conversations from ambient recordings. Such recordings contain high diversity of background noise, making it difficult or infeasible to collect representative labelled samples for estimating noise-specific HMM models. The popular utility and its derivatives compute normalized cross-correlation for detecting voiced segments, which unfortunately is sensitive to different types of noise. Exploiting the fact that voiced speech is not just periodic but also rich in harmonic, we model voiced segments by adopting harmonic models, which have recently gained considerable attention. In previous work, the parameters of the model were estimated independently for each frame using maximum likelihood criterion. However, since the distribution of harmonic coefficients depend on articulators of speakers, we estimate the model parameters more robustly using a maximum criterion. We use the likelihood of voicing, computed from the harmonic model, as an observation probability of an HMM and detect speech using this unsupervised HMM. The one caveat of the harmonic model is that they fail to distinguish speech from other stationary harmonic noise. We rectify this weakness by taking advantage of the non-stationary property of speech. We evaluate our models empirically on a task of detecting speech on a large corpora of everyday speech and demonstrate that these models perform significantly better than standard voice detection algorithm employed in popular tools.

摘要

我们研究了从环境录音中检测日常对话中浊音段的方法。此类录音包含高度多样的背景噪声,使得收集用于估计特定噪声的隐马尔可夫模型(HMM)的代表性标记样本变得困难或不可行。流行的工具及其衍生工具通过计算归一化互相关来检测浊音段,但不幸的是,它对不同类型的噪声很敏感。利用浊音不仅具有周期性而且谐波丰富这一事实,我们采用谐波模型对浊音段进行建模,该模型最近受到了广泛关注。在先前的工作中,使用最大似然准则为每一帧独立估计模型参数。然而,由于谐波系数的分布取决于说话者的发音器官,我们使用最大准则更稳健地估计模型参数。我们将根据谐波模型计算出的浊音似然性用作HMM的观测概率,并使用这种无监督的HMM来检测语音。谐波模型的一个问题是它们无法将语音与其他平稳谐波噪声区分开来。我们利用语音的非平稳特性来纠正这一弱点。我们在一个大型日常语音语料库上的语音检测任务中对我们的模型进行了实证评估,并证明这些模型的性能明显优于流行工具中使用的标准语音检测算法。