• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

语音混合信号的双麦克风分离

Two-microphone separation of speech mixtures.

作者信息

Pedersen Michael Syskind, Wang DeLiang, Larsen Jan, Kjems Ulrik

机构信息

Oticon A/S, Smørum DK-2765, Denmark.

出版信息

IEEE Trans Neural Netw. 2008 Mar;19(3):475-92. doi: 10.1109/TNN.2007.911740.

DOI:10.1109/TNN.2007.911740
PMID:18334366
Abstract

Separation of speech mixtures, often referred to as the cocktail party problem, has been studied for decades. In many source separation tasks, the separation method is limited by the assumption of at least as many sensors as sources. Further, many methods require that the number of signals within the recorded mixtures be known in advance. In many real-world applications, these limitations are too restrictive. We propose a novel method for underdetermined blind source separation using an instantaneous mixing model which assumes closely spaced microphones. Two source separation techniques have been combined, independent component analysis (ICA) and binary time - frequency (T-F) masking. By estimating binary masks from the outputs of an ICA algorithm, it is possible in an iterative way to extract basis speech signals from a convolutive mixture. The basis signals are afterwards improved by grouping similar signals. Using two microphones, we can separate, in principle, an arbitrary number of mixed speech signals. We show separation results for mixtures with as many as seven speech signals under instantaneous conditions. We also show that the proposed method is applicable to segregate speech signals under reverberant conditions, and we compare our proposed method to another state-of-the-art algorithm. The number of source signals is not assumed to be known in advance and it is possible to maintain the extracted signals as stereo signals.

摘要

语音混合分离,通常被称为鸡尾酒会问题,已经被研究了几十年。在许多源分离任务中,分离方法受到传感器数量至少与源数量一样多这一假设的限制。此外,许多方法要求预先知道录制混合信号中的信号数量。在许多实际应用中,这些限制过于严格。我们提出了一种使用瞬时混合模型的欠定盲源分离新方法,该模型假设麦克风间距很近。我们将两种源分离技术相结合,即独立成分分析(ICA)和二进制时频(T-F)掩蔽。通过从ICA算法的输出中估计二进制掩码,可以以迭代方式从卷积混合信号中提取基本语音信号。之后通过对相似信号进行分组来改进基本信号。原则上,使用两个麦克风我们可以分离任意数量的混合语音信号。我们展示了在瞬时条件下多达七个语音信号的混合信号的分离结果。我们还表明所提出的方法适用于在混响条件下分离语音信号,并且我们将所提出的方法与另一种最新算法进行了比较。源信号的数量不假定预先已知,并且可以将提取的信号保持为立体声信号。

相似文献

1
Two-microphone separation of speech mixtures.语音混合信号的双麦克风分离
IEEE Trans Neural Netw. 2008 Mar;19(3):475-92. doi: 10.1109/TNN.2007.911740.
2
Blind source separation and deconvolution: the dynamic component analysis algorithm.盲源分离与反卷积:动态分量分析算法
Neural Comput. 1998 Aug 15;10(6):1373-424.
3
Two-microphone separation of speech mixtures based on interclass variance maximization.基于类间方差最大化的语音混合的双麦克风分离。
J Acoust Soc Am. 2010 Mar;127(3):1661-72. doi: 10.1121/1.3294713.
4
Blind separation of mutually correlated sources using precoders.使用预编码器对相互关联的源进行盲分离。
IEEE Trans Neural Netw. 2010 Jan;21(1):82-90. doi: 10.1109/TNN.2009.2034518. Epub 2009 Nov 24.
5
MISEP method for postnonlinear blind source separation.用于后非线性盲源分离的MISEP方法。
Neural Comput. 2007 Sep;19(9):2557-78. doi: 10.1162/neco.2007.19.9.2557.
6
Initialization method for speech separation algorithms that work in the time-frequency domain.用于在时频域工作的语音分离算法的初始化方法。
J Acoust Soc Am. 2010 Apr;127(4):EL121-6. doi: 10.1121/1.3310248.
7
Estimation of sparse nonnegative sources from noisy overcomplete mixtures using MAP.使用最大后验概率(MAP)从含噪超完备混合信号中估计稀疏非负源。
Neural Comput. 2009 Dec;21(12):3487-518. doi: 10.1162/neco.2009.08-08-846.
8
An algorithm for separation of mixed sparse and Gaussian sources.一种用于分离混合稀疏源和高斯源的算法。
PLoS One. 2017 Apr 17;12(4):e0175775. doi: 10.1371/journal.pone.0175775. eCollection 2017.
9
Blind extraction and localization of sound sources using point sources based approaches.基于点声源的声源盲提取和定位。
J Acoust Soc Am. 2012 Aug;132(2):904-17. doi: 10.1121/1.4726072.
10
Advances in blind source separation (BSS) and independent component analysis (ICA) for nonlinear mixtures.用于非线性混合的盲源分离(BSS)和独立成分分析(ICA)的进展。
Int J Neural Syst. 2004 Oct;14(5):267-92. doi: 10.1142/S012906570400208X.

引用本文的文献

1
Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation.基于双通道余弦函数的稳健语音分离的 ITI 估计。
Sensors (Basel). 2017 Jun 20;17(6):1447. doi: 10.3390/s17061447.
2
Time-frequency masking for speech separation and its potential for hearing aid design.用于语音分离的时频掩蔽及其在助听器设计中的潜力。
Trends Amplif. 2008 Dec;12(4):332-53. doi: 10.1177/1084713808326455. Epub 2008 Oct 30.