• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

语音的分形维数:计算及其在自动语音识别中的应用。

Fractal dimensions of speech sounds: computation and application to automatic speech recognition.

作者信息

Maragos P, Potamianos A

机构信息

Department of Electrical and Computer Engineering, National Technical University of Athens, Greece.

出版信息

J Acoust Soc Am. 1999 Mar;105(3):1925-32. doi: 10.1121/1.426738.

DOI:10.1121/1.426738
PMID:10089613
Abstract

The dynamics of airflow during speech production may often result in some small or large degree of turbulence. In this paper, the geometry of speech turbulence as reflected in the fragmentation of the time signal is quantified by using fractal models. An efficient algorithm for estimating the short-time fractal dimension of speech signals based on multiscale morphological filtering is described, and its potential for speech segmentation and phonetic classification discussed. Also reported are experimental results on using the short-time fractal dimension of speech signals at multiple scales as additional features in an automatic speech-recognition system using hidden Markov models, which provide a modest improvement in speech-recognition performance.

摘要

言语产生过程中的气流动力学常常会导致不同程度的湍流。本文利用分形模型对时间信号碎片化所反映的言语湍流几何特征进行量化。描述了一种基于多尺度形态滤波估计语音信号短时分数维的有效算法,并讨论了其在语音分割和语音分类方面的潜力。还报告了在使用隐马尔可夫模型的自动语音识别系统中,将多尺度语音信号的短时分数维作为附加特征的实验结果,该结果使语音识别性能有适度提高。

相似文献

1
Fractal dimensions of speech sounds: computation and application to automatic speech recognition.语音的分形维数:计算及其在自动语音识别中的应用。
J Acoust Soc Am. 1999 Mar;105(3):1925-32. doi: 10.1121/1.426738.
2
An algorithm for the automatic differentiation between the speech of normals and patients with Friedreich's ataxia based on the short-time fractal dimension.一种基于短时分形维数的用于自动区分正常人语音与弗里德赖希共济失调患者语音的算法。
Comput Biol Med. 1998 Jan;28(1):75-89. doi: 10.1016/s0010-4825(97)00039-5.
3
Automatic speech recognition using a predictive echo state network classifier.使用预测回声状态网络分类器的自动语音识别。
Neural Netw. 2007 Apr;20(3):414-23. doi: 10.1016/j.neunet.2007.04.006. Epub 2007 Apr 29.
4
Structural design of hidden Markov model speech recognizer using multivalued phonetic features: comparison with segmental speech units.
J Acoust Soc Am. 1992 Dec;92(6):3058-67. doi: 10.1121/1.404202.
5
Speech production knowledge in automatic speech recognition.自动语音识别中的语音生成知识。
J Acoust Soc Am. 2007 Feb;121(2):723-42. doi: 10.1121/1.2404622.
6
Investigating unscripted speech: implications for phonetics and phonology.研究自然话语:对语音学和音系学的启示。
Phonetica. 2000 Apr-Dec;57(2-4):85-94. doi: 10.1159/000028464.
7
Acoustic-phonetic features for the automatic classification of fricatives.用于擦音自动分类的声学语音特征。
J Acoust Soc Am. 2001 May;109(5 Pt 1):2217-35. doi: 10.1121/1.1357814.
8
Speech sound representation, perception, and plasticity: a neurophysiologic perceptive.语音表征、感知与可塑性:一种神经生理学视角
Audiol Neurootol. 1998 Mar-Jun;3(2-3):168-82. doi: 10.1159/000013788.
9
Detection of speech landmarks: use of temporal information.
J Acoust Soc Am. 2004 Mar;115(3):1296-305. doi: 10.1121/1.1646400.
10
Hidden Markov model classification of myoelectric signals in speech.语音中肌电信号的隐马尔可夫模型分类
IEEE Eng Med Biol Mag. 2002 Sep-Oct;21(5):143-6. doi: 10.1109/memb.2002.1044184.

引用本文的文献

1
Sound Matrix Shaping of Living Matter: From Macrosystems to Cell Microenvironment, Where Mitochondria Act as Energy Portals in Detecting and Processing Sound Vibrations.活体物质的声矩阵塑形:从宏观系统到细胞微环境,线粒体在探测和处理声振动中充当能量门户。
Int J Mol Sci. 2024 Jun 21;25(13):6841. doi: 10.3390/ijms25136841.
2
A convolutional neural network-based framework for analysis and assessment of non-linguistic sound classification and enhancement for normal hearing and cochlear implant listeners.一种基于卷积神经网络的框架,用于分析和评估正常听力和人工耳蜗听者的非语言声音分类及增强。
J Acoust Soc Am. 2022 Nov;152(5):2720. doi: 10.1121/10.0014955.
3
Sounds Stimulation on In Vitro HL1 Cells: A Pilot Study and a Theoretical Physical Model.
体外 HL1 细胞的电刺激:一项初步研究与理论物理模型。
Int J Mol Sci. 2020 Dec 25;22(1):156. doi: 10.3390/ijms22010156.
4
Scaling and universality in the human voice.人类声音中的标度律和普遍性。
J R Soc Interface. 2015 Apr 6;12(105). doi: 10.1098/rsif.2014.1344.
5
Exploiting nonlinear recurrence and fractal scaling properties for voice disorder detection.利用非线性递归和分形标度特性进行语音障碍检测。
Biomed Eng Online. 2007 Jun 26;6:23. doi: 10.1186/1475-925X-6-23.