• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于声门图的功能性嗓音障碍分类。

Classification of functional voice disorders based on phonovibrograms.

机构信息

Department of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Germany.

出版信息

Artif Intell Med. 2010 May;49(1):51-9. doi: 10.1016/j.artmed.2010.01.001.

DOI:10.1016/j.artmed.2010.01.001
PMID:20138486
Abstract

OBJECTIVE

This work presents a computer-aided method for automatically and objectively classifying individuals with healthy and dysfunctional vocal fold vibration patterns as depicted in clinical high-speed (HS) videos of the larynx.

METHODS

By employing a specialized image segmentation and vocal fold movement visualization technique - namely phonovibrography - a novel set of numerical features is derived from laryngeal HS videos capturing the dynamic behavior and the symmetry of oscillating vocal folds. In order to assess the discriminatory power of the features, a support vector machine is applied to the preprocessed data with regard to clinically relevant diagnostic tasks. Finally, the classification performance of the learned nonlinear models is evaluated to allow for conclusions to be drawn about suitability of features and data resulting from different examination paradigms. As a reference, a second feature set is determined which corresponds to more traditional voice analysis approaches.

RESULTS

For the first time an automatic classification of healthy and pathological voices could be obtained by analyzing the vibratory patterns of vocal folds using phonovibrograms (PVGs). An average classification accuracy of approximately 81% was achieved for 2-class discrimination with PVG features. This exceeds the results obtained through traditional voice analysis features. Furthermore, a relevant influence of phonation frequency on classification accuracy was substantiated by the clinical HS data.

CONCLUSION

The PVG feature extraction and classification approach can be assessed as being promising with regard to the diagnosis of functional voice disorders. The obtained results indicate that an objective analysis of dysfunctional vocal fold vibration can be achieved with considerably high accuracy. Moreover, the PVG classification method holds a lot of potential when it comes to the clinical assessment of voice pathologies in general, as the diagnostic support can be provided to the voice clinician in a timely and reliable manner. Due to the observed interdependency between phonation frequency and classification accuracy, in future comparative studies of HS recordings of oscillating vocal folds homogeneous frequencies should be taken into account during examination.

摘要

目的

本研究提出了一种计算机辅助方法,用于自动且客观地对临床高速(HS)喉视频中显示的健康和功能障碍声带振动模式的个体进行分类。

方法

通过使用专门的图像分割和声带运动可视化技术 - 即声振图 - 从捕获动态行为和振荡声带对称性的 HS 视频中得出一组新的数值特征。为了评估特征的判别能力,使用支持向量机对预处理数据进行分析,以实现与临床相关的诊断任务。最后,评估所学习的非线性模型的分类性能,以得出关于不同检查范式的特征和数据的适用性的结论。作为参考,确定了与更传统的语音分析方法相对应的第二特征集。

结果

首次通过使用声振图分析声带的振动模式,可以实现对健康和病理嗓音的自动分类。使用 PVG 特征进行 2 类判别时,平均分类准确率约为 81%。这超过了通过传统语音分析特征获得的结果。此外,通过临床 HS 数据证实了声频对分类准确性的相关影响。

结论

就功能性嗓音障碍的诊断而言,PVG 特征提取和分类方法具有很大的潜力。所获得的结果表明,可以非常高的准确度实现对功能障碍声带振动的客观分析。此外,当涉及到一般的嗓音病理学的临床评估时,PVG 分类方法具有很大的潜力,因为可以及时可靠地为嗓音临床医生提供诊断支持。由于观察到声频与分类准确性之间的相互依存关系,在未来对振荡声带的 HS 记录进行比较研究时,应在检查期间考虑到同质频率。

相似文献

1
Classification of functional voice disorders based on phonovibrograms.基于声门图的功能性嗓音障碍分类。
Artif Intell Med. 2010 May;49(1):51-9. doi: 10.1016/j.artmed.2010.01.001.
2
A generalized procedure for analyzing sustained and dynamic vocal fold vibrations from laryngeal high-speed videos using phonovibrograms.一种使用声振图从喉部高速视频分析持续和动态声带振动的通用程序。
Artif Intell Med. 2016 Jan;66:15-28. doi: 10.1016/j.artmed.2015.10.002. Epub 2015 Oct 30.
3
Automatic diagnosis of vocal fold paresis by employing phonovibrogram features and machine learning methods.采用声门图特征和机器学习方法自动诊断声带麻痹。
Comput Methods Programs Biomed. 2010 Sep;99(3):275-88. doi: 10.1016/j.cmpb.2010.01.004.
4
Phonovibrogram visualization of entire vocal fold dynamics.整个声带动态的声振图可视化
Laryngoscope. 2008 Apr;118(4):753-8. doi: 10.1097/MLG.0b013e318161f9e1.
5
Phonovibrography: mapping high-speed movies of vocal fold vibrations into 2-D diagrams for visualizing and analyzing the underlying laryngeal dynamics.声振描记术:将声带振动的高速电影映射到二维图表中,以可视化和分析潜在的喉部动力学。
IEEE Trans Med Imaging. 2008 Mar;27(3):300-9. doi: 10.1109/TMI.2007.903690.
6
Vibration parameter extraction from endoscopic image series of the vocal folds.从声带的内镜图像序列中提取振动参数。
IEEE Trans Biomed Eng. 2002 Aug;49(8):773-81. doi: 10.1109/TBME.2002.800755.
7
Assessment of the variability of vocal fold dynamics within and between recordings with high-speed imaging and by phonovibrogram.高速成像和语音图评估声带动力学在记录内和记录间的可变性。
Laryngoscope. 2010 May;120(5):981-7. doi: 10.1002/lary.20832.
8
[Diagnosis of functional voice disorders by using the high speed recording technics].[运用高速记录技术诊断功能性嗓音障碍]
Laryngorhinootologie. 2008 May;87(5):323-30. doi: 10.1055/s-2007-967068. Epub 2007 Nov 29.
9
Variability of normal vocal fold dynamics for different vocal loading in one healthy subject investigated by phonovibrograms.通过声振图研究一名健康受试者在不同发声负荷下正常声带动力学的变异性。
J Voice. 2009 Mar;23(2):175-81. doi: 10.1016/j.jvoice.2007.09.008. Epub 2008 Mar 3.
10
Automatic tracing of vocal-fold motion from high-speed digital images.从高速数字图像中自动追踪声带运动。
IEEE Trans Biomed Eng. 2006 Jul;53(7):1394-400. doi: 10.1109/TBME.2006.873751.

引用本文的文献

1
Machine learning based assessment of hoarseness severity: a multi-sensor approach centered on high-speed videoendoscopy.基于机器学习的声音嘶哑严重程度评估:一种以高速视频内镜检查为核心的多传感器方法。
Front Artif Intell. 2025 Jun 5;8:1601716. doi: 10.3389/frai.2025.1601716. eCollection 2025.
2
The Laryngovibrogram as a normalized spatiotemporal representation of vocal fold dynamics.喉振动图作为声带动力学的标准化时空表征。
Sci Rep. 2025 May 12;15(1):16473. doi: 10.1038/s41598-025-00966-8.
3
GIRAFE: Glottal imaging dataset for advanced segmentation, analysis, and facilitative playbacks evaluation.
GIRAFE:用于高级分割、分析及辅助回放评估的声门成像数据集。
Data Brief. 2025 Feb 8;59:111376. doi: 10.1016/j.dib.2025.111376. eCollection 2025 Apr.
4
Interdependencies between acoustic and high-speed videoendoscopy parameters.声学和高速视频内镜参数之间的相互关系。
PLoS One. 2021 Feb 2;16(2):e0246136. doi: 10.1371/journal.pone.0246136. eCollection 2021.
5
Machine learning based identification of relevant parameters for functional voice disorders derived from endoscopic high-speed recordings.基于机器学习的内窥镜高速录像中功能性嗓音障碍相关参数的识别。
Sci Rep. 2020 Jun 29;10(1):10517. doi: 10.1038/s41598-020-66405-y.
6
Determination of Clinical Parameters Sensitive to Functional Voice Disorders Applying Boosted Decision Stumps.应用增强决策树桩确定对功能性嗓音障碍敏感的临床参数。
IEEE J Transl Eng Health Med. 2020 May 22;8:2100511. doi: 10.1109/JTEHM.2020.2985026. eCollection 2020.
7
Advanced computing solutions for analysis of laryngeal disorders.高级计算解决方案用于分析喉部疾病。
Med Biol Eng Comput. 2019 Nov;57(11):2535-2552. doi: 10.1007/s11517-019-02031-9. Epub 2019 Sep 6.
8
Glottal Gap tracking by a continuous background modeling using inpainting.基于图像修复的连续背景建模进行声门间隙跟踪。
Med Biol Eng Comput. 2017 Dec;55(12):2123-2141. doi: 10.1007/s11517-017-1652-8. Epub 2017 May 27.
9
Voice pathology classification based on High-Speed Videoendoscopy.基于高速视频内镜的嗓音病理学分类
Annu Int Conf IEEE Eng Med Biol Soc. 2015 Aug;2015:735-8. doi: 10.1109/EMBC.2015.7318467.
10
An automatic method to detect and track the glottal gap from high speed videoendoscopic images.一种从高速视频内窥镜图像中检测和跟踪声门间隙的自动方法。
Biomed Eng Online. 2015 Oct 29;14:100. doi: 10.1186/s12938-015-0096-3.