Unger Jakob, Schuster Maria, Hecker Dietmar J, Schick Bernhard, Lohscheller Jörg
Department of Computer Science, Trier University of Applied Sciences, Schneidershof, 54293 Trier, Germany.
Department of Otorhinolaryngology and Head and Neck Surgery, University of Munich, Campus Grosshadern, Marchioninistr. 13, 81366 München, Germany.
Artif Intell Med. 2016 Jan;66:15-28. doi: 10.1016/j.artmed.2015.10.002. Epub 2015 Oct 30.
This work presents a computer-based approach to analyzing the two-dimensional vocal fold dynamics in endoscopic high-speed videos, and constitutes an extension and generalization of a previously proposed wavelet-based procedure. While most approaches are limited to analyzing sustained phonation conditions, the proposed method allows for a clinically adequate analysis of both dynamic and sustained phonation paradigms.
The analysis procedure is based on a spatio-temporal visualization technique, the phonovibrogram, that facilitates the documentation of the visible laryngeal dynamics. From the phonovibrogram, a low-dimensional set of features is computed using a principal component analysis strategy that quantifies vibration pattern type, irregularity, lateral symmetry, and synchronicity as functions of time. Two different test bench data sets are used to validate the approach: (I) 150 healthy and pathologic subjects examined during sustained phonation. (II) 20 healthy and pathologic subjects that were examined twice: during sustained phonation and during a glissando from a low to a higher fundamental frequency. In order to assess the discriminative power of the extracted features, a Support Vector Machine is trained to distinguish between physiologic and pathologic vibrations. The results for sustained phonation sequences are compared to the previous approach. Finally, the classification performance of the stationary analysis procedure is compared to the transient analysis of the glissando maneuver.
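The dimensionality-reduction step described above can be sketched in code. The following is a minimal illustration, not the authors' implementation: phonovibrogram-derived feature vectors are projected onto their leading principal components via a singular value decomposition, yielding a low-dimensional feature set of the kind one would then feed to a Support Vector Machine. The array shapes and the number of retained components are assumptions chosen purely for demonstration.

```python
import numpy as np

# Synthetic stand-in for phonovibrogram feature vectors:
# 150 recordings, each described by 64 raw features (shapes assumed).
rng = np.random.default_rng(0)
X = rng.normal(size=(150, 64))

# Principal component analysis via SVD of the centered data matrix.
Xc = X - X.mean(axis=0)                      # center each feature
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)

k = 8                                        # retain a low-dimensional subspace
Z = Xc @ Vt[:k].T                            # PCA scores: 150 x 8 feature set

# Fraction of total variance captured by the k leading components.
explained = (S[:k] ** 2).sum() / (S ** 2).sum()
print(Z.shape, float(explained))
```

The reduced matrix `Z` plays the role of the low-dimensional feature set; in the paper these features would then be used to train the SVM classifier.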
For the first test bench the proposed procedure outperformed the previous approach (proposed feature set: accuracy: 91.3%, sensitivity: 80%, specificity: 97%, previous approach: accuracy: 89.3%, sensitivity: 76%, specificity: 96%). Comparing the classification performance of the second test bench further corroborates that analyzing transient paradigms provides clear additional diagnostic value (glissando maneuver: accuracy: 90%, sensitivity: 100%, specificity: 80%, sustained phonation: accuracy: 75%, sensitivity: 80%, specificity: 70%).
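The three reported metrics are standard functions of the binary confusion matrix. As a quick consistency check, the counts below assume, purely for illustration, a split of 50 pathologic and 100 healthy subjects in the first test bench; this split is an assumption, chosen because it reproduces the reported percentages exactly.

```python
# Assumed confusion-matrix counts for the 150-subject test bench
# (50 pathologic / 100 healthy is an illustrative assumption).
tp, fn = 40, 10   # pathologic subjects classified correctly / incorrectly
tn, fp = 97, 3    # healthy subjects classified correctly / incorrectly

sensitivity = tp / (tp + fn)                 # true-positive rate
specificity = tn / (tn + fp)                 # true-negative rate
accuracy = (tp + tn) / (tp + fn + tn + fp)   # overall fraction correct

print(sensitivity, specificity, round(accuracy, 3))  # 0.8 0.97 0.913
```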
The incorporation of parameters describing the temporal evolution of vocal fold vibration clearly improves the automatic identification of pathologic vibration patterns. Furthermore, incorporating a dynamic phonation paradigm provides additional valuable information about the underlying laryngeal dynamics that cannot be derived from sustained phonation conditions. The proposed generalized approach provides a better overall classification performance than the previous approach, and hence constitutes a new, advantageous tool for an improved clinical diagnosis of voice disorders.