Tracing vocal fold vibrations using level set segmentation method.

Authors

Shi Tailong, Kim Hyun June, Murry Thomas, Woo Peak, Yan Yuling

Affiliations

Department of Bioengineering, Santa Clara University, Santa Clara, CA, USA.

Department of Otorhinolaryngology, Cornell University, NY, USA.

Publication information

Int J Numer Method Biomed Eng. 2015 Jun;31(6). doi: 10.1002/cnm.2715. Epub 2015 Apr 17.

DOI:10.1002/cnm.2715

PMID:25773889

Abstract

High-speed digital imaging (HSDI) of the larynx can provide important information on vocal fold kinematics. This information may provide a better understanding of the mechanism of phonation and assist the clinical assessment of voice disorders. Automatic tracing of vocal fold vibration is a key step in kinematic analysis and in the correlative characterization of vocal fold vibrations and voice quality in normal and diseased states. In this study, we introduce a new approach for image segmentation and automatic tracing of vocal fold motion that combines the level set method with a motion cue. This approach is applied to videokymogram (VKG)-form images, which are obtained from a sequence of laryngeal images captured using HSDI. To utilize the motion cue for more effective level set based segmentation of the VKG, we first construct a so-called standard deviation (STD) image by mapping a pixel-based measure of temporal intensity dispersion from the initial HSDI sequence. The STD image maps the extent of vocal fold motion; after a threshold operation, a region of interest (ROI) that encloses the vocal fold motion, or glottal region, is identified. The performance and effectiveness of our approach are evaluated using clinical datasets representing both normal and pathological voice conditions.
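
The preprocessing steps named in the abstract (the STD image, the thresholded ROI, and the VKG-form image) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the `(T, H, W)` frame layout, and the relative threshold `rel_threshold` are all assumptions made for illustration, since the abstract does not state how the threshold is chosen. The level set segmentation itself is not shown.

```python
import numpy as np


def std_image(frames):
    """Per-pixel temporal standard deviation of a (T, H, W) grayscale stack."""
    return np.asarray(frames, dtype=np.float64).std(axis=0)


def motion_roi(std_img, rel_threshold=0.5):
    """Bounding box (row0, row1, col0, col1) of high-dispersion pixels.

    rel_threshold is a hypothetical parameter: pixels are kept when their
    STD exceeds rel_threshold * max(STD). Returns None if no pixel moves.
    """
    mask = std_img > rel_threshold * std_img.max()
    rows = np.flatnonzero(mask.any(axis=1))
    cols = np.flatnonzero(mask.any(axis=0))
    if rows.size == 0:
        return None
    # End indices are exclusive so the box can be used directly as a slice.
    return int(rows[0]), int(rows[-1]) + 1, int(cols[0]), int(cols[-1]) + 1


def kymogram(frames, row):
    """VKG-form image: one scan line taken from every frame, shape (T, W)."""
    return np.asarray(frames)[:, row, :]
```

In this sketch, static tissue has near-zero temporal dispersion and falls below the threshold, while pixels swept by the vibrating fold edges stand out. A scan line chosen inside the returned box could then be passed to `kymogram` to build the image on which segmentation would run.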

Similar articles

Automatic tracing of vocal-fold motion from high-speed digital images.

IEEE Trans Biomed Eng. 2006 Jul;53(7):1394-400. doi: 10.1109/TBME.2006.873751.

A generalized procedure for analyzing sustained and dynamic vocal fold vibrations from laryngeal high-speed videos using phonovibrograms.

Artif Intell Med. 2016 Jan;66:15-28. doi: 10.1016/j.artmed.2015.10.002. Epub 2015 Oct 30.

A portable high-speed camera system for vocal fold examinations.

J Voice. 2014 Nov;28(6):681-7. doi: 10.1016/j.jvoice.2014.04.002. Epub 2014 Jul 5.

Analysis of vocal-fold vibrations from high-speed laryngeal images using a Hilbert transform-based methodology.

J Voice. 2005 Jun;19(2):161-75. doi: 10.1016/j.jvoice.2004.04.006.

Investigation of the Immediate Effects of Humming on Vocal Fold Vibration Irregularity Using Electroglottography and High-speed Laryngoscopy in Patients With Organic Voice Disorders.

J Voice. 2017 Jan;31(1):48-56. doi: 10.1016/j.jvoice.2016.03.010. Epub 2016 May 10.

Flexible Fiber-Optic High-Speed Imaging of Vocal Fold Vibration: A Preliminary Report.

J Voice. 2017 Mar;31(2):175-181. doi: 10.1016/j.jvoice.2016.07.015.

Quantitative assessment of videolaryngostroboscopic images in patients with glottic pathologies.

Logoped Phoniatr Vocol. 2017 Jul;42(2):73-83. doi: 10.3109/14015439.2016.1174293. Epub 2016 May 2.

Graphical evaluation of vocal fold vibratory patterns by high-speed videolaryngoscopy.

J Voice. 2014 Jan;28(1):106-11. doi: 10.1016/j.jvoice.2013.07.014. Epub 2013 Nov 22.

A Spatiotemporal Approach to the Objective Analysis of Initiation and Termination of Vocal-fold Oscillation With High-speed Videoendoscopy.

J Voice. 2016 Nov;30(6):756.e21-756.e30. doi: 10.1016/j.jvoice.2015.09.007. Epub 2015 Nov 30.

Cited by

Deep-Learning-Based Representation of Vocal Fold Dynamics in Adductor Spasmodic Dysphonia during Connected Speech in High-Speed Videoendoscopy.

J Voice. 2025 Mar;39(2):570.e1-570.e15. doi: 10.1016/j.jvoice.2022.08.022. Epub 2022 Sep 23.

A Deep Learning Approach for Quantifying Vocal Fold Dynamics During Connected Speech Using Laryngeal High-Speed Videoendoscopy.

J Speech Lang Hear Res. 2022 Jun 8;65(6):2098-2113. doi: 10.1044/2022_JSLHR-21-00540. Epub 2022 May 23.

Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings.

Sensors (Basel). 2022 Feb 23;22(5):1751. doi: 10.3390/s22051751.

LARNet-STC: Spatio-temporal orthogonal region selection network for laryngeal closure detection in endoscopy videos.

Comput Biol Med. 2022 May;144:105339. doi: 10.1016/j.compbiomed.2022.105339. Epub 2022 Feb 28.

A Hybrid Machine-Learning-Based Method for Analytic Representation of the Vocal Fold Edges during Connected Speech.

Appl Sci (Basel). 2021 Feb;11(3). doi: 10.3390/app11031179. Epub 2021 Jan 27.

Spatial Segmentation for Laryngeal High-Speed Videoendoscopy in Connected Speech.

J Voice. 2023 Jan;37(1):26-36. doi: 10.1016/j.jvoice.2020.10.017. Epub 2020 Nov 27.

Advanced computing solutions for analysis of laryngeal disorders.

Med Biol Eng Comput. 2019 Nov;57(11):2535-2552. doi: 10.1007/s11517-019-02031-9. Epub 2019 Sep 6.
