Uncertainty of Spatial Segmentation of High-Speed Videoendoscopy and Its Temporal and Spatial Dependency.

Authors

Ghasemzadeh Hamzeh, Powell Maria E, Ford David S, Deliyski Dimitar D

Affiliations

Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, MA; Department of Surgery, Harvard Medical School, Boston, MA; Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, MI.

Department of Otolaryngology-Head and Neck Surgery, Vanderbilt University Medical Center, Nashville, TN.

Publication information

J Voice. 2025 Mar 28. doi: 10.1016/j.jvoice.2025.03.007.

Abstract

OBJECTIVE

Spatial segmentation of high-speed videoendoscopy (HSV) is the process that detects the edges of the vocal folds and represents them in analytic form. The level of spatial segmentation uncertainty (ie, how close vs. far apart different experts marked the edges of the vocal folds) can have a great impact on the level of uncertainty of the final measures (ie, their dispersion). This study quantified the uncertainty of spatial segmentation and investigated its dependency on the phase of the glottal cycle and the location of vocal fold edges along the anterior-posterior direction.

METHOD

Three experts manually segmented the vocal fold edges of twelve HSV recordings using an iterative process consisting of an initial segmentation followed by a blinded reconciliation phase. Segmentation uncertainty was computed as the distance in pixels between the three segmented edges at the end of the iterative process. The relationships between segmentation uncertainty and different sections of the glottis along the anterior-posterior direction, and between segmentation uncertainty and different phases of the glottal cycle, were quantified.
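
As a rough illustration of the uncertainty measure described above, the sketch below assumes each expert's edge is stored as one horizontal pixel coordinate per image row, and takes the mean absolute pairwise difference among the three experts. The abstract does not give the exact distance definition, so the edge representation, the function name `segmentation_uncertainty`, and the sample values are all assumptions.

```python
from itertools import combinations
from statistics import mean


def segmentation_uncertainty(edges):
    """Mean absolute pairwise distance (pixels) between experts' edges, per row.

    `edges` holds one list per expert; each list gives the x-coordinate
    (pixels) of the marked edge for each image row along the
    anterior-posterior axis. The pairwise-mean definition is an assumption
    made for illustration, not the formula used in the study.
    """
    n_rows = len(edges[0])
    if any(len(e) != n_rows for e in edges):
        raise ValueError("all experts must mark the same rows")
    per_row = []
    for row in range(n_rows):
        xs = [e[row] for e in edges]
        # Distance between every pair of experts on this row.
        dists = [abs(a - b) for a, b in combinations(xs, 2)]
        per_row.append(mean(dists))
    return per_row


# Hypothetical markings of one edge by three experts over four rows.
expert_a = [100, 102, 104, 103]
expert_b = [101, 102, 106, 103]
expert_c = [100, 105, 104, 103]

per_row = segmentation_uncertainty([expert_a, expert_b, expert_c])
frame_uncertainty = mean(per_row)  # one summary value for this frame
```

Keeping the per-row values, rather than only the frame average, is what would allow uncertainty to be compared across anterior, middle, and posterior sections.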

RESULTS

Segmentation uncertainties of the anterior and posterior sections of the glottis were significantly higher than that of the middle section; uncertainty was highest in the anterior section, where it was 40% larger than in the middle section. The average segmentation uncertainty and normalized glottal area were positively correlated. Segmentation uncertainty of the most open glottal configuration was 31% larger than that of the most closed glottal configuration.
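
The comparisons reported above reduce to two simple computations: a relative difference between group means, and a correlation between per-frame uncertainty and normalized glottal area. The sketch below shows both on made-up numbers. The values, the helper names, and the choice of Pearson correlation are assumptions for illustration; they do not reproduce the study's data or its statistical tests.

```python
from math import sqrt
from statistics import mean


def percent_larger(a, b):
    """How much larger a is than b, expressed as a percentage of b."""
    return 100.0 * (a - b) / b


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    var_x = sum((xi - mx) ** 2 for xi in x)
    var_y = sum((yi - my) ** 2 for yi in y)
    return cov / sqrt(var_x * var_y)


# Hypothetical mean uncertainties (pixels) for two glottal sections.
anterior, middle = 1.4, 1.0
gain = percent_larger(anterior, middle)

# Hypothetical per-frame values: normalized glottal area vs. uncertainty.
area = [0.0, 0.25, 0.5, 0.75, 1.0]
uncertainty = [1.0, 1.1, 1.15, 1.25, 1.3]
r = pearson_r(area, uncertainty)  # positive when uncertainty grows with area
```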

CONCLUSION

The uncertainty of spatial segmentation of the vocal fold edges depends on the phase of the glottal cycle and the location of the edge along the anterior-posterior direction; hence, different HSV measures are expected to have different levels of uncertainty. The implications of these findings for vocal fold velocity measures are discussed. Additionally, the findings from this study could provide direction for future automated spatial segmentation methods and for creating a robust and reliable automated HSV processing pipeline.
