Suppr超能文献

3D VOSNet:用于分割喉部内窥镜图像并随后生成指标

3D VOSNet: Segmentation of endoscopic images of the larynx with subsequent generation of indicators.

作者信息

Chen I-Miao, Yeh Pin-Yu, Hsieh Ya-Chu, Chang Ting-Chi, Shih Samantha, Shen Wen-Fang, Chin Chiun-Li

机构信息

Department of Medical Informatics, Chung Shan Medical University, Taichung, Taiwan.

Morrison Academy Taichung, Taichung, Taiwan.

出版信息

Heliyon. 2023 Mar 3;9(3):e14242. doi: 10.1016/j.heliyon.2023.e14242. eCollection 2023 Mar.

Abstract

Video laryngoscope is available for visualizing the motion of vocal cords and aid in the assessment of analyzing the larynx-related lesion preliminarily. Laryngeal Electromyography (EMG) needs to be performed to diagnose the factors of vocal cord paralysis, which may cause patient feeling unwell. Thus, the problem is the lack of credible larynx indicators to evaluate larynx-related diseases in the department of otolaryngology. Therefore, this paper aims to propose a 3D VOSNet model, which has the characteristics of sequence segmentation to extract the time-series features in the video laryngoscope. The 3D VOSNet model can keep the time-series features of three images before and after of the specific image to achieve translation and occlusion invariance, which explicitly signifies that our model can segment and classify each item in the video of laryngoscopy not affected by extrinsic causes such as shaking or occlusion during laryngoscope. Numerical results revealed that the testing accuracy rates of the glottal, right vocal cord, and the left vocal cord are 89.91%, 94.63%, and 93.48%, respectively. Our proposed model can segment glottal and vocal cords from the sequence of laryngoscopy. Finally, using the proposed algorithm computes six larynx indicators, which are the area of the glottal, area of vocal cords, length of vocal cords, deviation of length of vocal cords, and symmetry of the vocal cords. In order to assist otolaryngologists in staying credible and objective when making decisions any doubt during diagnosis and also explaining the clinical symptoms of the larynx such as vocal cord paralysis to patients after diagnosis, our proposed algorithm provides otolaryngologists with explainable indicators (X-indicators).

摘要

视频喉镜可用于观察声带运动,并有助于初步分析喉部相关病变。需要进行喉肌电图(EMG)来诊断可能导致患者不适的声带麻痹因素。因此,问题在于耳鼻喉科缺乏可靠的喉部指标来评估喉部相关疾病。因此,本文旨在提出一种3D VOSNet模型,该模型具有序列分割的特点,可提取视频喉镜中的时间序列特征。3D VOSNet模型可以保留特定图像前后三幅图像的时间序列特征,以实现平移和遮挡不变性,这明确表明我们的模型可以对喉镜检查视频中的每个项目进行分割和分类,不受喉镜检查期间摇晃或遮挡等外在因素的影响。数值结果显示,声门、右侧声带和左侧声带的测试准确率分别为89.91%、94.63%和93.48%。我们提出的模型可以从喉镜检查序列中分割出声门和声带。最后,使用所提出的算法计算六个喉部指标,即声门面积、声带面积、声带长度、声带长度偏差和声带对称性。为了帮助耳鼻喉科医生在诊断时做出决策时保持可靠和客观,并且在诊断后向患者解释喉部的临床症状,如声带麻痹,我们提出的算法为耳鼻喉科医生提供了可解释的指标(X指标)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a17e/10009724/36bff4d3f848/gr1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验