LARNet-STC：用于内窥镜视频中声门闭合检测的时空正交区域选择网络。

LARNet-STC: Spatio-temporal orthogonal region selection network for laryngeal closure detection in endoscopy videos.

机构信息

Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, 65211, Missouri, USA.

Department of Otolaryngology - Head and Neck Surgery, University of Missouri, Columbia, 65211, Missouri, USA.

出版信息

Comput Biol Med. 2022 May;144:105339. doi: 10.1016/j.compbiomed.2022.105339. Epub 2022 Feb 28.

DOI:10.1016/j.compbiomed.2022.105339

PMID:35263687

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8995389/

Abstract

The vocal folds (VFs) are a pair of muscles in the larynx that play a critical role in breathing, swallowing, and speaking. VF function can be adversely affected by various medical conditions including head or neck injuries, stroke, tumor, and neurological disorders. In this paper, we propose a deep learning system for automated detection of laryngeal adductor reflex (LAR) events in laryngeal endoscopy videos to enable objective, quantitative analysis of VF function. The proposed deep learning system incorporates our novel orthogonal region selection network and temporal context. This network learns to directly map its input to a VF open/close state without first segmenting or tracking the VF region. This one-step approach drastically reduces manual annotation needs from labor-intensive segmentation masks or VF motion tracks to frame-level class labels. The proposed spatio-temporal network with an orthogonal region selection subnetwork allows integration of local image features, global image features, and VF state information in time for robust LAR event detection. The proposed network is evaluated against several network variations that incorporate temporal context and is shown to lead to better performance. The experimental results show promising performance for automated, objective, and quantitative analysis of LAR events from laryngeal endoscopy videos with over 90% and 99% F1 scores for LAR and non-LAR frames respectively.

摘要

声带是喉部的一对肌肉，在呼吸、吞咽和说话中起着至关重要的作用。声带功能可能会受到各种医疗状况的影响，包括头部或颈部受伤、中风、肿瘤和神经紊乱。在本文中，我们提出了一种深度学习系统，用于自动检测喉内收反射（LAR）事件的喉内窥镜视频，以实现对声带功能的客观、定量分析。所提出的深度学习系统结合了我们新颖的正交区域选择网络和时间上下文。该网络学会直接将其输入映射到声带打开/关闭状态，而无需首先对声带区域进行分割或跟踪。这种一步到位的方法大大减少了手动注释的需求，从劳动密集型分割掩模或声带运动轨迹到帧级别的类别标签。具有正交区域选择子网络的提出的时空网络允许在时间上集成局部图像特征、全局图像特征和声带状态信息，以实现稳健的 LAR 事件检测。所提出的网络针对几种结合时间上下文的网络变体进行了评估，并显示出更好的性能。实验结果表明，该网络在自动、客观和定量分析喉内窥镜视频中的 LAR 事件方面具有有前景的性能，对于 LAR 和非-LAR 帧，其 F1 得分分别超过 90%和 99%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b17/8995389/a7722f480765/nihms-1786290-f0001.jpg

相似文献

LARNet-STC: Spatio-temporal orthogonal region selection network for laryngeal closure detection in endoscopy videos.LARNet-STC：用于内窥镜视频中声门闭合检测的时空正交区域选择网络。

Comput Biol Med. 2022 May;144:105339. doi: 10.1016/j.compbiomed.2022.105339. Epub 2022 Feb 28.

Orthogonal Region Selection Network for Laryngeal Closure Detection in Laryngoscopy Videos.用于喉镜视频中喉关闭检测的正交区域选择网络

Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:2167-2172. doi: 10.1109/EMBC44109.2020.9176149.

Advancing Laryngeal Adductor Reflex Testing Beyond Sensory Threshold Detection.推进喉内收肌反射测试超越感觉阈检测。

Dysphagia. 2022 Oct;37(5):1151-1171. doi: 10.1007/s00455-021-10374-5. Epub 2021 Oct 22.

Fully automatic segmentation of glottis and vocal folds in endoscopic laryngeal high-speed videos using a deep Convolutional LSTM Network.使用深度卷积长短期记忆网络对喉内窥镜高速视频中的声门和声带进行全自动分割。

PLoS One. 2020 Feb 10;15(2):e0227791. doi: 10.1371/journal.pone.0227791. eCollection 2020.

The short-latency R1 response of the electrical laryngeal adductor reflex contributes to airway protection by initiating glottic closure.喉内收肌电反射的短潜伏期 R1 反应通过启动声门闭合来辅助气道保护。

Clin Neurophysiol. 2021 Dec;132(12):3160-3165. doi: 10.1016/j.clinph.2021.09.017. Epub 2021 Oct 22.

Improving the Utility of Laryngeal Adductor Reflex Testing: A Translational Tale of Mice and Men.提高喉内收肌反射测试的效用：一个关于小鼠与人类的转化故事

Otolaryngol Head Neck Surg. 2015 Jul;153(1):94-101. doi: 10.1177/0194599815578103. Epub 2015 Apr 1.

Computational Analysis of the Droplet-Stimulated Laryngeal Adductor Reflex in High-Speed Sequences.高速序列中液滴刺激喉内收肌反射的计算分析。

Laryngoscope. 2022 Dec;132(12):2412-2419. doi: 10.1002/lary.30041. Epub 2022 Feb 8.

Human laryngeal sensory receptor mapping illuminates the mechanisms of laryngeal adductor reflex control.人类喉感觉受体图谱揭示了喉内收肌反射控制的机制。

Laryngoscope. 2018 Nov;128(11):E365-E370. doi: 10.1002/lary.27248. Epub 2018 Sep 8.

[Laryngeal adduction reflex].[喉内收反射]

Laryngorhinootologie. 2014 Jul;93(7):446-9. doi: 10.1055/s-0034-1370928. Epub 2014 Jul 7.

Relationship Between Laryngeal Sensory Deficits, Aspiration, and Pneumonia in Patients with Dysphagia.吞咽困难患者喉感觉功能障碍、误吸与肺炎之间的关系

Dysphagia. 2018 Apr;33(2):192-199. doi: 10.1007/s00455-017-9845-8. Epub 2017 Sep 2.

引用本文的文献

Minimally Invasive Murine Laryngoscopy for Close-Up Imaging of Laryngeal Motion during Breathing and Swallowing.微创小鼠喉镜检查术用于在呼吸和吞咽期间近距离观察喉部运动。

J Vis Exp. 2023 Dec 1(202). doi: 10.3791/66089.

本文引用的文献

Advancing Laryngeal Adductor Reflex Testing Beyond Sensory Threshold Detection.推进喉内收肌反射测试超越感觉阈检测。

Dysphagia. 2022 Oct;37(5):1151-1171. doi: 10.1007/s00455-021-10374-5. Epub 2021 Oct 22.

A comprehensive review of image analysis methods for microorganism counting: from classical image processing to deep learning approaches.微生物计数图像分析方法综述：从经典图像处理到深度学习方法

Artif Intell Rev. 2022;55(4):2875-2944. doi: 10.1007/s10462-021-10082-4. Epub 2021 Sep 29.

Review of Deep Learning Based Automatic Segmentation for Lung Cancer Radiotherapy.基于深度学习的肺癌放疗自动分割综述

Front Oncol. 2021 Jul 8;11:717039. doi: 10.3389/fonc.2021.717039. eCollection 2021.

MRI and CT bladder segmentation from classical to deep learning based approaches: Current limitations and lessons.从经典到基于深度学习的 MRI 和 CT 膀胱分割方法：当前的局限性和教训。

Comput Biol Med. 2021 Jul;134:104472. doi: 10.1016/j.compbiomed.2021.104472. Epub 2021 May 18.

Deep learning and medical image processing for coronavirus (COVID-19) pandemic: A survey.用于冠状病毒（COVID-19）大流行的深度学习与医学图像处理：一项综述。

Sustain Cities Soc. 2021 Feb;65:102589. doi: 10.1016/j.scs.2020.102589. Epub 2020 Nov 5.

Orthogonal Region Selection Network for Laryngeal Closure Detection in Laryngoscopy Videos.用于喉镜视频中喉关闭检测的正交区域选择网络

Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:2167-2172. doi: 10.1109/EMBC44109.2020.9176149.

CINENet: deep learning-based 3D cardiac CINE MRI reconstruction with multi-coil complex-valued 4D spatio-temporal convolutions.CINENet：基于深度学习的多通道复值 4D 时空卷积的三维心脏 Cine MRI 重建

Sci Rep. 2020 Aug 13;10(1):13710. doi: 10.1038/s41598-020-70551-8.

ECG-based multi-class arrhythmia detection using spatio-temporal attention-based convolutional recurrent neural network.基于心电图的多类心律失常检测：使用基于时空注意力的卷积循环神经网络

Artif Intell Med. 2020 Jun;106:101856. doi: 10.1016/j.artmed.2020.101856. Epub 2020 May 11.

Spatio-temporal deep learning methods for motion estimation using 4D OCT image data.基于 4D-OCT 图像数据的运动估计的时空深度学习方法。

Int J Comput Assist Radiol Surg. 2020 Jun;15(6):943-952. doi: 10.1007/s11548-020-02178-z. Epub 2020 May 22.

PLoS One. 2020 Feb 10;15(2):e0227791. doi: 10.1371/journal.pone.0227791. eCollection 2020.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验