Department of Otolaryngology-Head and Neck Surgery, University of Texas Southwestern Medical Center, Dallas, Texas.
Department of Radiation Oncology, University of Texas Southwestern Medical Center, Dallas, Texas.
Laryngoscope. 2021 Feb;131(2):E619-E624. doi: 10.1002/lary.28695. Epub 2020 Apr 18.
OBJECTIVES/HYPOTHESIS: To determine if an automated vestibular schwannoma (VS) segmentation model has comparable performance to using the greatest linear dimension to detect growth.
Case-control Study.
Patients were selected from an internal database who had an initial gadolinium-enhanced T1-weighted magnetic resonance imaging scan and a follow-up scan captured at least 5 months later. Two observers manually segmented the VS to compute volumes, and one observer's segmentations were used to train a convolutional neural network model to automatically segment the VS and determine the volume. The results of automatic segmentation were compared to the observer whose measurements were not used in model development to measure agreement. We then examined the sensitivity, specificity, and area under the receiver-operating characteristic curve (AUC) to compare automated volumetric growth detection versus using the greatest linear dimension. Growth detection determined by the external observer's measurements served as the gold standard.
A total of 65 patients and 130 scans were studied. The automated method of segmentation demonstrated excellent agreement with the observer whose measurements were not used for model development for the initial scan (interclass correlational coefficient [ICC] = 0.995; 95% confidence interval [CI]: 0.991-0.997) and follow-up scan (ICC = 0.960; 95% CI: 0.935-0.975). The automated method of segmentation demonstrated increased sensitivity (72.2% vs. 63.9%), specificity (79.3% vs. 69.0%), and AUC (0.822 vs. 0.701) compared to using the greatest linear dimension for growth detection.
In detecting VS growth, a convolutional neural network model outperformed using the greatest linear dimension, demonstrating a potential application of artificial intelligence methods to VS surveillance.
4 Laryngoscope, 131:E619-E624, 2021.
目的/假设:确定自动听神经鞘瘤(VS)分割模型是否具有与使用最大线性尺寸检测生长相当的性能。
病例对照研究。
从内部数据库中选择了初始钆增强 T1 加权磁共振成像扫描和至少 5 个月后捕获的后续扫描的患者。两名观察者手动分割 VS 以计算体积,并且一名观察者的分割被用于训练卷积神经网络模型来自动分割 VS 并确定体积。自动分割的结果与未用于模型开发以进行测量的观察者的结果进行比较,以测量一致性。然后,我们检查了敏感性、特异性和接收者操作特征曲线下的面积(AUC),以比较自动体积生长检测与使用最大线性尺寸的情况。外部观察者的测量确定的生长检测作为金标准。
共研究了 65 名患者和 130 次扫描。分割的自动方法与未用于模型开发的观察者的初始扫描(组内相关系数[ICC] = 0.995;95%置信区间[CI]:0.991-0.997)和随访扫描(ICC = 0.960;95%CI:0.935-0.975)非常吻合。与使用最大线性尺寸进行生长检测相比,分割的自动方法显示出更高的敏感性(72.2% 对 63.9%)、特异性(79.3% 对 69.0%)和 AUC(0.822 对 0.701)。
在检测 VS 生长方面,卷积神经网络模型优于使用最大线性尺寸,这表明人工智能方法在 VS 监测中的潜在应用。
4 级喉镜,131:E619-E624,2021 年。