Wang Yi-Zhong, Birch David G
Retina Foundation of the Southwest, Dallas, TX, United States.
Department of Ophthalmology, University of Texas Southwestern Medical Center, Dallas, TX, United States.
Front Med (Lausanne). 2022 Jul 5;9:932498. doi: 10.3389/fmed.2022.932498. eCollection 2022.
Previously, we have shown the capability of a hybrid deep learning (DL) model that combines a U-Net and a sliding-window (SW) convolutional neural network (CNN) for automatic segmentation of retinal layers from OCT scan images in retinitis pigmentosa (RP). We found that one of the shortcomings of the hybrid model is that it tends to underestimate ellipsoid zone (EZ) width or area, especially when EZ extends toward or beyond the edge of the macula. In this study, we trained the model with additional data which included more OCT scans having extended EZ. We evaluated its performance in automatic measurement of EZ area on SD-OCT volume scans obtained from the participants of the RUSH2A natural history study by comparing the model's performance to the reading center's manual grading.
De-identified Spectralis high-resolution 9-mm 121-line macular volume scans as well as their EZ area measurements by a reading center were transferred from the management center of the RUSH2A study under the data transfer and processing agreement. A total of 86 baseline volume scans from 86 participants of the RUSH2A study were included to evaluate two hybrid models: the original RP240 model trained on 480 mid-line B-scans from 220 patients with retinitis pigmentosa (RP) and 20 participants with normal vision from a single site, and the new RP340 model trained on a revised RP340 dataset which included RP240 dataset plus an additional 200 mid-line B-scans from another 100 patients with RP. There was no overlap of patients between training and evaluation datasets. EZ and apical RPE in each B-scan image were automatically segmented by the hybrid model. EZ areas were determined by interpolating the discrete 2-dimensional B-scan EZ-RPE layer over the scan area. Dice similarity, correlation, linear regression, and Bland-Altman analyses were conducted to assess the agreement between the EZ areas measured by the hybrid model and by the reading center.
For EZ area > 1 mm, average dice coefficients ± SD between the EZ band segmentations determined by the DL model and the manual grading were 0.835 ± 0.132 and 0.867 ± 0.105 for RP240 and RP340 hybrid models, respectively ( < 0.0005; = 51). When compared to the manual grading, correlation coefficients (95% CI) were 0.991 (0.987-0.994) and 0.994 (0.991-0.996) for RP240 and RP340 hybrid models, respectively. Linear regression slopes (95% CI) were 0.918 (0.896-0.940) and 0.995 (0.975-1.014), respectively. Bland-Altman analysis revealed a mean difference ± SD of -0.137 ± 1.131 mm and 0.082 ± 0.825 mm, respectively.
Additional training data improved the hybrid model's performance, especially reducing the bias and narrowing the range of the 95% limit of agreement when compared to manual grading. The close agreement of DL models to manual grading suggests that DL may provide effective tools to significantly reduce the burden of reading centers to analyze OCT scan images. In addition to EZ area, our DL models can also provide the measurements of photoreceptor outer segment volume and thickness to further help assess disease progression and to facilitate the study of structure and function relationship in RP.
此前,我们已经展示了一种混合深度学习(DL)模型的能力,该模型结合了U-Net和滑动窗口(SW)卷积神经网络(CNN),用于从色素性视网膜炎(RP)的光学相干断层扫描(OCT)图像中自动分割视网膜层。我们发现混合模型的一个缺点是它往往会低估椭圆体带(EZ)的宽度或面积,特别是当EZ向黄斑边缘延伸或超出黄斑边缘时。在本研究中,我们用额外的数据训练该模型,这些数据包括更多具有延伸EZ的OCT扫描。我们通过将模型的性能与阅读中心的手动分级进行比较,评估了其在RUSH2A自然病史研究参与者的SD-OCT容积扫描中自动测量EZ面积的性能。
根据数据传输和处理协议,从RUSH2A研究的管理中心转移了去识别化的Spectralis高分辨率9毫米121线黄斑容积扫描及其由阅读中心进行的EZ面积测量。总共纳入了来自RUSH2A研究的86名参与者的86次基线容积扫描,以评估两个混合模型:原始的RP240模型,该模型在来自220名色素性视网膜炎(RP)患者和来自单个站点的20名视力正常参与者的480条中线B扫描上进行训练;以及新的RP340模型,该模型在修订后的RP340数据集上进行训练,该数据集包括RP240数据集以及来自另外100名RP患者的额外200条中线B扫描。训练数据集和评估数据集之间没有患者重叠。混合模型自动分割每个B扫描图像中的EZ和顶端视网膜色素上皮(RPE)。通过在扫描区域内对离散的二维B扫描EZ-RPE层进行插值来确定EZ面积。进行Dice相似性、相关性、线性回归和Bland-Altman分析,以评估混合模型测量的EZ面积与阅读中心测量的EZ面积之间的一致性。
对于EZ面积>1平方毫米,DL模型确定的EZ带分割与手动分级之间的平均Dice相似系数±标准差,RP240和RP340混合模型分别为0.835±0.132和0.867±0.105(<0.0005;n = 51)。与手动分级相比,RP240和RP340混合模型的相关系数(95%置信区间)分别为0.991(0.987 - 0.994)和0.994(0.991 - 0.996)。线性回归斜率(95%置信区间)分别为0.918(0.896 - 0.940)和0.995(0.975 - 1.014)。Bland-Altman分析显示平均差异±标准差分别为-0.137±1.131平方毫米和0.082±0.825平方毫米。
额外的训练数据提高了混合模型的性能,特别是与手动分级相比,减少了偏差并缩小了95%一致性界限的范围。DL模型与手动分级的密切一致性表明,DL可以提供有效的工具,显著减轻阅读中心分析OCT扫描图像的负担。除了EZ面积外,我们的DL模型还可以提供光感受器外段体积和厚度的测量,以进一步帮助评估疾病进展,并促进对RP中结构和功能关系的研究。