Suppr超能文献

肺癌CT图像中多危及器官的观察者间评估与基于深度卷积神经网络的分割差异的初步临床研究

Preliminary Clinical Study of the Differences Between Interobserver Evaluation and Deep Convolutional Neural Network-Based Segmentation of Multiple Organs at Risk in CT Images of Lung Cancer.

作者信息

Zhu Jinhan, Liu Yimei, Zhang Jun, Wang Yixuan, Chen Lixin

机构信息

State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China.

出版信息

Front Oncol. 2019 Jul 5;9:627. doi: 10.3389/fonc.2019.00627. eCollection 2019.

Abstract

In this study, publicly datasets with organs at risk (OAR) structures were used as reference data to compare the differences of several observers. Convolutional neural network (CNN)-based auto-contouring was also used in the analysis. We evaluated the variations among observers and the effect of CNN-based auto-contouring in clinical applications. A total of 60 publicly available lung cancer CT with structures were used; 48 cases were used for training, and the other 12 cases were used for testing. The structures of the datasets were used as reference data. Three observers and a CNN-based program performed contouring for 12 testing cases, and the 3D dice similarity coefficient (DSC) and mean surface distance (MSD) were used to evaluate differences from the reference data. The three observers edited the CNN-based contours, and the results were compared to those of manual contouring. A value of P<0.05 was considered statistically significant. Compared to the reference data, no statistically significant differences were observed for the DSCs and MSDs among the manual contouring performed by the three observers at the same institution for the heart, esophagus, spinal cord, and left and right lungs. The 95% confidence interval (CI) and -values of the CNN-based auto-contouring results comparing to the manual results for the heart, esophagus, spinal cord, and left and right lungs were as follows: the DSCs were CNN vs. A: 0.9140.939( = 0.004), 0.7460.808( = 0.002), 0.8660.887( = 0.136), 0.9520.966( = 0.158) and 0.9600.972 ( = 0.136); CNN vs. B: 0.9130.936 ( = 0.002), 0.7450.807 ( = 0.005), 0.8640.894 ( = 0.239), 0.9520.964 ( = 0.308), and 0.9590.971 ( = 0.272); and CNN vs. C: 0.9120.933 ( = 0.004), 0.7480.804( = 0.002), 0.8670.890 ( = 0.530), 0.9520.964 ( = 0.308), and 0.958~0.970 ( = 0.480), respectively. The -values of MSDs are similar to DSCs. The -values of heart and esophagus is smaller than 0.05. No significant differences were found between the edited CNN-based auto-contouring results and the manual results. For the spinal cord, both lungs, no statistically significant differences were found between CNN-based auto-contouring and manual contouring. Further modifications to contouring of the heart and esophagus are necessary. Overall, editing based on CNN-based auto-contouring can effectively shorten the contouring time without affecting the results. CNNs have considerable potential for automatic contouring applications.

摘要

在本研究中,将具有危及器官(OAR)结构的公开数据集用作参考数据,以比较多位观察者之间的差异。分析中还使用了基于卷积神经网络(CNN)的自动轮廓描绘。我们评估了观察者之间的差异以及基于CNN的自动轮廓描绘在临床应用中的效果。总共使用了60例具有结构的公开可用肺癌CT;48例用于训练,另外12例用于测试。数据集的结构用作参考数据。三位观察者和一个基于CNN的程序对12个测试病例进行轮廓描绘,使用三维骰子相似系数(DSC)和平均表面距离(MSD)来评估与参考数据的差异。三位观察者编辑基于CNN的轮廓,并将结果与手动轮廓描绘的结果进行比较。P值<0.05被认为具有统计学意义。与参考数据相比,同一机构的三位观察者对心脏、食管、脊髓以及左右肺进行手动轮廓描绘时,DSC和MSD均未观察到统计学上的显著差异。基于CNN的自动轮廓描绘结果与心脏、食管、脊髓以及左右肺的手动结果比较的95%置信区间(CI)和P值如下:心脏的DSC为CNN与A比较:0.9140.939(P = 0.004),0.7460.808(P = 0.002),0.8660.887(P = 0.136),0.9520.966(P = 0.158)以及0.9600.972(P = 0.136);CNN与B比较:0.9130.936(P = 0.002),0.7450.807(P = 0.005),0.8640.894(P = 0.239),0.9520.964(P = 0.308)以及0.9590.971(P = 0.272);CNN与C比较:0.9120.933(P = 0.004),0.7480.804(P = 0.002),0.8670.890(P = 0.530),0.9520.964(P = 0.308)以及0.958~0.970(P = 0.480)。MSD的P值与DSC相似。心脏和食管的P值小于0.05。基于CNN的自动轮廓描绘编辑结果与手动结果之间未发现显著差异。对于脊髓和双肺,基于CNN的自动轮廓描绘与手动轮廓描绘之间未发现统计学上的显著差异。心脏和食管的轮廓描绘需要进一步修改。总体而言,基于CNN的自动轮廓描绘编辑可有效缩短轮廓描绘时间且不影响结果。CNN在自动轮廓描绘应用方面具有相当大的潜力。

相似文献

引用本文的文献

本文引用的文献

1
Generalised Dice Overlap as a Deep Learning Loss Function for Highly Unbalanced Segmentations.广义骰子重叠作为高度不平衡分割的深度学习损失函数
Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2017). 2017;2017:240-248. doi: 10.1007/978-3-319-67558-9_28. Epub 2017 Sep 9.
4
Lung Nodule Detection Deep Reinforcement Learning.肺结节检测 深度强化学习
Front Oncol. 2018 Apr 16;8:108. doi: 10.3389/fonc.2018.00108. eCollection 2018.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验