Department of Radiology, National Cancer Center/Cancer Hospital and Shenzhen Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Shenzhen, China.
Department of Radiology, National Cancer Center/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
Thorac Cancer. 2019 Feb;10(2):183-192. doi: 10.1111/1759-7714.12931. Epub 2018 Dec 8.
The study was conducted to evaluate the performance of a state-of-the-art commercial deep learning-based computer-aided diagnosis (DL-CAD) system for detecting and characterizing pulmonary nodules.
Pulmonary nodules in 346 healthy subjects (male: female = 221:125, mean age 51 years) from a lung cancer screening program conducted from March to November 2017 were screened using a DL-CAD system and double reading independently, and their performance in nodule detection and characterization were evaluated. An expert panel combined the results of the DL-CAD system and double reading as the reference standard.
The DL-CAD system showed a higher detection rate than double reading, regardless of nodule size (86.2% vs. 79.2%; P < 0.001): nodules ≥ 5 mm (96.5% vs. 88.0%; P = 0.008); nodules < 5 mm (84.3% vs. 77.5%; P < 0.001). However, the false positive rate (per computed tomography scan) of the DL-CAD system (1.53, 529/346) was considerably higher than that of double reading (0.13, 44/346; P < 0.001). Regarding nodule characterization, the sensitivity and specificity of the DL-CAD system for distinguishing solid nodules > 5 mm (90.3% and 100.0%, respectively) and ground-glass nodules (100.0% and 96.1%, respectively) were close to that of double reading, but dropped to 55.5% and 93%, respectively, when discriminating part solid nodules.
Our DL-CAD system detected significantly more nodules than double reading. In the future, false positive findings should be further reduced and characterization accuracy improved.
本研究旨在评估一款先进的商业深度学习计算机辅助诊断(DL-CAD)系统在检测和描述肺部结节方面的性能。
对 2017 年 3 月至 11 月期间进行的一项肺癌筛查计划中的 346 名健康受试者(男:女=221:125,平均年龄 51 岁)的肺部结节进行了筛查,使用 DL-CAD 系统和双读独立进行,并评估了其在结节检测和特征描述方面的性能。一个专家小组将 DL-CAD 系统和双读的结果结合起来作为参考标准。
无论结节大小如何(≥5mm:96.5%比 88.0%;P=0.008;<5mm:84.3%比 77.5%;P<0.001),DL-CAD 系统的检测率均高于双读。然而,DL-CAD 系统的假阳性率(每台计算机断层扫描)(1.53,529/346)明显高于双读(0.13,44/346;P<0.001)。在结节特征描述方面,DL-CAD 系统用于区分>5mm 的实性结节(敏感性和特异性分别为 90.3%和 100.0%)和磨玻璃结节(敏感性和特异性分别为 100.0%和 96.1%)的性能与双读相近,但在区分部分实性结节时,敏感性和特异性分别降至 55.5%和 93%。
我们的 DL-CAD 系统比双读检测到更多的结节。在未来,应进一步降低假阳性发现,并提高特征描述的准确性。