Xie H N, Wang N, He M, Zhang L H, Cai H M, Xian J B, Lin M F, Zheng J, Yang Y Z
Department of Ultrasonic Medicine, Fetal Medical Center, First Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, China.
Guangzhou Aiyunji Information Technology Co., Ltd, Guangdong, China.
Ultrasound Obstet Gynecol. 2020 Oct;56(4):579-587. doi: 10.1002/uog.21967.
To evaluate the feasibility of using deep-learning algorithms to classify as normal or abnormal sonographic images of the fetal brain obtained in standard axial planes.
We included in the study images retrieved from a large hospital database from 10 251 normal and 2529 abnormal pregnancies. Abnormal cases were confirmed by neonatal ultrasound, follow-up examination or autopsy. After a series of pretraining data processing steps, 15 372 normal and 14 047 abnormal fetal brain images in standard axial planes were obtained. These were divided into training and test datasets (at case level rather than image level), at a ratio of approximately 8:2. The training data were used to train the algorithms for three purposes: performance of image segmentation along the fetal skull, classification of the image as normal or abnormal and localization of the lesion. The accuracy was then tested on the test datasets, with performance of segmentation being assessed using precision, recall and Dice's coefficient (DICE), calculated to measure the extent of overlap between human-labeled and machine-segmented regions. We assessed classification accuracy by calculating the sensitivity and specificity for abnormal images. Additionally, for 2491 abnormal images, we determined how well each lesion had been localized by overlaying heat maps created by an algorithm on the segmented ultrasound images; an expert judged these in terms of how satisfactory was the lesion localization by the algorithm, classifying this as having been done precisely, closely or irrelevantly.
Segmentation precision, recall and DICE were 97.9%, 90.9% and 94.1%, respectively. For classification, the overall accuracy was 96.3%. The sensitivity and specificity for identification of abnormal images were 96.9% and 95.9%, respectively, and the area under the receiver-operating-characteristics curve was 0.989 (95% CI, 0.986-0.991). The algorithms located lesions precisely in 61.6% (1535/2491) of the abnormal images, closely in 24.6% (614/2491) and irrelevantly in 13.7% (342/2491).
Deep-learning algorithms can be trained for segmentation and classification of normal and abnormal fetal brain ultrasound images in standard axial planes and can provide heat maps for lesion localization. This study lays the foundation for further research on the differential diagnosis of fetal intracranial abnormalities. Copyright © 2020 ISUOG. Published by John Wiley & Sons Ltd.
评估使用深度学习算法对标准轴位平面获取的胎儿脑超声图像进行正常或异常分类的可行性。
我们纳入了从一家大型医院数据库中检索到的10251例正常妊娠和2529例异常妊娠的图像。异常病例通过新生儿超声、随访检查或尸检得以确诊。经过一系列预训练数据处理步骤后,获得了15372张标准轴位平面的正常胎儿脑图像和14047张异常胎儿脑图像。这些图像按照约8:2的比例被分为训练集和测试集(按病例级别而非图像级别划分)。训练数据用于训练算法以实现三个目的:沿胎儿颅骨进行图像分割、将图像分类为正常或异常以及病变定位。然后在测试集上测试准确性,使用精确率、召回率和Dice系数(DICE)评估分割性能,这些指标用于衡量人工标注区域与机器分割区域之间的重叠程度。我们通过计算异常图像的敏感性和特异性来评估分类准确性。此外,对于2491张异常图像,我们通过将算法创建的热图叠加在分割后的超声图像上来确定每个病变的定位效果;由一位专家根据算法对病变的定位是否令人满意进行判断,将其分类为定位精确、接近或不相关。
分割的精确率、召回率和Dice系数分别为97.9%、90.9%和94.1%。对于分类,总体准确率为96.3%。识别异常图像的敏感性和特异性分别为96.9%和95.9%,受试者工作特征曲线下面积为0.989(95%CI,0.986 - 0.991)。算法在61.6%(1535/2491)的异常图像中精确地定位了病变,在24.6%(614/2491)的图像中定位接近,在13.7%(342/2491)的图像中定位不相关。
深度学习算法可用于训练,以对标准轴位平面的正常和异常胎儿脑超声图像进行分割和分类,并可为病变定位提供热图。本研究为进一步研究胎儿颅内异常的鉴别诊断奠定了基础。版权所有©2020国际妇产科超声学会。由约翰·威利父子有限公司出版。