Ke Jia, Lv Yi, Ma Furong, Du Yali, Xiong Shan, Wang Junchen, Wang Jiang
Department of Otorhinolaryngology-Head and Neck Surgery, Peking University Third Hospital, Peking University, Beijing, China.
School of Mechanical Engineering and Automation, Beihang University, Beijing, China.
Quant Imaging Med Surg. 2023 Mar 1;13(3):1577-1591. doi: 10.21037/qims-22-658. Epub 2023 Feb 10.
Automatic segmentation of temporal bone computed tomography (CT) images is fundamental to image-guided otologic surgery and the intelligent analysis of CT images in the field of otology. This study was conducted to test a convolutional neural network (CNN) model that can automatically segment almost all temporal bone anatomy structures in adult and pediatric CT images.
A dataset comprising 80 annotated CT volumes was collected, of which 40 samples were obtained from adults and 40 from children. A further 60 annotated CT volumes (30 from adults and 30 from children) were used to train the model. The remaining 20 annotated CT volumes were employed to determine the model's generalizability for automatic segmentation. Finally, the Dice coefficient (DC) and average symmetric surface distance (ASSD) were utilized as metrics to evaluate the performance of the CNN model. Two independent-sample -tests were used to compare the test set results of adults and children.
In the adult test set, the mean DC values of all the structures ranged from 0.714 to 0.912, and the ASSD values were less than 0.24 mm for 11 structures. In the pediatric test set, the mean DC values of all the structures ranged from 0.658 to 0.915, and the ASSD values were less than 0.18 mm for 11 structures. There was no statistically significant difference between the adult and child test sets in most temporal bone structures.
Our CNN model shows excellent automatic segmentation performance and good generalizability for both adult and pediatric temporal bone CT images, which can help to advance otologist education, intelligent imaging diagnosis, surgery simulation, application of augmented reality, and preoperative planning for image-guided otology surgery.
颞骨计算机断层扫描(CT)图像的自动分割对于图像引导的耳科手术以及耳科学领域CT图像的智能分析至关重要。本研究旨在测试一种卷积神经网络(CNN)模型,该模型能够自动分割成人和儿童CT图像中几乎所有的颞骨解剖结构。
收集了一个包含80个标注CT容积的数据集,其中40个样本来自成人,40个来自儿童。另外60个标注CT容积(30个来自成人,30个来自儿童)用于训练模型。其余20个标注CT容积用于确定模型自动分割的泛化能力。最后,使用Dice系数(DC)和平均对称表面距离(ASSD)作为指标来评估CNN模型的性能。采用两个独立样本t检验来比较成人和儿童的测试集结果。
在成人测试集中,所有结构的平均DC值范围为0.714至0.912,11个结构的ASSD值小于0.24毫米。在儿童测试集中,所有结构的平均DC值范围为0.658至0.915,11个结构的ASSD值小于0.18毫米。在大多数颞骨结构中,成人和儿童测试集之间没有统计学上的显著差异。
我们的CNN模型在成人和儿童颞骨CT图像上均表现出出色的自动分割性能和良好的泛化能力,这有助于推进耳科医生教育、智能影像诊断、手术模拟、增强现实应用以及图像引导耳科手术的术前规划。