Wu Shaoju, Kurugol Sila, Kleinman Paul K, Ecklund Kirsten, Walters Michele, Connolly Susan A, Johnston Patrick, Tsai Andy
Department of Radiology, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, United States.
Radiol Adv. 2024 Jul 4;1(2):umae018. doi: 10.1093/radadv/umae018. eCollection 2024 Jul.
The classic metaphyseal lesion (CML) is a distinctive fracture highly specific to infant abuse. To increase the size and diversity of the training CML database for automated deep-learning detection of this fracture, we developed a mask conditional diffusion model (MaC-DM) to generate synthetic images with and without CMLs.
To objectively and subjectively assess the synthetic radiographic images with and without CMLs generated by MaC-DM.
For retrospective testing, we randomly chose 100 real images (50 normals and 50 with CMLs; 39 infants, male = 22, female = 17; mean age = 4.1 months; SD = 3.1 months) from an existing distal tibia dataset (177 normal, 73 with CMLs), and generated 100 synthetic distal tibia images via MaC-DM (50 normals and 50 with CMLs). These test images were shown to 3 blinded radiologists. In the first session, radiologists determined if the images were normal or had CMLs. In the second session, they determined if the images were real or synthetic. We analyzed the radiologists' interpretations and employed t-distributed stochastic neighbor embedding technique to analyze the data distribution of the test images.
When presented with the 200 images (100 synthetic, 100 with CMLs), radiologists reliably and accurately diagnosed CMLs (kappa = 0.90, 95% CI = [0.88-0.92]; accuracy = 92%, 95% CI = [89-97]). However, they were inaccurate in differentiating between real and synthetic images (kappa = 0.05, 95% CI = [0.03-0.07]; accuracy = 53%, 95% CI = [49-59]). The t-distributed stochastic neighbor embedding analysis showed substantial differences in the data distribution between normal images and those with CMLs (area under the curve = 0.996, 95% CI = [0.992-1.000], < .01), but minor differences between real and synthetic images (area under the curve = 0.566, 95% CI = [0.486-0.647], = .11).
Radiologists accurately diagnosed images with distal tibial CMLs but were unable to distinguish real from synthetically generated ones, indicating that our generative model could synthesize realistic images. Thus, MaC-DM holds promise as an effective strategy for data augmentation in training machine-learning models for diagnosis of distal tibial CMLs.
经典干骺端病变(CML)是一种对虐待婴儿具有高度特异性的独特骨折。为了增加用于自动深度学习检测这种骨折的训练CML数据库的规模和多样性,我们开发了一种掩码条件扩散模型(MaC-DM)来生成有和没有CML的合成图像。
客观和主观地评估由MaC-DM生成的有和没有CML的合成X线图像。
为进行回顾性测试,我们从现有的胫骨远端数据集中(177例正常,73例有CML)随机选择100张真实图像(50例正常,50例有CML;39名婴儿,男22例,女17例;平均年龄4.1个月;标准差3.1个月),并通过MaC-DM生成100张合成胫骨远端图像(50例正常,50例有CML)。这些测试图像展示给3名不知情的放射科医生。在第一阶段,放射科医生确定图像是正常的还是有CML。在第二阶段,他们确定图像是真实的还是合成的。我们分析了放射科医生的解读,并采用t分布随机邻域嵌入技术分析测试图像的数据分布。
当面对这200张图像(100张合成图像,100张有CML)时,放射科医生能够可靠且准确地诊断出CML(kappa = 0.90,95%可信区间 = [0.88 - 0.92];准确率 = 92%,95%可信区间 = [89 - 97])。然而,他们在区分真实图像和合成图像方面不准确(kappa = 0.05,95%可信区间 = [0.03 - 0.07];准确率 = 53%,95%可信区间 = [49 - 59])。t分布随机邻域嵌入分析显示正常图像和有CML的图像之间的数据分布存在显著差异(曲线下面积 = 0.996,95%可信区间 = [0.992 - 1.000],P <.01),但真实图像和合成图像之间差异较小(曲线下面积 = 0.566,95%可信区间 = [0.486 - 0.647],P =.11)。
放射科医生能够准确诊断出有胫骨远端CML的图像,但无法区分真实图像和合成图像,这表明我们的生成模型可以合成逼真的图像。因此,MaC-DM有望成为一种有效的数据增强策略,用于训练机器学习模型以诊断胫骨远端CML。