Flaharty Kendall A, Chandrasekar Vibha, Castillo Irene J, Duong Dat, Ferreira Carlos R, Hanchard Suzanna Ledgister, Hu Ping, Waikel Rebekah L, Rossignol Francis, Introne Wendy J, Solomon Benjamin D
Medical Genomics Unit, Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA.
Human Biochemical Genetics Section, Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA.
medRxiv. 2025 Mar 12:2025.03.11.25323762. doi: 10.1101/2025.03.11.25323762.
Deep learning (DL) is increasingly used to analyze medical imaging, but is less refined for rare conditions, which require novel pre-processing and analytical approaches. To assess DL in the context of rare diseases, we focused on alkaptonuria (AKU), a rare disorder that affects the spine and involves other sequelae; treatments include the medication nitisinone. Since assessing X-rays to determine disease severity can be a slow, manual process requiring considerable expertise, we aimed to determine whether our DL methods could accurately identify overall spine severity, severity at specific regions of the spine, and whether DL could detect whether patients were receiving nitisinone. We evaluated DL performance versus clinical experts using cervical and lumbar spine radiographs. DL models predicted global severity scores (30-point scale) within 1.72 ± 1.96 points of expert clinician scores for cervical and 2.51 ± 1.96 points for lumbar radiographs. For region-specific metrics, we assessed the degree of narrowing, calcium, and vacuum phenomena at each intervertebral space (IVS). Our model's narrowing scores were within 0.191-0.557 points from clinician scores (6-point scale), calcium was predicted with 78-90% accuracy (present, absent, or disc fusion), while vacuum disc phenomenon predictions were less consistent (41-90%). Intriguingly, DL models predicted nitisinone treatment status with 68-77% accuracy, while expert clinicians appeared unable to discern nitisinone status (51% accuracy) (p = 2.0 × 10). This highlights the potential for DL to augment certain types of clinical assessments in rare disease, as well as identifying occult features like treatment status.
深度学习(DL)越来越多地用于分析医学影像,但对于罕见病的处理还不够精细,罕见病需要新颖的预处理和分析方法。为了评估在罕见病背景下的深度学习,我们聚焦于黑尿症(AKU),这是一种影响脊柱并涉及其他后遗症的罕见疾病;治疗方法包括使用药物尼替西农。由于通过评估X射线来确定疾病严重程度可能是一个缓慢的、需要相当专业知识的手动过程,我们旨在确定我们的深度学习方法是否能够准确识别脊柱的整体严重程度、脊柱特定区域的严重程度,以及深度学习是否能够检测患者是否正在接受尼替西农治疗。我们使用颈椎和腰椎X线片评估了深度学习相对于临床专家的性能。深度学习模型预测的全球严重程度评分(30分制)与专家临床医生对颈椎X线片的评分相差在1.72±1.96分以内,对腰椎X线片的评分相差在2.51±1.96分以内。对于特定区域的指标,我们评估了每个椎间隙(IVS)的狭窄程度、钙化情况和真空现象。我们模型的狭窄评分与临床医生评分(6分制)相差在0.191 - 0.557分以内,钙化预测的准确率为78 - 90%(存在、不存在或椎间盘融合),而真空椎间盘现象的预测一致性较差(41 - 90%)。有趣的是,深度学习模型预测尼替西农治疗状态的准确率为68 - 77%,而专家临床医生似乎无法辨别尼替西农治疗状态(准确率为51%)(p = 2.0×10)。这凸显了深度学习在罕见病中增强某些类型临床评估的潜力,以及识别如治疗状态等隐匿特征的能力。