Venäläinen Mikko S, Biehl Alexander, Holstila Milja, Kuusalo Laura, Elo Laura L
Turku Bioscience Centre, University of Turku and Åbo Akademi University, Turku, Finland.
Department of Medical Physics, Turku University Hospital, Turku, Finland.
Rheumatology (Oxford). 2025 Mar 1;64(3):1068-1076. doi: 10.1093/rheumatology/keae215.
Although deep learning has demonstrated substantial potential in automatic quantification of joint damage in RA, evidence for detecting longitudinal changes at an individual patient level is lacking. Here, we introduce and externally validate our automated RA scoring algorithm (AuRA), and demonstrate its utility for monitoring radiographic progression in a real-world setting.
The algorithm, originally developed during the Rheumatoid Arthritis 2-Dialogue for Reverse Engineering Assessment and Methods (RA2-DREAM) challenge, was trained to predict expert-curated Sharp-van der Heijde total scores in hand and foot radiographs from two previous clinical studies (n = 367). We externally validated AuRA against data (n = 205) from Turku University Hospital and compared the performance against two top-performing RA2-DREAM solutions. Finally, for 54 patients, we extracted additional radiograph sets from another control visit to the clinic (average time interval of 4.6 years).
In the external validation cohort, with a root mean square error (RMSE) of 23.6, AuRA outperformed both top-performing RA2-DREAM algorithms (RMSEs 35.0 and 35.6). The improved performance was explained mostly by lower errors at higher expert-assessed scores. The longitudinal changes predicted by our algorithm were significantly correlated with changes in expert-assessed scores (Pearson's R = 0.74, P < 0.001).
AuRA had the best external validation performance and demonstrated potential for detecting longitudinal changes in joint damage. Available from https://hub.docker.com/r/elolab/aura, our algorithm can easily be applied for automatic detection of radiographic progression in the future, reducing the need for laborious manual scoring.
尽管深度学习在类风湿关节炎(RA)关节损伤的自动量化方面已显示出巨大潜力,但在个体患者层面检测纵向变化的证据仍不足。在此,我们介绍并对外验证了我们的自动化RA评分算法(AuRA),并展示了其在现实环境中监测放射学进展的效用。
该算法最初是在类风湿关节炎二维逆向工程评估与方法对话(RA2-DREAM)挑战赛中开发的,用于根据之前两项临床研究(n = 367)的手部和足部X光片预测专家整理的Sharp-van der Heijde总分。我们使用图尔库大学医院的数据(n = 205)对外验证了AuRA,并将其性能与两个表现最佳的RA2-DREAM解决方案进行了比较。最后,对于54名患者,我们从另一次门诊对照访视中提取了额外的X光片集(平均时间间隔为4.6年)。
在外部验证队列中,AuRA的均方根误差(RMSE)为23.6,优于两个表现最佳的RA2-DREAM算法(RMSE分别为35.0和35.6)。性能的提高主要是由于在专家评估分数较高时误差较低。我们算法预测的纵向变化与专家评估分数的变化显著相关(Pearson相关系数R = 0.74,P < 0.001)。
AuRA具有最佳的外部验证性能,并显示出检测关节损伤纵向变化的潜力。我们的算法可从https://hub.docker.com/r/elolab/aura获取,未来可轻松应用于放射学进展的自动检测,减少繁琐的手动评分需求。