University of Alabama at Birmingham, Birmingham.
Division of Rheumatology, Department of Medicine, Hospital for Special Surgery, New York, New York.
JAMA Netw Open. 2022 Aug 1;5(8):e2227423. doi: 10.1001/jamanetworkopen.2022.27423.
An automated, accurate method is needed for unbiased assessment quantifying accrual of joint space narrowing and erosions on radiographic images of the hands and wrists, and feet for clinical trials, monitoring of joint damage over time, assisting rheumatologists with treatment decisions. Such a method has the potential to be directly integrated into electronic health records.
To design and implement an international crowdsourcing competition to catalyze the development of machine learning methods to quantify radiographic damage in rheumatoid arthritis (RA).
DESIGN, SETTING, AND PARTICIPANTS: This diagnostic/prognostic study describes the Rheumatoid Arthritis 2-Dialogue for Reverse Engineering Assessment and Methods (RA2-DREAM Challenge), which used existing radiographic images and expert-curated Sharp-van der Heijde (SvH) scores from 2 clinical studies (674 radiographic sets from 562 patients) for training (367 sets), leaderboard (119 sets), and final evaluation (188 sets). Challenge participants were tasked with developing methods to automatically quantify overall damage (subchallenge 1), joint space narrowing (subchallenge 2), and erosions (subchallenge 3). The challenge was finished on June 30, 2020.
Scores derived from submitted algorithms were compared with the expert-curated SvH scores, and a baseline model was created for benchmark comparison. Performances were ranked using weighted root mean square error (RMSE). The performance and reproductivity of each algorithm was assessed using Bayes factor from bootstrapped data, and further evaluated with a postchallenge independent validation data set.
The RA2-DREAM Challenge received a total of 173 submissions from 26 participants or teams in 7 countries for the leaderboard round, and 13 submissions were included in the final evaluation. The weighted RMSEs metric showed that the winning algorithms produced scores that were very close to the expert-curated SvH scores. Top teams included Team Shirin for subchallenge 1 (weighted RMSE, 0.44), HYL-YFG (Hongyang Li and Yuanfang Guan) subchallenge 2 (weighted RMSE, 0.38), and Gold Therapy for subchallenge 3 (weighted RMSE, 0.43). Bootstrapping/Bayes factor approach and the postchallenge independent validation confirmed the reproducibility and the estimation concordance indices between final evaluation and postchallenge independent validation data set were 0.71 for subchallenge 1, 0.78 for subchallenge 2, and 0.82 for subchallenge 3.
The RA2-DREAM Challenge resulted in the development of algorithms that provide feasible, quick, and accurate methods to quantify joint damage in RA. Ultimately, these methods could help research studies on RA joint damage and may be integrated into electronic health records to help clinicians serve patients better by providing timely, reliable, and quantitative information for making treatment decisions to prevent further damage.
需要一种自动化、准确的方法来公正评估手部和腕部以及足部 X 光图像上关节间隙狭窄和侵蚀的进展,以便用于临床试验、随时间监测关节损伤、协助风湿病学家做出治疗决策。这种方法有可能直接集成到电子健康记录中。
设计并实施国际众包竞赛,以促进开发用于量化类风湿关节炎 (RA) 放射学损伤的机器学习方法。
设计、设置和参与者:本诊断/预后研究描述了类风湿关节炎 2-对话以进行反向工程评估和方法(RA2-DREAM 挑战赛),该挑战赛使用了来自 2 项临床研究(562 名患者的 674 个 X 光组)的现有 X 光图像和专家 curated 的 Sharp-van der Heijde (SvH) 评分进行训练(367 个)、排行榜(119 个)和最终评估(188 个)。挑战赛参与者的任务是开发自动量化整体损伤(子挑战 1)、关节间隙狭窄(子挑战 2)和侵蚀(子挑战 3)的方法。挑战赛于 2020 年 6 月 30 日结束。
从提交的算法中得出的分数与专家 curated 的 SvH 分数进行了比较,并创建了一个基线模型进行基准比较。使用加权均方根误差 (RMSE) 对性能进行排名。使用来自引导数据的贝叶斯因子评估每个算法的性能和可再现性,并使用挑战赛后的独立验证数据集进行进一步评估。
RA2-DREAM 挑战赛共收到来自 7 个国家的 26 名参与者或团队的 173 份参赛作品,用于排行榜轮次,其中 13 份参赛作品包含在最终评估中。加权 RMSE 指标表明,获奖算法产生的分数与专家 curated 的 SvH 评分非常接近。顶级团队包括 Shirin 团队(子挑战 1 的加权 RMSE,0.44)、HYL-YFG(Hongyang Li 和 Yuanfang Guan)(子挑战 2 的加权 RMSE,0.38)和 Gold Therapy(子挑战 3 的加权 RMSE,0.43)。引导/Bayes 因子方法和挑战赛后的独立验证证实了可再现性,最终评估和挑战赛后的独立验证数据集中的估计一致性指数分别为 0.71(子挑战 1)、0.78(子挑战 2)和 0.82(子挑战 3)。
RA2-DREAM 挑战赛促成了开发可行、快速和准确的方法来量化 RA 中的关节损伤。最终,这些方法可以帮助研究 RA 关节损伤,并可能集成到电子健康记录中,通过提供及时、可靠和定量的信息来帮助临床医生更好地为患者服务,以便做出治疗决策,防止进一步的损伤。