Department of Orthopaedic, Harris Orthopaedics Laboratory, Massachusetts General Hospital, Boston, MA, USA.
Department of Orthopaedic Surgery, Harvard Medical School, Boston, MA, USA.
Med Phys. 2021 May;48(5):2327-2336. doi: 10.1002/mp.14705. Epub 2021 Mar 23.
A crucial step in the preoperative planning for a revision total hip replacement (THR) surgery is the accurate identification of the failed implant design, especially if one or more well-fixed/functioning components are to be retained. Manual identification of the implant design from preoperative radiographic images can be time-consuming and inaccurate, which can ultimately lead to increased operating room time, more complex surgery, and increased healthcare costs.
In this study, we present a novel approach to identifying THR femoral implants' design from plain radiographs using a convolutional neural network (CNN). We evaluated a total of 402 radiographs of nine different THR implant designs including, Accolade II (130 radiographs), Corail (89 radiographs), M/L Taper (31 radiographs), Summit (31 radiographs), Anthology (26 radiographs), Versys (26 radiographs), S-ROM (24 radiographs), Taperloc Standard Offset (24 radiographs), and Taperloc High Offset (21 radiographs). We implemented a transfer learning approach and adopted a DenseNet-201 CNN architecture by replacing the final classifier with nine fully connected neurons. Furthermore, we used saliency maps to explain the CNN decision-making process by visualizing the most important pixels in a given radiograph on the CNN's outcome. We also compared the CNN's performance with three board-certified and fellowship-trained orthopedic surgeons.
The CNN achieved the same or higher performance than at least one of the surgeons in identifying eight of nine THR implant designs and underperformed all of the surgeons in identifying one THR implant design (Anthology). Overall, the CNN achieved a lower Cohen's kappa (0.78) than surgeon 1 (1.00), the same Cohen's kappa as surgeon 2 (0.78), and a slightly higher Cohen's kappa than surgeon 3 (0.76) in identifying all the nine THR implant designs. Furthermore, the saliency maps showed that the CNN generally focused on each implant's unique design features to make a decision. Regarding the time spent performing the implant identification, the CNN accomplished this task in ~0.06 s per radiograph. The surgeon's identification time varied based on the method they utilized. When using their personal experience to identify the THR implant design, they spent negligible time. However, the identification time increased to an average of 8.4 min (standard deviation 6.1 min) per radiograph when they used another identification method (online search, consulting with the orthopedic company representative, and using image atlas), which occurred in about 17% of cases in the test subset (40 radiographs).
CNNs such as the one developed in this study can be used to automatically identify the design of a failed THR femoral implant preoperatively in just a fraction of a second, saving time and in some cases improving identification accuracy.
在翻修全髋关节置换 (THR) 手术的术前规划中,一个关键步骤是准确识别失败的植入物设计,特别是如果要保留一个或多个固定/功能良好的部件。从术前放射图像手动识别植入物设计既耗时又不准确,这最终可能导致手术室时间延长、手术更复杂以及医疗保健成本增加。
在这项研究中,我们提出了一种使用卷积神经网络 (CNN) 从普通 X 光片中识别 THR 股骨植入物设计的新方法。我们总共评估了 402 张来自 9 种不同 THR 植入物设计的 X 光片,包括 Accolade II(130 张 X 光片)、Corail(89 张 X 光片)、M/L 锥形(31 张 X 光片)、Summit(31 张 X 光片)、Anthology(26 张 X 光片)、Versys(26 张 X 光片)、S-ROM(24 张 X 光片)、Taperloc 标准偏移(24 张 X 光片)和 Taperloc 高偏移(21 张 X 光片)。我们采用了迁移学习方法,并通过用九个完全连接的神经元替换最终分类器,采用 DenseNet-201 CNN 架构。此外,我们使用显着性映射通过在 CNN 的结果上可视化给定 X 光片中最重要的像素,来解释 CNN 的决策过程。我们还将 CNN 的性能与三名经过董事会认证和专业培训的骨科医生进行了比较。
在识别 9 种 THR 植入物设计中的 8 种时,CNN 的表现与至少一名外科医生相同或更好,而在识别一种 THR 植入物设计(Anthology)时表现不如所有外科医生。总体而言,与外科医生 1(1.00)相比,CNN 的 Cohen's kappa 略低(0.78),与外科医生 2(0.78)相同,而与外科医生 3(0.76)相比略高。在识别所有 9 种 THR 植入物设计时,用于识别所有 9 种 THR 植入物设计。此外,显着性映射表明,CNN 通常专注于每个植入物的独特设计特征来做出决策。关于执行植入物识别所花费的时间,CNN 大约每幅 X 光片完成任务需要 0.06 秒。外科医生的识别时间取决于他们使用的方法。当使用他们的个人经验来识别 THR 植入物设计时,他们几乎不需要花费时间。但是,当他们使用另一种识别方法(在线搜索、咨询骨科公司代表和使用图像图谱)时,识别时间会增加到平均每幅 X 光片 8.4 分钟(标准差为 6.1 分钟),这种情况发生在测试子集(40 张 X 光片)的大约 17%的病例中。
像本研究中开发的那样的 CNN 可以在短短几分之一秒内自动识别术前失败的 THR 股骨植入物的设计,从而节省时间并在某些情况下提高识别准确性。