University of Pennsylvania, Philadelphia, Pennsylvania.
University of Rochester Medical Center, Rochester, New York.
J Arthroplasty. 2023 Oct;38(10):2004-2008. doi: 10.1016/j.arth.2023.03.039. Epub 2023 Mar 20.
Surgical management of complications following knee arthroplasty demands accurate and timely identification of implant manufacturer and model. Automated image processing using deep machine learning has been previously developed and internally validated; however, external validation is essential prior to scaling clinical implementation for generalizability.
We trained, validated, and externally tested a deep learning system to classify knee arthroplasty systems as one of the 9 models from 4 manufacturers derived from 4,724 original, retrospectively collected anteroposterior plain knee radiographs across 3 academic referral centers. From these radiographs, 3,568 were used for training, 412 for validation, and 744 for external testing. Augmentation was applied to the training set (n = 3,568,000) to increase model robustness. Performance was determined by the area under the receiver operating characteristic curve, sensitivity, specificity, and accuracy. Implant identification processing speed was calculated. The training and testing sets were drawn from statistically different populations of implants (P < .001).
After 1,000 training epochs by the deep learning system, the system discriminated 9 implant models with a mean area under the receiver operating characteristic curve of 0.989, accuracy of 97.4%, sensitivity of 89.2%, and specificity of 99.0% in the external testing dataset of 744 anteroposterior radiographs. The software classified implants at a mean speed of 0.02 seconds per image.
An artificial intelligence-based software for identifying knee arthroplasty implants demonstrated excellent internal and external validation. Although continued surveillance is necessary with implant library expansion, this software represents a responsible and meaningful clinical application of artificial intelligence with immediate potential to globally scale and assist in preoperative planning prior to revision knee arthroplasty.
膝关节置换术后并发症的手术处理需要准确、及时地识别植入物制造商和型号。以前已经开发并内部验证了使用深度学习的自动图像处理;然而,在扩大临床实施范围以实现通用性之前,外部验证是必不可少的。
我们训练、验证和外部测试了一个深度学习系统,以将膝关节置换系统分类为来自 4 个制造商的 9 个模型之一,这些模型源自 3 个学术转诊中心的 4724 张原始、回顾性收集的前后位膝关节平片。从这些射线照片中,使用 3568 张用于训练,412 张用于验证,744 张用于外部测试。对训练集(n=3568000)进行了增强,以提高模型的稳健性。性能通过接受者操作特征曲线下的面积、敏感性、特异性和准确性来确定。计算了植入物识别处理速度。训练集和测试集取自植入物的统计学不同人群(P<.001)。
深度学习系统经过 1000 个训练周期后,该系统在 744 张前后射线照片的外部测试数据集中,以平均接收者操作特征曲线下面积 0.989、准确率 97.4%、敏感性 89.2%和特异性 99.0%的平均值区分了 9 种植入模型。该软件对植入物的分类速度平均为每张图像 0.02 秒。
用于识别膝关节置换植入物的人工智能软件表现出出色的内部和外部验证。尽管随着植入物库的扩展需要继续进行监测,但该软件代表了人工智能的负责任和有意义的临床应用,具有在全球范围内扩展和协助翻修膝关节置换术前规划的即时潜力。