University of Ulsan College of Medicine, Seoul, Korea.
AI Center, Korea University Anam Hospital, Korea University College of Medicine, Seoul, Korea.
PLoS One. 2022 Aug 18;17(8):e0272715. doi: 10.1371/journal.pone.0272715. eCollection 2022.
Artificial intelligence (AI) algorithms have been applied to diagnose temporomandibular disorders (TMDs). However, studies have used different patient selection criteria, disease subtypes, input data, and outcome measures. Resultantly, the performance of the AI models varies.
This study aimed to systematically summarize the current literature on the application of AI technologies for diagnosis of different TMD subtypes, evaluate the quality of these studies, and assess the diagnostic accuracy of existing AI models.
The study protocol was carried out based on the preferred reporting items for systematic review and meta-analysis protocols (PRISMA). The PubMed, Embase, and Web of Science databases were searched to find relevant articles from database inception to June 2022. Studies that used AI algorithms to diagnose at least one subtype of TMD and those that assessed the performance of AI algorithms were included. We excluded studies on orofacial pain that were not directly related to the TMD, such as studies on atypical facial pain and neuropathic pain, editorials, book chapters, and excerpts without detailed empirical data. The risk of bias was assessed using the QUADAS-2 tool. We used Grading of Recommendations, Assessment, Development, and Evaluations (GRADE) to provide certainty of evidence.
A total of 17 articles for automated diagnosis of masticatory muscle disorders, TMJ osteoarthrosis, internal derangement, and disc perforation were included; they were retrospective studies, case-control studies, cohort studies, and a pilot study. Seven studies were subjected to a meta-analysis for diagnostic accuracy. According to the GRADE, the certainty of evidence was very low. The performance of the AI models had accuracy and specificity ranging from 84% to 99.9% and 73% to 100%, respectively. The pooled accuracy was 0.91 (95% CI 0.76-0.99), I2 = 97% (95% CI 0.96-0.98), p < 0.001.
Various AI algorithms developed for diagnosing TMDs may provide additional clinical expertise to increase diagnostic accuracy. However, it should be noted that a high risk of bias was present in the included studies. Also, certainty of evidence was very low. Future research of higher quality is strongly recommended.
人工智能 (AI) 算法已被应用于诊断颞下颌关节紊乱 (TMD)。然而,研究采用了不同的患者选择标准、疾病亚型、输入数据和结果测量方法。因此,AI 模型的性能存在差异。
本研究旨在系统总结目前关于 AI 技术应用于不同 TMD 亚型诊断的文献,评估这些研究的质量,并评估现有 AI 模型的诊断准确性。
研究方案根据系统评价和荟萃分析报告的首选项目 (PRISMA) 进行。从数据库建立到 2022 年 6 月,在 PubMed、Embase 和 Web of Science 数据库中搜索相关文章。纳入使用 AI 算法诊断至少一种 TMD 亚型且评估 AI 算法性能的研究。我们排除了与 TMD 没有直接关系的口腔颌面疼痛研究,例如非典型面部疼痛和神经性疼痛研究、社论、书籍章节和没有详细实证数据的摘录。使用 QUADAS-2 工具评估偏倚风险。我们使用推荐评估、制定与评价分级 (GRADE) 提供证据确定性。
共纳入 17 篇关于咀嚼肌疾病、TMJ 骨关节炎、内部紊乱和盘穿孔的自动诊断的文章;它们是回顾性研究、病例对照研究、队列研究和一项试点研究。有 7 项研究进行了诊断准确性的荟萃分析。根据 GRADE,证据确定性非常低。AI 模型的性能具有 84%至 99.9%的准确性和 73%至 100%的特异性。汇总准确性为 0.91(95%CI 0.76-0.99),I2=97%(95%CI 0.96-0.98),p<0.001。
为诊断 TMD 而开发的各种 AI 算法可能为提高诊断准确性提供额外的临床专业知识。然而,应该注意到,纳入的研究存在较高的偏倚风险。此外,证据确定性非常低。强烈建议开展更高质量的未来研究。