Kazimierczak Wojciech, Jedliński Maciej, Issa Julien, Kazimierczak Natalia, Janiszewska-Olszowska Joanna, Dyszkiewicz-Konwińska Marta, Różyło-Kalinowska Ingrid, Serafin Zbigniew, Orhan Kaan
Department of Radiology and Diagnostic Imaging, Collegium Medicum, Nicolaus Copernicus University in Torun, Jagiellońska 13-15, 85-067 Bydgoszcz, Poland.
Kazimierczak Private Medical Practice, Dworcowa 13/u6a, 85-009 Bydgoszcz, Poland.
J Clin Med. 2024 Jul 10;13(14):4047. doi: 10.3390/jcm13144047.
To systematically review and summarize the existing scientific evidence on the diagnostic performance of artificial intelligence (AI) in assessing cervical vertebral maturation (CVM). This review aimed to evaluate the accuracy and reliability of AI algorithms in comparison to those of experienced clinicians. Comprehensive searches were conducted across multiple databases, including PubMed, Scopus, Web of Science, and Embase, using a combination of Boolean operators and MeSH terms. The inclusion criteria were cross-sectional studies with neural network research, reporting diagnostic accuracy, and involving human subjects. Data extraction and quality assessment were performed independently by two reviewers, with a third reviewer resolving any disagreements. The Quality Assessment of Diagnostic Accuracy Studies (QUADAS)-2 tool was used for bias assessment. Eighteen studies met the inclusion criteria, predominantly employing supervised learning techniques, especially convolutional neural networks (CNNs). The diagnostic accuracy of AI models for CVM assessment varied widely, ranging from 57% to 95%. The factors influencing accuracy included the type of AI model, training data, and study methods. Geographic concentration and variability in the experience of radiograph readers also impacted the results. AI has considerable potential for enhancing the accuracy and reliability of CVM assessments in orthodontics. However, the variability in AI performance and the limited number of high-quality studies suggest the need for further research.
系统回顾和总结关于人工智能(AI)评估颈椎成熟度(CVM)诊断性能的现有科学证据。本综述旨在评估AI算法与经验丰富的临床医生相比的准确性和可靠性。使用布尔运算符和医学主题词(MeSH)相结合的方式,对多个数据库进行全面检索,包括PubMed、Scopus、科学网和Embase。纳入标准为进行神经网络研究的横断面研究,报告诊断准确性且涉及人类受试者。由两名审阅者独立进行数据提取和质量评估,如有分歧则由第三名审阅者解决。使用诊断准确性研究质量评估(QUADAS)-2工具进行偏倚评估。18项研究符合纳入标准,主要采用监督学习技术,尤其是卷积神经网络(CNN)。用于CVM评估的AI模型的诊断准确性差异很大,从57%到95%不等。影响准确性的因素包括AI模型的类型、训练数据和研究方法。X线片阅片者经验的地域集中性和变异性也对结果产生影响。AI在提高正畸中CVM评估的准确性和可靠性方面具有相当大的潜力。然而,AI性能的变异性和高质量研究数量有限表明需要进一步研究。