Getzmann Jonas M, Nulle Kitija, Mennini Cinzia, Viglino Umberto, Serpi Francesca, Albano Domenico, Messina Carmelo, Fusco Stefano, Gitto Salvatore, Sconfienza Luca Maria
IRCCS Istituto Ortopedico Galeazzi, Milan, Italy.
Radiology Department, Riga East Clinical University Hospital, Riga, Latvia.
Insights Imaging. 2025 Aug 9;16(1):173. doi: 10.1186/s13244-025-02046-x.
To analyze the methodological quality of studies on deep learning (DL) in rib fracture imaging with the Must AI Criteria-10 (MAIC-10) checklist, and to report insights and experiences regarding the applicability of the MAIC-10 checklist.
An electronic literature search was conducted on the PubMed database. After selection of articles, three radiologists independently rated the articles according to MAIC-10. Differences of the MAIC-10 score for each checklist item were assessed using the Fleiss' kappa coefficient.
A total of 25 original articles discussing DL applications in rib fracture imaging were identified. Most studies focused on fracture detection (n = 21, 84%). In most of the research papers, internal cross-validation of the dataset was performed (n = 16, 64%), while only six studies (24%) conducted external validation. The mean MAIC-10 score of the 25 studies was 5.63 (SD, 1.84; range 1-8), with the item "clinical need" being reported most consistently (100%) and the item "study design" being most frequently reported incompletely (94.8%). The average inter-rater agreement for the MAIC-10 score was 0.771.
The MAIC-10 checklist is a valid tool for assessing the quality of AI research in medical imaging with good inter-rater agreement. With regard to rib fracture imaging, items such as "study design", "explainability", and "transparency" were often not comprehensively addressed.
AI in medical imaging has become increasingly common. Therefore, quality control systems of published literature such as the MAIC-10 checklist are needed to ensure high quality research output.
Quality control systems are needed for research on AI in medical imaging. The MAIC-10 checklist is a valid tool to assess AI in medical imaging research quality. Checklist items such as "study design", "explainability", and "transparency" are frequently addressed incomprehensively.
使用Must AI标准10(MAIC - 10)清单分析肋骨骨折成像深度学习(DL)研究的方法学质量,并报告有关MAIC - 10清单适用性的见解和经验。
在PubMed数据库进行电子文献检索。文章筛选后,三名放射科医生根据MAIC - 10独立对文章进行评分。使用Fleiss' kappa系数评估每个清单项目的MAIC - 10评分差异。
共识别出25篇讨论DL在肋骨骨折成像中应用的原创文章。大多数研究聚焦于骨折检测(n = 21,84%)。在大多数研究论文中,对数据集进行了内部交叉验证(n = 16,64%),而只有六项研究(24%)进行了外部验证。25项研究的MAIC - 10平均评分为5.63(标准差,1.84;范围1 - 8),其中“临床需求”项目报告最为一致(100%),“研究设计”项目报告不完整的情况最为频繁(94.8%)。MAIC - 10评分的平均评分者间一致性为0.771。
MAIC - 10清单是评估医学成像中人工智能研究质量的有效工具,评分者间一致性良好。关于肋骨骨折成像,“研究设计”“可解释性”和“透明度”等项目往往未得到全面解决。
医学成像中的人工智能已越来越普遍。因此,需要诸如MAIC - 10清单这样的已发表文献质量控制系统来确保高质量的研究成果。
医学成像中人工智能研究需要质量控制系统。MAIC - 10清单是评估医学成像人工智能研究质量的有效工具。“研究设计”“可解释性”和“透明度”等清单项目经常未得到全面解决。