Department of Radiology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Korea.
Yonsei Med J. 2024 Oct;65(10):611-618. doi: 10.3349/ymj.2024.0007.
This study aimed to evaluate the quality of artificial intelligence (AI)/machine learning (ML) studies on hemorrhagic stroke using the Minimum Information for Medical AI Reporting (MINIMAR) and Minimum Information About Clinical Artificial Intelligence Modeling (MI-CLAIM) frameworks to promote clinical application.
PubMed, MEDLINE, and Embase were searched for AI/ML studies on hemorrhagic stroke. Out of the 531 articles found, 29 relevant original research articles were included. MINIMAR and MI-CLAIM scores were assigned by two experienced radiologists to assess the quality of the studies.
We analyzed 29 investigations that utilized AI/ML in the field of hemorrhagic stroke, involving a median of 224.5 patients. The majority of studies focused on diagnostic outcomes using computed tomography scans (89.7%) and were published in computer science journals (48.3%). The overall adherence rates to reporting guidelines, as assessed through the MINIMAR and MI-CLAIM frameworks, were 47.6% and 46.0%, respectively. In MINIMAR, none of the studies reported the socioeconomic status of the patients or how missing values had been addressed. In MI-CLAIM, only two studies applied model-examination techniques to improve model interpretability. Transparency and reproducibility were limited, as only 10.3% of the studies had publicly shared their code. Cohen's kappa between the two radiologists was 0.811 and 0.779 for MINIMAR and MI-CLAIM, respectively.
The overall reporting quality of published AI/ML studies on hemorrhagic stroke is suboptimal. It is necessary to incorporate model examination techniques for interpretability and promote code openness to enhance transparency and increase the clinical applicability of AI/ML studies.
本研究旨在使用最小化医学人工智能报告信息(MINIMAR)和最小化临床人工智能建模信息(MI-CLAIM)框架评估关于出血性中风的人工智能(AI)/机器学习(ML)研究的质量,以促进临床应用。
在 PubMed、MEDLINE 和 Embase 中搜索关于出血性中风的 AI/ML 研究。在 531 篇文章中,有 29 篇相关的原始研究文章被纳入。两位有经验的放射科医生对 MINIMAR 和 MI-CLAIM 评分进行了评估,以评估研究的质量。
我们分析了 29 项利用 AI/ML 研究出血性中风的研究,中位数为 224.5 例患者。大多数研究的重点是使用计算机断层扫描(CT)扫描的诊断结果(89.7%),并发表在计算机科学期刊上(48.3%)。通过 MINIMAR 和 MI-CLAIM 框架评估的报告指南的总体遵守率分别为 47.6%和 46.0%。在 MINIMAR 中,没有一项研究报告了患者的社会经济地位或如何处理缺失值。在 MI-CLAIM 中,只有两项研究应用了模型检查技术来提高模型的可解释性。透明度和可重复性有限,因为只有 10.3%的研究公开分享了他们的代码。两位放射科医生在 MINIMAR 和 MI-CLAIM 上的 Cohen's kappa 值分别为 0.811 和 0.779。
已发表的关于出血性中风的 AI/ML 研究的整体报告质量欠佳。有必要纳入模型检查技术以提高可解释性,并促进代码公开,以提高 AI/ML 研究的透明度和临床适用性。