Suppr超能文献

基于神经影像学的人工智能模型在精神疾病诊断中的偏倚风险评估:系统综述。

Evaluation of Risk of Bias in Neuroimaging-Based Artificial Intelligence Models for Psychiatric Diagnosis: A Systematic Review.

机构信息

School of Psychology, Third Military Medical University, Chongqing, China.

Experimental Research Center for Medical and Psychological Science, Third Military Medical University, Chongqing, China.

出版信息

JAMA Netw Open. 2023 Mar 1;6(3):e231671. doi: 10.1001/jamanetworkopen.2023.1671.

Abstract

IMPORTANCE

Neuroimaging-based artificial intelligence (AI) diagnostic models have proliferated in psychiatry. However, their clinical applicability and reporting quality (ie, feasibility) for clinical practice have not been systematically evaluated.

OBJECTIVE

To systematically assess the risk of bias (ROB) and reporting quality of neuroimaging-based AI models for psychiatric diagnosis.

EVIDENCE REVIEW

PubMed was searched for peer-reviewed, full-length articles published between January 1, 1990, and March 16, 2022. Studies aimed at developing or validating neuroimaging-based AI models for clinical diagnosis of psychiatric disorders were included. Reference lists were further searched for suitable original studies. Data extraction followed the CHARMS (Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modeling Studies) and PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-analyses) guidelines. A closed-loop cross-sequential design was used for quality control. The PROBAST (Prediction Model Risk of Bias Assessment Tool) and modified CLEAR (Checklist for Evaluation of Image-Based Artificial Intelligence Reports) benchmarks were used to systematically evaluate ROB and reporting quality.

FINDINGS

A total of 517 studies presenting 555 AI models were included and evaluated. Of these models, 461 (83.1%; 95% CI, 80.0%-86.2%) were rated as having a high overall ROB based on the PROBAST. The ROB was particular high in the analysis domain, including inadequate sample size (398 of 555 models [71.7%; 95% CI, 68.0%-75.6%]), poor model performance examination (with 100% of models lacking calibration examination), and lack of handling data complexity (550 of 555 models [99.1%; 95% CI, 98.3%-99.9%]). None of the AI models was perceived to be applicable to clinical practices. Overall reporting completeness (ie, number of reported items/number of total items) for the AI models was 61.2% (95% CI, 60.6%-61.8%), and the completeness was poorest for the technical assessment domain with 39.9% (95% CI, 38.8%-41.1%).

CONCLUSIONS AND RELEVANCE

This systematic review found that the clinical applicability and feasibility of neuroimaging-based AI models for psychiatric diagnosis were challenged by a high ROB and poor reporting quality. Particularly in the analysis domain, ROB in AI diagnostic models should be addressed before clinical application.

摘要

重要性

基于神经影像学的人工智能 (AI) 诊断模型在精神病学中已经大量涌现。然而,它们在临床实践中的临床适用性和报告质量(即可行性)尚未得到系统评估。

目的

系统评估基于神经影像学的 AI 模型在精神疾病诊断中的偏倚风险 (ROB) 和报告质量。

证据回顾

检索了 1990 年 1 月 1 日至 2022 年 3 月 16 日期间发表的同行评议的、全文的研究。纳入了旨在开发或验证用于临床诊断精神障碍的基于神经影像学的 AI 模型的研究。进一步搜索了参考文献列表以获取合适的原始研究。数据提取遵循 CHARMS(系统评价中预测模型研究的批判性评价和数据提取清单)和 PRISMA(系统评价和荟萃分析的首选报告项目)指南。采用闭环交叉序列设计进行质量控制。使用 PROBAST(预测模型风险偏倚评估工具)和修改后的 CLEAR(基于图像的人工智能报告评估清单)基准来系统评估 ROB 和报告质量。

发现

共纳入并评估了 517 项研究,涉及 555 个 AI 模型。基于 PROBAST,其中 461 个(83.1%;95%CI,80.0%-86.2%)模型被评为总体 ROB 较高。在分析域中,ROB 特别高,包括样本量不足(555 个模型中有 398 个[71.7%;95%CI,68.0%-75.6%])、模型性能检查不佳(所有模型均缺乏校准检查)以及缺乏处理数据复杂性(555 个模型中有 550 个[99.1%;95%CI,98.3%-99.9%])。没有一个 AI 模型被认为适用于临床实践。AI 模型的整体报告完整性(即报告项目数/总项目数)为 61.2%(95%CI,60.6%-61.8%),技术评估域的完整性最差,为 39.9%(95%CI,38.8%-41.1%)。

结论和相关性

本系统评价发现,基于神经影像学的 AI 模型在精神疾病诊断中的临床适用性和可行性受到 ROB 较高和报告质量较差的挑战。特别是在分析域中,在临床应用之前,应该解决 AI 诊断模型中的 ROB 问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df39/9989906/e1ba0b6736db/jamanetwopen-e231671-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验