Suppr
超能文献

基于神经影像学的人工智能模型在精神疾病诊断中的偏倚风险评估：系统综述。

Evaluation of Risk of Bias in Neuroimaging-Based Artificial Intelligence Models for Psychiatric Diagnosis: A Systematic Review.

机构信息

School of Psychology, Third Military Medical University, Chongqing, China.

Experimental Research Center for Medical and Psychological Science, Third Military Medical University, Chongqing, China.

出版信息

JAMA Netw Open. 2023 Mar 1;6(3):e231671. doi: 10.1001/jamanetworkopen.2023.1671.

DOI:10.1001/jamanetworkopen.2023.1671

PMID:36877519

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9989906/

Abstract

IMPORTANCE

Neuroimaging-based artificial intelligence (AI) diagnostic models have proliferated in psychiatry. However, their clinical applicability and reporting quality (ie, feasibility) for clinical practice have not been systematically evaluated.

OBJECTIVE

To systematically assess the risk of bias (ROB) and reporting quality of neuroimaging-based AI models for psychiatric diagnosis.

EVIDENCE REVIEW

PubMed was searched for peer-reviewed, full-length articles published between January 1, 1990, and March 16, 2022. Studies aimed at developing or validating neuroimaging-based AI models for clinical diagnosis of psychiatric disorders were included. Reference lists were further searched for suitable original studies. Data extraction followed the CHARMS (Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modeling Studies) and PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-analyses) guidelines. A closed-loop cross-sequential design was used for quality control. The PROBAST (Prediction Model Risk of Bias Assessment Tool) and modified CLEAR (Checklist for Evaluation of Image-Based Artificial Intelligence Reports) benchmarks were used to systematically evaluate ROB and reporting quality.

FINDINGS

A total of 517 studies presenting 555 AI models were included and evaluated. Of these models, 461 (83.1%; 95% CI, 80.0%-86.2%) were rated as having a high overall ROB based on the PROBAST. The ROB was particular high in the analysis domain, including inadequate sample size (398 of 555 models [71.7%; 95% CI, 68.0%-75.6%]), poor model performance examination (with 100% of models lacking calibration examination), and lack of handling data complexity (550 of 555 models [99.1%; 95% CI, 98.3%-99.9%]). None of the AI models was perceived to be applicable to clinical practices. Overall reporting completeness (ie, number of reported items/number of total items) for the AI models was 61.2% (95% CI, 60.6%-61.8%), and the completeness was poorest for the technical assessment domain with 39.9% (95% CI, 38.8%-41.1%).

CONCLUSIONS AND RELEVANCE

This systematic review found that the clinical applicability and feasibility of neuroimaging-based AI models for psychiatric diagnosis were challenged by a high ROB and poor reporting quality. Particularly in the analysis domain, ROB in AI diagnostic models should be addressed before clinical application.

摘要

重要性

基于神经影像学的人工智能 (AI) 诊断模型在精神病学中已经大量涌现。然而，它们在临床实践中的临床适用性和报告质量（即可行性）尚未得到系统评估。

目的

系统评估基于神经影像学的 AI 模型在精神疾病诊断中的偏倚风险 (ROB) 和报告质量。

证据回顾

检索了 1990 年 1 月 1 日至 2022 年 3 月 16 日期间发表的同行评议的、全文的研究。纳入了旨在开发或验证用于临床诊断精神障碍的基于神经影像学的 AI 模型的研究。进一步搜索了参考文献列表以获取合适的原始研究。数据提取遵循 CHARMS（系统评价中预测模型研究的批判性评价和数据提取清单）和 PRISMA（系统评价和荟萃分析的首选报告项目）指南。采用闭环交叉序列设计进行质量控制。使用 PROBAST（预测模型风险偏倚评估工具）和修改后的 CLEAR（基于图像的人工智能报告评估清单）基准来系统评估 ROB 和报告质量。

发现

共纳入并评估了 517 项研究，涉及 555 个 AI 模型。基于 PROBAST，其中 461 个（83.1%；95%CI，80.0%-86.2%）模型被评为总体 ROB 较高。在分析域中，ROB 特别高，包括样本量不足（555 个模型中有 398 个[71.7%；95%CI，68.0%-75.6%]）、模型性能检查不佳（所有模型均缺乏校准检查）以及缺乏处理数据复杂性（555 个模型中有 550 个[99.1%；95%CI，98.3%-99.9%]）。没有一个 AI 模型被认为适用于临床实践。AI 模型的整体报告完整性（即报告项目数/总项目数）为 61.2%（95%CI，60.6%-61.8%），技术评估域的完整性最差，为 39.9%（95%CI，38.8%-41.1%）。

结论和相关性

本系统评价发现，基于神经影像学的 AI 模型在精神疾病诊断中的临床适用性和可行性受到 ROB 较高和报告质量较差的挑战。特别是在分析域中，在临床应用之前，应该解决 AI 诊断模型中的 ROB 问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df39/9989906/e1ba0b6736db/jamanetwopen-e231671-g001.jpg

相似文献

Evaluation of Risk of Bias in Neuroimaging-Based Artificial Intelligence Models for Psychiatric Diagnosis: A Systematic Review.

JAMA Netw Open. 2023 Mar 1;6(3):e231671. doi: 10.1001/jamanetworkopen.2023.1671.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Protocol for development of a reporting guideline (TRIPOD-AI) and risk of bias tool (PROBAST-AI) for diagnostic and prognostic prediction model studies based on artificial intelligence.

BMJ Open. 2021 Jul 9;11(7):e048008. doi: 10.1136/bmjopen-2020-048008.

Machine learning models for diabetes management in acute care using electronic medical records: A systematic review.

Int J Med Inform. 2022 Apr 2;162:104758. doi: 10.1016/j.ijmedinf.2022.104758.

Application of Artificial Intelligence in Community-Based Primary Health Care: Systematic Scoping Review and Critical Appraisal.

J Med Internet Res. 2021 Sep 3;23(9):e29839. doi: 10.2196/29839.

Use of artificial intelligence in obstetric and gynaecological diagnostics: a protocol for a systematic review and meta-analysis.

BMJ Open. 2024 May 8;14(5):e082287. doi: 10.1136/bmjopen-2023-082287.

Technical skill assessment in minimally invasive surgery using artificial intelligence: a systematic review.

Surg Endosc. 2023 Oct;37(10):7412-7424. doi: 10.1007/s00464-023-10335-z. Epub 2023 Aug 16.

The Need to Prioritize Model-Updating Processes in Clinical Artificial Intelligence (AI) Models: Protocol for a Scoping Review.

JMIR Res Protoc. 2023 Feb 16;12:e37685. doi: 10.2196/37685.

Artificial Intelligence for Hip Fracture Detection and Outcome Prediction: A Systematic Review and Meta-analysis.

JAMA Netw Open. 2023 Mar 1;6(3):e233391. doi: 10.1001/jamanetworkopen.2023.3391.

What Are the Applications and Limitations of Artificial Intelligence for Fracture Detection and Classification in Orthopaedic Trauma Imaging? A Systematic Review.

Clin Orthop Relat Res. 2019 Nov;477(11):2482-2491. doi: 10.1097/CORR.0000000000000848.

引用本文的文献

Methodological and reporting quality of machine learning studies on cancer diagnosis, treatment, and prognosis.

Front Oncol. 2025 Apr 14;15:1555247. doi: 10.3389/fonc.2025.1555247. eCollection 2025.

Bias recognition and mitigation strategies in artificial intelligence healthcare applications.

NPJ Digit Med. 2025 Mar 11;8(1):154. doi: 10.1038/s41746-025-01503-7.

Integrative diagnosis of psychiatric conditions using ChatGPT and fMRI data.

BMC Psychiatry. 2025 Feb 19;25(1):145. doi: 10.1186/s12888-025-06586-w.

A scoping review of magnetic resonance angiography and perfusion image synthesis.

Front Dement. 2024 Nov 11;3:1408782. doi: 10.3389/frdem.2024.1408782. eCollection 2024.

The Transition From Homogeneous to Heterogeneous Machine Learning in Neuropsychiatric Research.

Biol Psychiatry Glob Open Sci. 2024 Sep 26;5(1):100397. doi: 10.1016/j.bpsgos.2024.100397. eCollection 2025 Jan.

Racial and ethnic socioenvironmental inequity and neuroimaging in psychiatry: a brief review of the past and recommendations for the future.

Neuropsychopharmacology. 2024 Nov;50(1):3-15. doi: 10.1038/s41386-024-01901-7. Epub 2024 Jun 20.

A new era in cognitive neuroscience: the tidal wave of artificial intelligence (AI).

BMC Neurosci. 2024 May 6;25(1):23. doi: 10.1186/s12868-024-00869-w.

Transcriptomic and neuroimaging data integration enhances machine learning classification of schizophrenia.

Psychoradiology. 2024 Mar 26;4:kkae005. doi: 10.1093/psyrad/kkae005. eCollection 2024.

Data leakage inflates prediction performance in connectome-based machine learning models.

Nat Commun. 2024 Feb 28;15(1):1829. doi: 10.1038/s41467-024-46150-w.

BrainAGE, brain health, and mental disorders: A systematic review.

Neurosci Biobehav Rev. 2024 Apr;159:105581. doi: 10.1016/j.neubiorev.2024.105581. Epub 2024 Feb 13.

本文引用的文献

Assessment of Adherence to Reporting Guidelines by Commonly Used Clinical Prediction Models From a Single Vendor: A Systematic Review.

JAMA Netw Open. 2022 Aug 1;5(8):e2227779. doi: 10.1001/jamanetworkopen.2022.27779.

A Fusion-Based Technique With Hybrid Swarm Algorithm and Deep Learning for Biosignal Classification.

Front Hum Neurosci. 2022 Jun 3;16:895761. doi: 10.3389/fnhum.2022.895761. eCollection 2022.

Feature and decision-level fusion for schizophrenia detection based on resting-state fMRI data.

PLoS One. 2022 May 24;17(5):e0265300. doi: 10.1371/journal.pone.0265300. eCollection 2022.

The Altered Pattern of the Functional Connectome Related to Pathological Biomarkers in Individuals for Autism Spectrum Disorder Identification.

Front Neurosci. 2022 May 6;16:913377. doi: 10.3389/fnins.2022.913377. eCollection 2022.

Reporting guideline for the early-stage clinical evaluation of decision support systems driven by artificial intelligence: DECIDE-AI.

Nat Med. 2022 May;28(5):924-933. doi: 10.1038/s41591-022-01772-9. Epub 2022 May 18.

Detection of Schizophrenia Cases From Healthy Controls With Combination of Neurocognitive and Electrophysiological Features.

Front Psychiatry. 2022 Apr 5;13:810362. doi: 10.3389/fpsyt.2022.810362. eCollection 2022.

Deep Learning Enabled Diagnosis of Children's ADHD Based on the Big Data of Video Screen Long-Range EEG.

J Healthc Eng. 2022 Apr 4;2022:5222136. doi: 10.1155/2022/5222136. eCollection 2022.

Application of Machine Learning Techniques to Detect the Children with Autism Spectrum Disorder.

J Healthc Eng. 2022 Mar 25;2022:9340027. doi: 10.1155/2022/9340027. eCollection 2022.

Clinical prediction models in psychiatry: a systematic review of two decades of progress and challenges.

Mol Psychiatry. 2022 Jun;27(6):2700-2708. doi: 10.1038/s41380-022-01528-4. Epub 2022 Apr 1.

An End-to-End Depression Recognition Method Based on EEGNet.

Front Psychiatry. 2022 Mar 11;13:864393. doi: 10.3389/fpsyt.2022.864393. eCollection 2022.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

基于神经影像学的人工智能模型在精神疾病诊断中的偏倚风险评估：系统综述。

Evaluation of Risk of Bias in Neuroimaging-Based Artificial Intelligence Models for Psychiatric Diagnosis: A Systematic Review.

机构信息

出版信息

IMPORTANCE

OBJECTIVE

EVIDENCE REVIEW

FINDINGS

CONCLUSIONS AND RELEVANCE

重要性

目的

证据回顾

发现

结论和相关性

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译