Experimental Research Center for Medical and Psychological Science (ERC-MPS), School of Psychology, Third Military Medical University, Chongqing, China.
Faculty of Psychology, Southwest University, Chongqing, China.
BMC Med. 2023 Jul 3;21(1):241. doi: 10.1186/s12916-023-02941-4.
The development of machine learning models to aid in the diagnosis of mental disorders is recognized as a significant breakthrough in the field of psychiatry. However, translating such models into clinical practice remains a challenge, with poor generalizability being a major limitation.
Here, we conducted a pre-registered meta-research assessment of neuroimaging-based models in the psychiatric literature, quantitatively examining global and regional sampling issues over recent decades from a perspective that has been relatively underexplored. A total of 476 studies (n = 118,137) were included in the assessment. Based on these findings, we built a comprehensive 5-star rating system to quantitatively evaluate the quality of existing machine learning models for psychiatric diagnosis.
A global sampling inequality in these models was revealed quantitatively (sampling Gini coefficient (G) = 0.81, p < .01), varying across countries and regions (e.g., China, G = 0.47; the USA, G = 0.58; Germany, G = 0.78; the UK, G = 0.87). Furthermore, the severity of this sampling inequality was significantly predicted by national economic level (β = - 2.75, p < .001, R = 0.40; r = - .84, 95% CI: - .97 to - .41), and it plausibly predicted model performance, with higher sampling inequality associated with higher reported classification accuracy. Further analyses showed that lack of independent testing (84.24% of models, 95% CI: 81.0-87.5%), improper cross-validation (51.68% of models, 95% CI: 47.2-56.2%), and poor technical transparency (87.8% of models, 95% CI: 84.9-90.8%) and availability (80.88% of models, 95% CI: 77.3-84.4%) remain prevalent in current diagnostic classifiers, despite improvements over time. Consistent with these observations, model performance was found to decrease in studies with independent cross-country sampling validation (all p < .001, BF > 15). In light of this, we proposed a purpose-built quantitative assessment checklist, which demonstrated that the overall ratings of these models increased with publication year but were negatively associated with model performance.
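The sampling Gini coefficient reported above summarizes how unevenly study samples are distributed across countries or regions, with 0 indicating perfectly even sampling and 1 indicating complete concentration in a single unit. The following is a minimal Python sketch, not the authors' released code, of how such a coefficient can be computed from per-country sample sizes; the function name sampling_gini and the example counts are illustrative assumptions.

```python
# Minimal sketch of a sampling Gini coefficient over per-country
# (or per-region) sample sizes. Example counts are made up; real
# values would come from the 476 included studies.
import numpy as np

def sampling_gini(sample_sizes):
    """Gini coefficient of a distribution of sample sizes.

    Returns 0 for perfectly even sampling across units and
    approaches 1 when all samples concentrate in one unit.
    """
    x = np.sort(np.asarray(sample_sizes, dtype=float))
    n = x.size
    if n == 0 or x.sum() == 0:
        raise ValueError("need at least one non-zero sample size")
    # Standard rank-based formula for sorted (ascending) values:
    # G = (2 * sum_i i * x_i) / (n * sum_i x_i) - (n + 1) / n
    ranks = np.arange(1, n + 1)
    return (2.0 * np.dot(ranks, x)) / (n * x.sum()) - (n + 1.0) / n

# Hypothetical per-country sample sizes for illustration.
print(round(sampling_gini([1200, 300, 150, 80, 40, 10]), 2))
```

With equal sample sizes the function returns 0, which is a quick sanity check on the formula.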
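Likewise, the 5-star rating can be pictured as a simple checklist score. The sketch below is purely hypothetical: it reuses the criteria named in this abstract (independent testing, proper cross-validation, technical transparency, availability, and sampling quality) with equal one-star weights, which need not match the published checklist's actual items or weighting.

```python
# Hypothetical illustration of 5-star scoring mechanics; the paper's
# actual checklist items and weights may differ.
from dataclasses import dataclass

@dataclass
class ModelReport:
    independent_test: bool         # evaluated on held-out/external data
    proper_cross_validation: bool  # e.g., nested CV without leakage
    technical_transparency: bool   # pipeline fully described
    artifacts_available: bool      # code/models shared
    adequate_sampling: bool        # e.g., multi-site, balanced sampling

def star_rating(report: ModelReport) -> int:
    """One star per satisfied criterion (illustrative weighting only)."""
    return sum([
        report.independent_test,
        report.proper_cross_validation,
        report.technical_transparency,
        report.artifacts_available,
        report.adequate_sampling,
    ])

print(star_rating(ModelReport(True, False, True, False, True)))  # -> 3
```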
Together, improving economic equality in sampling, and hence the quality of machine learning models, may be a crucial facet of plausibly translating neuroimaging-based diagnostic classifiers into clinical practice.