• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

系统地低估了机器学习在抑郁症神经影像学研究中的性能。

Systematic misestimation of machine learning performance in neuroimaging studies of depression.

机构信息

Department of Psychiatry, University of Münster, Münster, Germany.

Faculty of Mathematics and Computer Science, University of Münster, Münster, Germany.

出版信息

Neuropsychopharmacology. 2021 Jul;46(8):1510-1517. doi: 10.1038/s41386-021-01020-7. Epub 2021 May 6.

DOI:10.1038/s41386-021-01020-7
PMID:33958703
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8209109/
Abstract

We currently observe a disconcerting phenomenon in machine learning studies in psychiatry: While we would expect larger samples to yield better results due to the availability of more data, larger machine learning studies consistently show much weaker performance than the numerous small-scale studies. Here, we systematically investigated this effect focusing on one of the most heavily studied questions in the field, namely the classification of patients suffering from Major Depressive Disorder (MDD) and healthy controls based on neuroimaging data. Drawing upon structural MRI data from a balanced sample of N = 1868 MDD patients and healthy controls from our recent international Predictive Analytics Competition (PAC), we first trained and tested a classification model on the full dataset which yielded an accuracy of 61%. Next, we mimicked the process by which researchers would draw samples of various sizes (N = 4 to N = 150) from the population and showed a strong risk of misestimation. Specifically, for small sample sizes (N = 20), we observe accuracies of up to 95%. For medium sample sizes (N = 100) accuracies up to 75% were found. Importantly, further investigation showed that sufficiently large test sets effectively protect against performance misestimation whereas larger datasets per se do not. While these results question the validity of a substantial part of the current literature, we outline the relatively low-cost remedy of larger test sets, which is readily available in most cases.

摘要

我们目前在精神病学的机器学习研究中观察到一个令人不安的现象

尽管我们期望更大的样本由于数据的可用性而产生更好的结果,但更大的机器学习研究始终显示出比众多小规模研究弱得多的性能。在这里,我们集中研究了这个问题,重点是该领域研究最多的问题之一,即基于神经影像学数据对患有重度抑郁症(MDD)和健康对照的患者进行分类。利用我们最近的国际预测分析竞赛(PAC)中来自 N = 1868 名 MDD 患者和健康对照的平衡样本的结构 MRI 数据,我们首先在全数据集上训练和测试了分类模型,该模型的准确率为 61%。接下来,我们模拟了研究人员从人群中抽取各种大小样本(N = 4 到 N = 150)的过程,并显示出严重的估计错误风险。具体来说,对于小样本量(N = 20),我们观察到高达 95%的准确率。对于中等样本量(N = 100),发现准确率高达 75%。重要的是,进一步的调查表明,足够大的测试集可以有效地防止性能估计错误,而大数据集本身并不能。虽然这些结果对当前文献的很大一部分提出了质疑,但我们概述了相对低成本的补救措施,即更大的测试集,这在大多数情况下都可以轻松获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efe0/8209109/d68181894f27/41386_2021_1020_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efe0/8209109/2a34ecf7b2a9/41386_2021_1020_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efe0/8209109/9d1815797770/41386_2021_1020_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efe0/8209109/b5ca4e4588e9/41386_2021_1020_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efe0/8209109/d68181894f27/41386_2021_1020_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efe0/8209109/2a34ecf7b2a9/41386_2021_1020_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efe0/8209109/9d1815797770/41386_2021_1020_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efe0/8209109/b5ca4e4588e9/41386_2021_1020_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efe0/8209109/d68181894f27/41386_2021_1020_Fig4_HTML.jpg

相似文献

1
Systematic misestimation of machine learning performance in neuroimaging studies of depression.系统地低估了机器学习在抑郁症神经影像学研究中的性能。
Neuropsychopharmacology. 2021 Jul;46(8):1510-1517. doi: 10.1038/s41386-021-01020-7. Epub 2021 May 6.
2
The Application of a Machine Learning-Based Brain Magnetic Resonance Imaging Approach in Major Depression.基于机器学习的脑磁共振成像方法在重度抑郁症中的应用。
Adv Exp Med Biol. 2021;1305:57-69. doi: 10.1007/978-981-33-6044-0_4.
3
A Systematic Evaluation of Machine Learning-Based Biomarkers for Major Depressive Disorder.基于机器学习的重度抑郁症生物标志物的系统评价
JAMA Psychiatry. 2024 Apr 1;81(4):386-395. doi: 10.1001/jamapsychiatry.2023.5083.
4
Sleep disturbance-related neuroimaging features as potential biomarkers for the diagnosis of major depressive disorder: A multicenter study based on machine learning.睡眠障碍相关的神经影像学特征作为重度抑郁症诊断的潜在生物标志物:一项基于机器学习的多中心研究。
J Affect Disord. 2021 Dec 1;295:148-155. doi: 10.1016/j.jad.2021.08.027. Epub 2021 Aug 23.
5
Multi-site benchmark classification of major depressive disorder using machine learning on cortical and subcortical measures.基于皮质和皮质下指标的机器学习对重度抑郁症的多部位基准分类。
Sci Rep. 2024 Jan 11;14(1):1084. doi: 10.1038/s41598-023-47934-8.
6
The alterations of brain functional connectivity networks in major depressive disorder detected by machine learning through multisite rs-fMRI data.机器学习通过多中心 rs-fMRI 数据检测到的重度抑郁症患者脑功能连接网络的改变。
Behav Brain Res. 2022 Oct 28;435:114058. doi: 10.1016/j.bbr.2022.114058. Epub 2022 Aug 20.
7
Accuracy of automated classification of major depressive disorder as a function of symptom severity.重度抑郁症自动分类的准确性与症状严重程度的关系。
Neuroimage Clin. 2016 Jul 27;12:320-31. doi: 10.1016/j.nicl.2016.07.012. eCollection 2016.
8
Making Individual Prognoses in Psychiatry Using Neuroimaging and Machine Learning.使用神经影像学和机器学习对精神病学进行个体预后预测。
Biol Psychiatry Cogn Neurosci Neuroimaging. 2018 Sep;3(9):798-808. doi: 10.1016/j.bpsc.2018.04.004. Epub 2018 Apr 22.
9
Will machine learning applied to neuroimaging in bipolar disorder help the clinician? A critical review and methodological suggestions.机器学习在双相情感障碍神经影像学中的应用能否帮助临床医生?批判性评价和方法学建议。
Bipolar Disord. 2020 Jun;22(4):334-355. doi: 10.1111/bdi.12895. Epub 2020 Mar 20.
10
Prediction of Individual Response to Electroconvulsive Therapy via Machine Learning on Structural Magnetic Resonance Imaging Data.基于结构磁共振成像数据的机器学习预测个体对电抽搐治疗的反应。
JAMA Psychiatry. 2016 Jun 1;73(6):557-64. doi: 10.1001/jamapsychiatry.2016.0316.

引用本文的文献

1
Smartphone sensor-based depression detection in campus environments: a proof-of-concept study with small-sample behavioral analysis.校园环境中基于智能手机传感器的抑郁症检测:一项带有小样本行为分析的概念验证研究。
Front Psychiatry. 2025 Aug 7;16:1468334. doi: 10.3389/fpsyt.2025.1468334. eCollection 2025.
2
Taking a look at your speech: identifying diagnostic status and negative symptoms of psychosis using convolutional neural networks.审视你的言语:使用卷积神经网络识别精神病的诊断状态和阴性症状。
NPP Digit Psychiatry Neurosci. 2025;3(1):19. doi: 10.1038/s44277-025-00040-1. Epub 2025 Jul 8.
3
Using the excitation/inhibition ratio to optimize the classification of autism and schizophrenia.
利用兴奋/抑制比优化自闭症和精神分裂症的分类。
Transl Psychiatry. 2025 Jul 9;15(1):234. doi: 10.1038/s41398-025-03455-8.
4
Much More Than the Malady: The Promise of a Web-Based Digital Platform Incorporating Self-Report for Research and Clinical Care in Mild Cognitive Impairment.远不止病症本身:一个基于网络的数字平台在轻度认知障碍研究与临床护理中纳入自我报告的前景。
Mayo Clin Proc Digit Health. 2025 May 6;3(2):100224. doi: 10.1016/j.mcpdig.2025.100224. eCollection 2025 Jun.
5
Predicting the onset of mental health problems in adolescents.预测青少年心理健康问题的发作
Psychol Med. 2025 Apr 30;55:e128. doi: 10.1017/S003329172500087X.
6
Individual-Level Prediction of Exposure Therapy Outcome Using Structural and Functional MRI Data in Spider Phobia: A Machine-Learning Study.利用结构和功能磁共振成像数据对蜘蛛恐惧症暴露疗法结果进行个体水平预测:一项机器学习研究
Depress Anxiety. 2023 Aug 22;2023:8594273. doi: 10.1155/2023/8594273. eCollection 2023.
7
Brain structural correlates of an impending initial major depressive episode.即将发生的首次重度抑郁发作的脑结构关联
Neuropsychopharmacology. 2025 Jun;50(7):1176-1185. doi: 10.1038/s41386-025-02075-6. Epub 2025 Mar 12.
8
Neural representation of consciously seen and unseen information.有意识看到和未看到信息的神经表征。
Sci Rep. 2025 Mar 6;15(1):7888. doi: 10.1038/s41598-025-92490-y.
9
Opportunities and Challenges for Clinical Practice in Detecting Depression Using EEG and Machine Learning.利用脑电图(EEG)和机器学习检测抑郁症在临床实践中的机遇与挑战
Sensors (Basel). 2025 Jan 12;25(2):409. doi: 10.3390/s25020409.
10
Predicting Antidepressant Treatment Response From Cortical Structure on MRI: A Mega-Analysis From the ENIGMA-MDD Working Group.基于MRI皮质结构预测抗抑郁治疗反应:ENIGMA-MDD工作组的一项荟萃分析
Hum Brain Mapp. 2025 Jan;46(1):e70053. doi: 10.1002/hbm.70053.