• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无意义的比较会导致医学机器学习中出现虚假的乐观情绪。

Meaningless comparisons lead to false optimism in medical machine learning.

作者信息

DeMasi Orianna, Kording Konrad, Recht Benjamin

机构信息

Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, California, United States of America.

Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.

出版信息

PLoS One. 2017 Sep 26;12(9):e0184604. doi: 10.1371/journal.pone.0184604. eCollection 2017.

DOI:10.1371/journal.pone.0184604
PMID:28949964
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5614525/
Abstract

A new trend in medicine is the use of algorithms to analyze big datasets, e.g. using everything your phone measures about you for diagnostics or monitoring. However, these algorithms are commonly compared against weak baselines, which may contribute to excessive optimism. To assess how well an algorithm works, scientists typically ask how well its output correlates with medically assigned scores. Here we perform a meta-analysis to quantify how the literature evaluates their algorithms for monitoring mental wellbeing. We find that the bulk of the literature (∼77%) uses meaningless comparisons that ignore patient baseline state. For example, having an algorithm that uses phone data to diagnose mood disorders would be useful. However, it is possible to explain over 80% of the variance of some mood measures in the population by simply guessing that each patient has their own average mood-the patient-specific baseline. Thus, an algorithm that just predicts that our mood is like it usually is can explain the majority of variance, but is, obviously, entirely useless. Comparing to the wrong (population) baseline has a massive effect on the perceived quality of algorithms and produces baseless optimism in the field. To solve this problem we propose "user lift" that reduces these systematic errors in the evaluation of personalized medical monitoring.

摘要

医学领域的一个新趋势是利用算法来分析大型数据集,例如利用手机测量的关于你的所有数据进行诊断或监测。然而,这些算法通常是与较弱的基线进行比较,这可能会导致过度乐观。为了评估算法的效果如何,科学家们通常会询问其输出与医学指定分数的关联程度。在此,我们进行一项荟萃分析,以量化文献中对监测心理健康算法的评估方式。我们发现,大部分文献(约77%)使用的是无意义的比较,忽略了患者的基线状态。例如,有一个利用手机数据诊断情绪障碍的算法可能会很有用。然而,通过简单猜测每个患者都有自己的平均情绪——即患者特定的基线,就有可能解释人群中某些情绪指标超过80%的方差。因此,一个仅仅预测我们的情绪和平时一样的算法可以解释大部分方差,但显然完全没有用处。与错误的(总体)基线进行比较,会对算法的感知质量产生巨大影响,并在该领域产生毫无根据的乐观情绪。为了解决这个问题,我们提出了“用户提升”方法,以减少个性化医疗监测评估中的这些系统误差。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b116/5614525/55c486961526/pone.0184604.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b116/5614525/bcf4b59c7d07/pone.0184604.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b116/5614525/69e429f7dcc2/pone.0184604.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b116/5614525/69de89465a09/pone.0184604.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b116/5614525/55c486961526/pone.0184604.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b116/5614525/bcf4b59c7d07/pone.0184604.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b116/5614525/69e429f7dcc2/pone.0184604.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b116/5614525/69de89465a09/pone.0184604.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b116/5614525/55c486961526/pone.0184604.g004.jpg

相似文献

1
Meaningless comparisons lead to false optimism in medical machine learning.无意义的比较会导致医学机器学习中出现虚假的乐观情绪。
PLoS One. 2017 Sep 26;12(9):e0184604. doi: 10.1371/journal.pone.0184604. eCollection 2017.
2
Applications of machine learning algorithms to predict therapeutic outcomes in depression: A meta-analysis and systematic review.机器学习算法在预测抑郁症治疗效果中的应用:一项荟萃分析和系统评价。
J Affect Disord. 2018 Dec 1;241:519-532. doi: 10.1016/j.jad.2018.08.073. Epub 2018 Aug 14.
3
Mood Prediction of Patients With Mood Disorders by Machine Learning Using Passive Digital Phenotypes Based on the Circadian Rhythm: Prospective Observational Cohort Study.基于昼夜节律的被动数字表型通过机器学习对心境障碍患者进行心境预测:前瞻性观察队列研究
J Med Internet Res. 2019 Apr 17;21(4):e11029. doi: 10.2196/11029.
4
Overtaking method based on sand-sifter mechanism: Why do optimistic value functions find optimal solutions in multi-armed bandit problems?基于筛沙机制的超越方法:为何乐观值函数能在多臂老虎机问题中找到最优解?
Biosystems. 2015 Sep;135:55-65. doi: 10.1016/j.biosystems.2015.06.009. Epub 2015 Jul 10.
5
An algorithm for direct causal learning of influences on patient outcomes.一种用于直接因果学习对患者预后影响的算法。
Artif Intell Med. 2017 Jan;75:1-15. doi: 10.1016/j.artmed.2016.10.003. Epub 2016 Nov 5.
6
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
7
Understanding Mood Disorders in Children.了解儿童情绪障碍。
Adv Exp Med Biol. 2019;1192:251-261. doi: 10.1007/978-981-32-9721-0_12.
8
Optimism in Active Learning.主动学习中的乐观主义。
Comput Intell Neurosci. 2015;2015:973696. doi: 10.1155/2015/973696. Epub 2015 Nov 23.
9
Robust differential expression analysis by learning discriminant boundary in multi-dimensional space of statistical attributes.通过在统计属性的多维空间中学习判别边界进行稳健的差异表达分析。
BMC Bioinformatics. 2016 Dec 19;17(1):541. doi: 10.1186/s12859-016-1386-x.
10
A novel technique to evaluate fluctuations of mood: implications for evaluating course and treatment effects in bipolar/affective disorders.一种评估情绪波动的新方法:对双相情感障碍/情感性疾病病程及治疗效果评估的意义
Bipolar Disord. 2006 Oct;8(5 Pt 1):453-66. doi: 10.1111/j.1399-5618.2006.00374.x.

引用本文的文献

1
Estimation of minimal data sets sizes for machine learning predictions in digital mental health interventions.数字心理健康干预中机器学习预测的最小数据集大小估计
NPJ Digit Med. 2024 Dec 18;7(1):361. doi: 10.1038/s41746-024-01360-w.
2
Methodological choices and clinical usefulness for machine learning predictions of outcome in Internet-based cognitive behavioural therapy.基于互联网的认知行为疗法中机器学习对结果预测的方法选择及临床实用性
Commun Med (Lond). 2024 Oct 10;4(1):196. doi: 10.1038/s43856-024-00626-4.
3
Dataset size versus homogeneity: A machine learning study on pooling intervention data in e-mental health dropout predictions.

本文引用的文献

1
Discovering Shifts to Suicidal Ideation from Mental Health Content in Social Media.从社交媒体上的心理健康内容中发现向自杀意念的转变。
Proc SIGCHI Conf Hum Factor Comput Syst. 2016 May;2016:2098-2110. doi: 10.1145/2858036.2858207.
2
Predicting students' happiness from physiology, phone, mobility, and behavioral data.通过生理、手机、移动性和行为数据预测学生的幸福感。
Int Conf Affect Comput Intell Interact Workshops. 2015 Sep;2015:222-228. doi: 10.1109/ACII.2015.7344575. Epub 2015 Dec 7.
3
Predicting suicides after outpatient mental health visits in the Army Study to Assess Risk and Resilience in Servicemembers (Army STARRS).
数据集规模与同质性:一项关于在电子心理健康辍学预测中合并干预数据的机器学习研究。
Digit Health. 2024 May 15;10:20552076241248920. doi: 10.1177/20552076241248920. eCollection 2024 Jan-Dec.
4
REFORMS: Consensus-based Recommendations for Machine-learning-based Science.改革:基于共识的机器学习科学建议。
Sci Adv. 2024 May 3;10(18):eadk3452. doi: 10.1126/sciadv.adk3452. Epub 2024 May 1.
5
Development and validation of a machine learning model using electronic health records to predict trauma- and stressor-related psychiatric disorders after hospitalization with sepsis.利用电子健康记录开发和验证机器学习模型,以预测脓毒症住院后与创伤和应激相关的精神障碍。
Transl Psychiatry. 2023 Dec 18;13(1):400. doi: 10.1038/s41398-023-02699-6.
6
Finding the Best Match - a Case Study on the (Text-)Feature and Model Choice in Digital Mental Health Interventions.寻找最佳匹配——数字心理健康干预中(文本)特征与模型选择的案例研究
J Healthc Inform Res. 2023 Sep 18;7(4):447-479. doi: 10.1007/s41666-023-00148-z. eCollection 2023 Dec.
7
Development and Validation of a Machine Learning Prediction Model of Posttraumatic Stress Disorder After Military Deployment.军事部署后创伤后应激障碍的机器学习预测模型的开发和验证。
JAMA Netw Open. 2023 Jun 1;6(6):e2321273. doi: 10.1001/jamanetworkopen.2023.21273.
8
Predicting therapy outcome in a digital mental health intervention for depression and anxiety: A machine learning approach.预测针对抑郁和焦虑的数字心理健康干预的治疗效果:一种机器学习方法。
Digit Health. 2021 Nov 29;7:20552076211060659. doi: 10.1177/20552076211060659. eCollection 2021 Jan-Dec.
9
From Hume to Wuhan: An Epistemological Journey on the Problem of Induction in COVID-19 Machine Learning Models and its Impact Upon Medical Research.从休谟到武汉:关于新冠机器学习模型中归纳问题及其对医学研究影响的认识论之旅
IEEE Access. 2021 Jul 6;9:97243-97250. doi: 10.1109/ACCESS.2021.3095222. eCollection 2021.
10
Machine learning applications in radiation oncology.机器学习在放射肿瘤学中的应用。
Phys Imaging Radiat Oncol. 2021 Jun 24;19:13-24. doi: 10.1016/j.phro.2021.05.007. eCollection 2021 Jul.
在陆军评估军人风险与恢复力研究(Army STARRS)中预测门诊心理健康就诊后的自杀情况。
Mol Psychiatry. 2017 Apr;22(4):544-551. doi: 10.1038/mp.2016.110. Epub 2016 Jul 19.
4
Mobile Phone-Based Unobtrusive Ecological Momentary Assessment of Day-to-Day Mood: An Explorative Study.基于手机的日常情绪非侵入性生态瞬时评估:一项探索性研究。
J Med Internet Res. 2016 Mar 29;18(3):e72. doi: 10.2196/jmir.5505.
5
Machine Learning and the Profession of Medicine.机器学习与医学职业。
JAMA. 2016 Feb 9;315(6):551-2. doi: 10.1001/jama.2015.18421.
6
Modeling perceived stress via HRV and accelerometer sensor streams.通过心率变异性(HRV)和加速度计传感器数据流对感知压力进行建模。
Annu Int Conf IEEE Eng Med Biol Soc. 2015 Aug;2015:1625-8. doi: 10.1109/EMBC.2015.7318686.
7
The application of mHealth to mental health: opportunities and challenges.移动健康在心理健康领域的应用:机遇与挑战。
Lancet Psychiatry. 2015 Oct;2(10):942-8. doi: 10.1016/S2215-0366(15)00268-0. Epub 2015 Sep 29.
8
Mobile Phone Sensor Correlates of Depressive Symptom Severity in Daily-Life Behavior: An Exploratory Study.日常生活行为中手机传感器与抑郁症状严重程度的相关性:一项探索性研究。
J Med Internet Res. 2015 Jul 15;17(7):e175. doi: 10.2196/jmir.4273.
9
Mobile mental health: a challenging research agenda.移动心理健康:一项具有挑战性的研究议程。
Eur J Psychotraumatol. 2015 May 19;6:27882. doi: 10.3402/ejpt.v6.27882. eCollection 2015.
10
Next-generation psychiatric assessment: Using smartphone sensors to monitor behavior and mental health.下一代精神病学评估:利用智能手机传感器监测行为和心理健康。
Psychiatr Rehabil J. 2015 Sep;38(3):218-226. doi: 10.1037/prj0000130. Epub 2015 Apr 6.