• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

不仅仅是“大”数据:样本量、测量误差和无信息预测变量对数字干预预后模型开发的重要性。

Not just "big" data: Importance of sample size, measurement error, and uninformative predictors for developing prognostic models for digital interventions.

作者信息

McNamara Mary E, Zisser Mackenzie, Beevers Christopher G, Shumake Jason

机构信息

Department of Psychology and Institute for Mental Health Research, University of Texas at Austin, USA.

Department of Psychology and Institute for Mental Health Research, University of Texas at Austin, USA.

出版信息

Behav Res Ther. 2022 Jun;153:104086. doi: 10.1016/j.brat.2022.104086. Epub 2022 Apr 14.

DOI:10.1016/j.brat.2022.104086
PMID:35462242
Abstract

There is strong interest in developing a more efficient mental health care system. Digital interventions and predictive models of treatment prognosis will likely play an important role in this endeavor. This article reviews the application of popular machine learning models to the prediction of treatment prognosis, with a particular focus on digital interventions. Assuming that the prediction of treatment prognosis will involve modeling a complex combination of interacting features with measurement error in both the predictors and outcomes, our simulations suggest that to optimize complex prediction models, sample sizes in the thousands will be required. Machine learning methods capable of discovering complex interactions and nonlinear effects (e.g., decision tree ensembles such as gradient boosted machines) perform particularly well in large samples when the predictors and outcomes have virtually no measurement error. However, in the presence of moderate measurement error, these methods provide little or no benefit over regularized linear regression, even with very large sample sizes (N = 100,000) and a non-linear ground truth. Given these sample size requirements, we argue that the scalability of digital interventions, especially when used in combination with optimal measurement practices, provides one of the most effective ways to study treatment prediction models. We conclude with suggestions about how to implement these algorithms into clinical practice.

摘要

人们对开发更高效的精神卫生保健系统有着浓厚兴趣。数字干预和治疗预后预测模型可能会在这一努力中发挥重要作用。本文回顾了流行的机器学习模型在治疗预后预测中的应用,特别关注数字干预。假设治疗预后的预测将涉及对具有测量误差的预测变量和结果中相互作用特征的复杂组合进行建模,我们的模拟表明,为了优化复杂的预测模型,将需要数千个样本量。当预测变量和结果几乎没有测量误差时,能够发现复杂相互作用和非线性效应的机器学习方法(例如梯度提升机等决策树集成方法)在大样本中表现尤其出色。然而,在存在中等测量误差的情况下,即使样本量非常大(N = 100,000)且真实情况为非线性,这些方法相对于正则化线性回归也几乎没有或没有优势。鉴于这些样本量要求,我们认为数字干预的可扩展性,特别是与最佳测量实践结合使用时,提供了研究治疗预测模型最有效的方法之一。我们最后就如何将这些算法应用于临床实践提出了建议。

相似文献

1
Not just "big" data: Importance of sample size, measurement error, and uninformative predictors for developing prognostic models for digital interventions.不仅仅是“大”数据:样本量、测量误差和无信息预测变量对数字干预预后模型开发的重要性。
Behav Res Ther. 2022 Jun;153:104086. doi: 10.1016/j.brat.2022.104086. Epub 2022 Apr 14.
2
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
3
Machine Learning in Aging: An Example of Developing Prediction Models for Serious Fall Injury in Older Adults.机器学习与衰老:以老年人严重跌倒伤害预测模型的开发为例。
J Gerontol A Biol Sci Med Sci. 2021 Mar 31;76(4):647-654. doi: 10.1093/gerona/glaa138.
4
Prediction of 30-Day All-Cause Readmissions in Patients Hospitalized for Heart Failure: Comparison of Machine Learning and Other Statistical Approaches.预测因心力衰竭住院患者的 30 天全因再入院率:机器学习与其他统计学方法的比较。
JAMA Cardiol. 2017 Feb 1;2(2):204-209. doi: 10.1001/jamacardio.2016.3956.
5
Machine Learning-Based Prediction of Clinical Outcomes for Children During Emergency Department Triage.基于机器学习的急诊科分诊中儿童临床结局预测。
JAMA Netw Open. 2019 Jan 4;2(1):e186937. doi: 10.1001/jamanetworkopen.2018.6937.
6
Machine Learning and Psychological Research: The Unexplored Effect of Measurement.机器学习与心理研究:测量的未知效应
Perspect Psychol Sci. 2020 May;15(3):809-816. doi: 10.1177/1745691620902467. Epub 2020 Apr 29.
7
Implementing machine learning methods with complex survey data: Lessons learned on the impacts of accounting sampling weights in gradient boosting.运用复杂调查数据实施机器学习方法:梯度提升中会计抽样权重影响的经验教训。
PLoS One. 2023 Jan 13;18(1):e0280387. doi: 10.1371/journal.pone.0280387. eCollection 2023.
8
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
9
The effect of machine learning regression algorithms and sample size on individualized behavioral prediction with functional connectivity features.机器学习回归算法和样本量对功能连接特征的个体化行为预测的影响。
Neuroimage. 2018 Sep;178:622-637. doi: 10.1016/j.neuroimage.2018.06.001. Epub 2018 Jun 2.
10
A comparative evaluation of the generalised predictive ability of eight machine learning algorithms across ten clinical metabolomics data sets for binary classification.八种机器学习算法在十个临床代谢组学数据集上进行二进制分类的广义预测能力的比较评估。
Metabolomics. 2019 Nov 15;15(12):150. doi: 10.1007/s11306-019-1612-4.

引用本文的文献

1
Prediction of treatment outcome in patients receiving internet-delivered cognitive behavioural therapy for depressive and anxiety symptoms: a machine learning analysis of data from a healthcare-embedded longitudinal study.接受互联网认知行为疗法治疗抑郁和焦虑症状患者的治疗结果预测:一项基于医疗保健嵌入式纵向研究数据的机器学习分析
BMJ Open. 2025 Sep 3;15(9):e097651. doi: 10.1136/bmjopen-2024-097651.
2
Utilizing Predictive Analytics to Understand Neurogenic Bladder Symptom Score (NBSS) Variations in Adults With Acquired Spinal Cord Injury.利用预测分析了解成人获得性脊髓损伤患者的神经源性膀胱症状评分(NBSS)变化
Neurourol Urodyn. 2025 Sep;44(7):1466-1473. doi: 10.1002/nau.70116. Epub 2025 Jul 9.
3
Development and validation of a clinical prognosis prediction model for malignant intestinal obstruction: A retrospective cohort study.
恶性肠梗阻临床预后预测模型的开发与验证:一项回顾性队列研究。
Sci Rep. 2025 Apr 4;15(1):11550. doi: 10.1038/s41598-025-96593-4.
4
Prediction of individual patient outcomes to psychotherapy vs medication for major depression.重度抑郁症患者接受心理治疗与药物治疗的个体预后预测。
Npj Ment Health Res. 2025 Feb 5;4(1):4. doi: 10.1038/s44184-025-00119-9.
5
How measurement noise limits the accuracy of brain-behaviour predictions.测量噪声如何限制脑行为预测的准确性。
Nat Commun. 2024 Dec 12;15(1):10678. doi: 10.1038/s41467-024-54022-6.
6
The Relation Between Passively Collected GPS Mobility Metrics and Depressive Symptoms: Systematic Review and Meta-Analysis.被动收集的 GPS 移动性指标与抑郁症状的关系:系统评价和荟萃分析。
J Med Internet Res. 2024 Nov 1;26:e51875. doi: 10.2196/51875.
7
Complex Emotion Dynamics Contribute to the Prediction of Depression: A Machine Learning and Time Series Feature Extraction Approach.复杂情绪动态有助于预测抑郁症:一种机器学习和时间序列特征提取方法。
Affect Sci. 2024 Aug 3;5(3):259-272. doi: 10.1007/s42761-024-00249-x. eCollection 2024 Sep.
8
Enhancing EEG-based attachment style prediction: unveiling the impact of feature domains.增强基于脑电图的依恋风格预测:揭示特征域的影响。
Front Psychol. 2024 Jan 22;15:1326791. doi: 10.3389/fpsyg.2024.1326791. eCollection 2024.
9
A prognostic index to predict symptom and functional outcomes of a coached, web-based intervention for trauma-exposed veterans.一种预测针对创伤暴露退伍军人的基于网络的辅导干预的症状和功能结果的预后指数。
Psychol Serv. 2024 Nov;21(4):849-858. doi: 10.1037/ser0000828. Epub 2023 Dec 21.
10
Optimizing the predictive power of depression screenings using machine learning.利用机器学习优化抑郁症筛查的预测能力。
Digit Health. 2023 Aug 29;9:20552076231194939. doi: 10.1177/20552076231194939. eCollection 2023 Jan-Dec.