• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于调查研究的基于树的机器学习方法。

Tree-based Machine Learning Methods for Survey Research.

作者信息

Kern Christoph, Klausch Thomas, Kreuter Frauke

机构信息

University of Mannheim.

VU University Medical Center.

出版信息

Surv Res Methods. 2019 Apr 11;13(1):73-93.

PMID:32802211
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7425836/
Abstract

Predictive modeling methods from the field of machine learning have become a popular tool across various disciplines for exploring and analyzing diverse data. These methods often do not require specific prior knowledge about the functional form of the relationship under study and are able to adapt to complex non-linear and non-additive interrelations between the outcome and its predictors while focusing specifically on prediction performance. This modeling perspective is beginning to be adopted by survey researchers in order to adjust or improve various aspects of data collection and/or survey management. To facilitate this strand of research, this paper (1) provides an introduction to prominent tree-based machine learning methods, (2) reviews and discusses previous and (potential) prospective applications of tree-based supervised learning in survey research, and (3) exemplifies the usage of these techniques in the context of modeling and predicting nonresponse in panel surveys.

摘要

机器学习领域的预测建模方法已成为各学科中用于探索和分析各类数据的常用工具。这些方法通常不需要关于所研究关系的函数形式的特定先验知识,并且能够适应结果及其预测变量之间复杂的非线性和非加性相互关系,同时特别关注预测性能。这种建模视角正开始被调查研究人员采用,以调整或改进数据收集和/或调查管理的各个方面。为推动这一研究方向,本文(1)介绍了著名的基于树的机器学习方法,(2)回顾并讨论了基于树的监督学习在调查研究中的先前及(潜在)未来应用,(3)举例说明了这些技术在面板调查中对无回答进行建模和预测的背景下的用法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4166/7425836/ce6fe185ed1f/nihms-1594055-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4166/7425836/d69344024489/nihms-1594055-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4166/7425836/eb873c4567da/nihms-1594055-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4166/7425836/91fe0b1304f1/nihms-1594055-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4166/7425836/f2e7766fc535/nihms-1594055-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4166/7425836/ce6fe185ed1f/nihms-1594055-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4166/7425836/d69344024489/nihms-1594055-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4166/7425836/eb873c4567da/nihms-1594055-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4166/7425836/91fe0b1304f1/nihms-1594055-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4166/7425836/f2e7766fc535/nihms-1594055-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4166/7425836/ce6fe185ed1f/nihms-1594055-f0005.jpg

相似文献

1
Tree-based Machine Learning Methods for Survey Research.用于调查研究的基于树的机器学习方法。
Surv Res Methods. 2019 Apr 11;13(1):73-93.
2
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
3
The power of online panel paradata to predict unit nonresponse and voluntary attrition in a longitudinal design.在线面板辅助数据在纵向设计中预测单位无应答和自愿退出的能力。
Qual Quant. 2023;57(2):1055-1078. doi: 10.1007/s11135-022-01385-x. Epub 2022 Apr 25.
4
Machine learning applications in cancer prognosis and prediction.机器学习在癌症预后和预测中的应用。
Comput Struct Biotechnol J. 2014 Nov 15;13:8-17. doi: 10.1016/j.csbj.2014.11.005. eCollection 2015.
5
A machine learning-based approach to prognostic analysis of thoracic transplantations.基于机器学习的方法对胸移植进行预后分析。
Artif Intell Med. 2010 May;49(1):33-42. doi: 10.1016/j.artmed.2010.01.002. Epub 2010 Feb 13.
6
Machine Learning-Based Prediction Models for Depression Symptoms Among Chinese Healthcare Workers During the Early COVID-19 Outbreak in 2020: A Cross-Sectional Study.2020年新冠疫情早期中国医护人员抑郁症状的机器学习预测模型:一项横断面研究
Front Psychiatry. 2022 Apr 29;13:876995. doi: 10.3389/fpsyt.2022.876995. eCollection 2022.
7
Potential benefits and limitations of machine learning in the field of eating disorders: current research and future directions.机器学习在饮食失调领域的潜在益处与局限性:当前研究与未来方向
J Eat Disord. 2022 May 8;10(1):66. doi: 10.1186/s40337-022-00581-2.
8
Reporting of Model Performance and Statistical Methods in Studies That Use Machine Learning to Develop Clinical Prediction Models: Protocol for a Systematic Review.使用机器学习开发临床预测模型的研究中模型性能和统计方法的报告:系统评价方案
JMIR Res Protoc. 2022 Mar 3;11(3):e30956. doi: 10.2196/30956.
9
A data-driven approach to predicting diabetes and cardiovascular disease with machine learning.基于机器学习的数据驱动方法预测糖尿病和心血管疾病。
BMC Med Inform Decis Mak. 2019 Nov 6;19(1):211. doi: 10.1186/s12911-019-0918-5.
10
Predicting vaginal birth after previous cesarean: Using machine-learning models and a population-based cohort in Sweden.预测既往剖宫产术后的阴道分娩:使用机器学习模型和瑞典的基于人群队列。
Acta Obstet Gynecol Scand. 2021 Mar;100(3):513-520. doi: 10.1111/aogs.14020. Epub 2020 Oct 31.

引用本文的文献

1
Effects of survey administration mode on response profiles are predictable, and robust across countries: Evidence from 29 countries using machine-learning models.调查管理模式对回答情况的影响是可预测的,并且在各国都具有稳健性:来自29个国家的机器学习模型证据。
PLoS One. 2025 Sep 12;20(9):e0330182. doi: 10.1371/journal.pone.0330182. eCollection 2025.
2
Decoding vital variables in predicting different phases of suicide among young adults with childhood sexual abuse: a machine learning approach.解码童年期性虐待的年轻成年人自杀不同阶段预测中的关键变量:一种机器学习方法。
Transl Psychiatry. 2025 Apr 24;15(1):158. doi: 10.1038/s41398-025-03360-0.
3

本文引用的文献

1
Comments on Fifty Years of Classification and Regression Trees.关于分类与回归树五十年的评论
Int Stat Rev. 2014 Dec 1;82(3):359-361. doi: 10.1111/insr.12060.
2
Random forest methodology for model-based recursive partitioning: the mobForest package for R.基于模型的递归分割的随机森林方法:R 中的 mobForest 包。
BMC Bioinformatics. 2013 Apr 11;14:125. doi: 10.1186/1471-2105-14-125.
3
An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests.
Latent Variable Forests for Latent Variable Score Estimation.
用于潜在变量得分估计的潜在变量森林
Educ Psychol Meas. 2024 Dec;84(6):1138-1172. doi: 10.1177/00131644241237502. Epub 2024 Apr 1.
4
Utilizing machine learning to predict participant response to follow-up health surveys in the Millennium Cohort Study.利用机器学习预测千禧年队列研究中参与者对后续健康调查的反应。
Sci Rep. 2024 Oct 28;14(1):25764. doi: 10.1038/s41598-024-77563-8.
5
Developing an Early Warning System for Financial Networks: An Explainable Machine Learning Approach.开发金融网络预警系统:一种可解释的机器学习方法。
Entropy (Basel). 2024 Sep 17;26(9):796. doi: 10.3390/e26090796.
6
Letter to the editor for the article "A machine learning approach using stone volume to predict stone-free status at ureteroscopy".致编辑的信:关于《一种使用结石体积预测输尿管镜检查无结石状态的机器学习方法》一文
World J Urol. 2024 Aug 22;42(1):492. doi: 10.1007/s00345-024-05198-5.
7
Detecting Differential Item Functioning in Multidimensional Graded Response Models With Recursive Partitioning.使用递归划分法在多维等级反应模型中检测项目差异功能
Appl Psychol Meas. 2024 May;48(3):83-103. doi: 10.1177/01466216241238743. Epub 2024 Mar 13.
8
Leveraging multivariate approaches to advance the science of early-life adversity.利用多变量方法推动早期生活逆境科学的发展。
Child Abuse Negl. 2024 Mar 22:106754. doi: 10.1016/j.chiabu.2024.106754.
9
Calibration and XGBoost reweighting to reduce coverage and non-response biases in overlapping panel surveys: application to the Healthcare and Social Survey.校准和 XGBoost 再加权以减少重叠面板调查中的覆盖和非响应偏差:在医疗保健和社会调查中的应用。
BMC Med Res Methodol. 2024 Feb 15;24(1):36. doi: 10.1186/s12874-024-02171-z.
10
Which client with generalized anxiety disorder benefits from a mindfulness ecological momentary intervention versus a self-monitoring app? Developing a multivariable machine learning predictive model.哪种广泛性焦虑障碍患者从正念生态瞬间干预中获益,而不是自我监测应用程序?开发多变量机器学习预测模型。
J Anxiety Disord. 2024 Mar;102:102825. doi: 10.1016/j.janxdis.2024.102825. Epub 2024 Jan 5.
递归分区介绍:分类和回归树、装袋和随机森林的原理、应用和特点。
Psychol Methods. 2009 Dec;14(4):323-48. doi: 10.1037/a0016973.
4
Bias in random forest variable importance measures: illustrations, sources and a solution.随机森林变量重要性度量中的偏差:示例、来源及解决方案
BMC Bioinformatics. 2007 Jan 25;8:25. doi: 10.1186/1471-2105-8-25.