• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习预测千禧年队列研究中参与者对后续健康调查的反应。

Utilizing machine learning to predict participant response to follow-up health surveys in the Millennium Cohort Study.

机构信息

Deployment Health Research Department, Naval Health Research Center, San Diego, CA, USA.

Leidos, Inc, San Diego, CA, USA.

出版信息

Sci Rep. 2024 Oct 28;14(1):25764. doi: 10.1038/s41598-024-77563-8.

DOI:10.1038/s41598-024-77563-8
PMID:39468293
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11519444/
Abstract

The Millennium Cohort Study is a longitudinal study which collects self-reported data from surveys to examine the long-term effects of military service. Participant nonresponse to follow-up surveys presents a potential threat to the validity and generalizability of study findings. In recent years, predictive analytics has emerged as a promising tool to identify predictors of nonresponse. Here, we develop a high-skill classifier using machine learning techniques to predict participant response to follow-up surveys of the Millennium Cohort Study. Six supervised algorithms were employed to predict response to the 2021 follow-up survey. Using latent class analysis (LCA), we classified participants based on historical survey response and compared prediction performance with and without this variable. Feature analysis was subsequently conducted on the best-performing model. Including the LCA variable in the machine learning analysis, all six algorithms performed comparably. Without the LCA variable, random forest outperformed the benchmark regression model, however overall prediction performance decreased. Feature analysis showed the LCA variable as the most important predictor. Our findings highlight the importance of historical response to improve prediction performance of participant response to follow-up surveys. Machine learning algorithms can be especially valuable when historical data are not available. Implementing these methods in longitudinal studies can enhance outreach efforts by strategically targeting participants, ultimately boosting survey response rates and mitigating nonresponse.

摘要

千禧年队列研究是一项纵向研究,通过调查收集自我报告数据,以研究兵役的长期影响。参与者对后续调查的无回应可能对研究结果的有效性和普遍性构成威胁。近年来,预测分析已成为识别无回应预测因素的有前途的工具。在这里,我们使用机器学习技术开发了一种高精度分类器,以预测千禧年队列研究参与者对后续调查的回应。使用了六种有监督算法来预测对 2021 年后续调查的回应。我们使用潜在类别分析(LCA)根据历史调查回应对参与者进行分类,并比较了有和没有此变量的预测性能。随后对表现最佳的模型进行了特征分析。在机器学习分析中包含 LCA 变量时,所有六种算法的表现相当。没有 LCA 变量时,随机森林的表现优于基准回归模型,但整体预测性能下降。特征分析表明 LCA 变量是最重要的预测因素。我们的研究结果强调了历史回应对于改善参与者对后续调查回应的预测性能的重要性。当没有历史数据时,机器学习算法尤其有价值。在纵向研究中实施这些方法可以通过有策略地针对参与者来增强外展工作,最终提高调查回应率并减轻无回应的影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da8a/11519444/a6182c765353/41598_2024_77563_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da8a/11519444/36c4358de5dd/41598_2024_77563_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da8a/11519444/a7338fe80965/41598_2024_77563_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da8a/11519444/a6182c765353/41598_2024_77563_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da8a/11519444/36c4358de5dd/41598_2024_77563_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da8a/11519444/a7338fe80965/41598_2024_77563_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da8a/11519444/a6182c765353/41598_2024_77563_Fig3_HTML.jpg

相似文献

1
Utilizing machine learning to predict participant response to follow-up health surveys in the Millennium Cohort Study.利用机器学习预测千禧年队列研究中参与者对后续健康调查的反应。
Sci Rep. 2024 Oct 28;14(1):25764. doi: 10.1038/s41598-024-77563-8.
2
Survey response over 15 years of follow-up in the Millennium Cohort Study.在千禧队列研究中进行了超过 15 年的随访后的调查反馈。
BMC Med Res Methodol. 2023 Sep 9;23(1):205. doi: 10.1186/s12874-023-02018-z.
3
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
4
Determinants of Visual Impairment Among Chinese Middle-Aged and Older Adults: Risk Prediction Model Using Machine Learning Algorithms.中国中老年人群视力障碍的决定因素:基于机器学习算法的风险预测模型。
JMIR Aging. 2024 Oct 9;7:e59810. doi: 10.2196/59810.
5
Predicting diabetes in adults: identifying important features in unbalanced data over a 5-year cohort study using machine learning algorithm.预测成年人糖尿病:使用机器学习算法在 5 年队列研究中识别不平衡数据中的重要特征。
BMC Med Res Methodol. 2024 Sep 27;24(1):220. doi: 10.1186/s12874-024-02341-z.
6
Predicting hospitalization following psychiatric crisis care using machine learning.运用机器学习预测精神科危机护理后的住院情况。
BMC Med Inform Decis Mak. 2020 Dec 10;20(1):332. doi: 10.1186/s12911-020-01361-1.
7
Assessing nonresponse bias at follow-up in a large prospective cohort of relatively young and mobile military service members.评估大型前瞻性队列中相对年轻且流动性较强的现役军人随访时的无应答偏倚。
BMC Med Res Methodol. 2010 Oct 21;10:99. doi: 10.1186/1471-2288-10-99.
8
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.用于预测埃塞俄比亚 COVID-19 死亡率的机器学习算法。
BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.
9
Predictive etiological classification of acute ischemic stroke through interpretable machine learning algorithms: a multicenter, prospective cohort study.通过可解释的机器学习算法对急性缺血性脑卒中进行预测病因分类:一项多中心前瞻性队列研究。
BMC Med Res Methodol. 2024 Sep 10;24(1):199. doi: 10.1186/s12874-024-02331-1.
10
Can Machine-learning Algorithms Predict Early Revision TKA in the Danish Knee Arthroplasty Registry?机器学习算法能否预测丹麦膝关节置换登记处的早期翻修 TKA?
Clin Orthop Relat Res. 2020 Sep;478(9):2088-2101. doi: 10.1097/CORR.0000000000001343.

本文引用的文献

1
Part I: A friendly introduction to latent class analysis.第一部分:潜类分析简介。
J Clin Epidemiol. 2022 Jul;147:168-170. doi: 10.1016/j.jclinepi.2022.05.008. Epub 2022 May 27.
2
The Millennium Cohort Study: The first 20 years of research dedicated to understanding the long-term health of US Service Members and Veterans.千禧队列研究:致力于了解美国军人和退伍军人长期健康状况的首个20年研究。
Ann Epidemiol. 2022 Mar;67:61-72. doi: 10.1016/j.annepidem.2021.12.002. Epub 2021 Dec 11.
3
Tree-based Machine Learning Methods for Survey Research.
用于调查研究的基于树的机器学习方法。
Surv Res Methods. 2019 Apr 11;13(1):73-93.
4
Random forests for high-dimensional longitudinal data.随机森林在高维纵向数据中的应用。
Stat Methods Med Res. 2021 Jan;30(1):166-184. doi: 10.1177/0962280220946080. Epub 2020 Aug 9.
5
Using Marketing Automation to Modernize Data Collection in the California Teachers Study Cohort.利用营销自动化实现加利福尼亚教师研究队列的数据收集现代化。
Cancer Epidemiol Biomarkers Prev. 2020 Apr;29(4):714-723. doi: 10.1158/1055-9965.EPI-19-0841. Epub 2020 Feb 13.
6
Retention strategies in longitudinal cohort studies: a systematic review and meta-analysis.纵向队列研究中的保留策略:系统评价和荟萃分析。
BMC Med Res Methodol. 2018 Nov 26;18(1):151. doi: 10.1186/s12874-018-0586-7.
7
Statistical notes for clinical researchers: Chi-squared test and Fisher's exact test.临床研究人员的统计学笔记:卡方检验与费舍尔精确检验。
Restor Dent Endod. 2017 May;42(2):152-155. doi: 10.5395/rde.2017.42.2.152. Epub 2017 Mar 30.
8
Longitudinal studies.纵向研究。
J Thorac Dis. 2015 Nov;7(11):E537-40. doi: 10.3978/j.issn.2072-1439.2015.10.63.
9
Using a birth cohort to study ageing: representativeness and response rates in the National Survey of Health and Development.利用出生队列研究衰老:英国国家健康与发展调查中的代表性及应答率
Eur J Ageing. 2013 Jun;10(2):145-157. doi: 10.1007/s10433-013-0258-8.
10
Assessing nonresponse bias at follow-up in a large prospective cohort of relatively young and mobile military service members.评估大型前瞻性队列中相对年轻且流动性较强的现役军人随访时的无应答偏倚。
BMC Med Res Methodol. 2010 Oct 21;10:99. doi: 10.1186/1471-2288-10-99.