• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

表 2 描述性流行病学中的谬误:将机器学习引入其中。

Table 2 Fallacy in Descriptive Epidemiology: Bringing Machine Learning to the Table.

机构信息

Dalla Lana School of Public Health, University of Toronto, Toronton, ON M5T 3M7, Canada.

Center for Addictions and Mental Health, Toronto, ON M6J 1H4, Canada.

出版信息

Int J Environ Res Public Health. 2023 Jun 21;20(13):6194. doi: 10.3390/ijerph20136194.

DOI:10.3390/ijerph20136194
PMID:37444042
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10340623/
Abstract

There is a lack of rigorous methodological development for descriptive epidemiology, where the goal is to describe and identify the most important associations with an outcome given a large set of potential predictors. This has often led to the Table 2 fallacy, where one presents the coefficient estimates for all covariates from a single multivariable regression model, which are often uninterpretable in a descriptive analysis. We argue that machine learning (ML) is a potential solution to this problem. We illustrate the power of ML with an example analysis identifying the most important predictors of alcohol abuse among sexual minority youth. The framework we propose for this analysis is as follows: (1) Identify a few ML methods for the analysis, (2) optimize the parameters using the whole data with a nested cross-validation approach, (3) rank the variables using variable importance scores, (4) present partial dependence plots (PDP) to illustrate the association between the important variables and the outcome, (5) and identify the strength of the interaction terms using the PDPs. We discuss the potential strengths and weaknesses of using ML methods for descriptive analysis and future directions for research. R codes to reproduce these analyses are provided, which we invite other researchers to use.

摘要

描述性流行病学方法的发展缺乏严谨性,其目标是在给定大量潜在预测因子的情况下,描述和识别与结果最相关的因素。这往往导致了表 2 谬误,即从单变量回归模型中呈现所有协变量的系数估计值,而这些估计值在描述性分析中通常是不可解释的。我们认为,机器学习(ML)是解决这个问题的一种潜在方法。我们通过一个分析性示例来说明 ML 的强大功能,该示例确定了性少数青年酗酒的最重要预测因素。我们为这种分析提出的框架如下:(1)确定几种用于分析的 ML 方法,(2)使用嵌套交叉验证方法优化整个数据的参数,(3)使用变量重要性得分对变量进行排名,(4)展示部分依赖图(PDP)以说明重要变量与结果之间的关联,(5)并使用 PDP 来识别交互项的强度。我们讨论了使用 ML 方法进行描述性分析的潜在优势和劣势,以及未来的研究方向。我们提供了可重现这些分析的 R 代码,我们邀请其他研究人员使用这些代码。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b44/10340623/462ee287e2c2/ijerph-20-06194-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b44/10340623/540d4e0b2494/ijerph-20-06194-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b44/10340623/9429771219e2/ijerph-20-06194-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b44/10340623/462ee287e2c2/ijerph-20-06194-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b44/10340623/540d4e0b2494/ijerph-20-06194-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b44/10340623/9429771219e2/ijerph-20-06194-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b44/10340623/462ee287e2c2/ijerph-20-06194-g003.jpg

相似文献

1
Table 2 Fallacy in Descriptive Epidemiology: Bringing Machine Learning to the Table.表 2 描述性流行病学中的谬误:将机器学习引入其中。
Int J Environ Res Public Health. 2023 Jun 21;20(13):6194. doi: 10.3390/ijerph20136194.
2
Using Machine Learning to Predict Young People's Internet Health and Social Service Information Seeking.使用机器学习预测年轻人的互联网健康和社会服务信息寻求。
Prev Sci. 2021 Nov;22(8):1173-1184. doi: 10.1007/s11121-021-01255-2. Epub 2021 May 11.
3
Parent-child Relationships and Sexual Minority Youth: Implications for Adult Alcohol Abuse.亲子关系与性少数青年:对成年酒精滥用的影响。
J Youth Adolesc. 2020 Oct;49(10):2034-2046. doi: 10.1007/s10964-020-01299-7. Epub 2020 Aug 9.
4
Applied machine learning to identify differential risk groups underlying externalizing and internalizing problem behaviors trajectories: A case study using a cohort of Asian American children.应用机器学习识别外化和内化问题行为轨迹背后的差异风险群体:一项以亚裔美国儿童队列为例的研究
PLoS One. 2023 Mar 3;18(3):e0282235. doi: 10.1371/journal.pone.0282235. eCollection 2023.
5
Serious and persistent suicidality among European sexual minority youth.欧洲性少数青年群体中的严重和持续自杀倾向。
PLoS One. 2020 Oct 16;15(10):e0240840. doi: 10.1371/journal.pone.0240840. eCollection 2020.
6
Sexual and Gender Minority Youth Alcohol Use: Within-Group Differences in Associations with Internalized Stigma and Victimization.性少数和跨性别青年的饮酒问题:内在污名和受害经历与内部群体差异的关联。
J Youth Adolesc. 2019 Dec;48(12):2403-2417. doi: 10.1007/s10964-019-01130-y. Epub 2019 Oct 11.
7
Sexual Minority Disparities in Substance Use Willingness Among Youth.青少年中物质使用意愿方面的性少数群体差异
Subst Use Misuse. 2018 Jan 2;53(1):170-175. doi: 10.1080/10826084.2017.1327977. Epub 2017 Aug 4.
8
Associations Between Latent Classes of Trauma Exposure and Minority Stressors and Substance Use Among Cisgender Sexual Minority Women.跨性别性少数群体女性的创伤暴露和少数群体应激源与物质使用之间的关联。
J Interpers Violence. 2023 Jul;38(13-14):8286-8315. doi: 10.1177/08862605231153886. Epub 2023 Feb 26.
9
Sexual Minority Youth at Risk of Early and Persistent Alcohol, Tobacco, and Marijuana Use.性少数青年有早发和持续使用酒精、烟草和大麻的风险。
Arch Sex Behav. 2019 May;48(4):1073-1086. doi: 10.1007/s10508-018-1275-7. Epub 2019 Jan 2.
10
Uncovering the Most Important Factors for Predicting Sexual Desire Using Explainable Machine Learning.利用可解释机器学习揭示预测性欲的最重要因素
J Sex Med. 2021 Jul;18(7):1198-1216. doi: 10.1016/j.jsxm.2021.04.010. Epub 2021 Jun 25.

引用本文的文献

1
Comparison of model feature importance statistics to identify covariates that contribute most to model accuracy in prediction of insomnia.比较模型特征重要性统计数据,以确定对预测失眠模型准确性贡献最大的协变量。
PLoS One. 2024 Jul 2;19(7):e0306359. doi: 10.1371/journal.pone.0306359. eCollection 2024.

本文引用的文献

1
Describing a complex primary health care population to support future decision support initiatives.描述一个复杂的初级医疗保健人群,以支持未来的决策支持计划。
Int J Popul Data Sci. 2022 Oct 24;7(1):1756. doi: 10.23889/ijpds.v7i1.1756. eCollection 2022.
2
A Machine Learning Approach Reveals Distinct Predictors of Vaping Dependence for Adolescent Daily and Non-Daily Vapers in the COVID-19 Era.一种机器学习方法揭示了 COVID-19 时代青少年日常和非日常电子烟使用者电子烟依赖的不同预测因素。
Healthcare (Basel). 2023 May 18;11(10):1465. doi: 10.3390/healthcare11101465.
3
Development and evaluation of a risk algorithm predicting alcohol dependence after early onset of regular alcohol use.
早期规律饮酒后预测酒精依赖风险算法的开发与评估。
Addiction. 2023 May;118(5):954-966. doi: 10.1111/add.16122. Epub 2023 Jan 31.
4
Correlates of wanting to seek help for mental health and substance use concerns by sexual and gender minority young adults during the COVID-19 pandemic: A machine learning analysis.性少数和性别少数青年在 COVID-19 大流行期间寻求心理健康和物质使用问题帮助的意愿相关因素:机器学习分析。
PLoS One. 2022 Nov 16;17(11):e0277438. doi: 10.1371/journal.pone.0277438. eCollection 2022.
5
A Framework for Descriptive Epidemiology.描述性流行病学框架。
Am J Epidemiol. 2022 Nov 19;191(12):2063-2070. doi: 10.1093/aje/kwac115.
6
Correlates of past year suicidal thoughts among sexual and gender minority young adults: A machine learning analysis.性少数和性别少数青年过去一年自杀想法的相关因素:一项机器学习分析。
J Psychiatr Res. 2022 Aug;152:269-277. doi: 10.1016/j.jpsychires.2022.06.013. Epub 2022 Jun 12.
7
The relationship between sexual and gender stigma and suicide attempt and ideation among LGBTQI + populations in Thailand: findings from a national survey.泰国 LGBTQI+ 群体中的性与性别污名与自杀企图和意念之间的关系:一项全国性调查的结果。
Soc Psychiatry Psychiatr Epidemiol. 2022 Oct;57(10):1987-1997. doi: 10.1007/s00127-022-02292-0. Epub 2022 May 23.
8
LGBTQ Identity-Related Victimization During COVID-19 Is Associated with Moderate to Severe Psychological Distress Among Young Adults.在 COVID-19 期间,与 LGBTQ 身份相关的受害与年轻人的中重度心理困扰相关。
LGBT Health. 2022 Jul;9(5):303-312. doi: 10.1089/lgbt.2021.0280. Epub 2022 May 10.
9
On the Need to Revitalize Descriptive Epidemiology.有必要复兴描述性流行病学。
Am J Epidemiol. 2022 Jun 27;191(7):1174-1179. doi: 10.1093/aje/kwac056.
10
Describing Intersectional Health Outcomes: An Evaluation of Data Analysis Methods.描述交叉健康结局:数据分析方法的评估。
Epidemiology. 2022 May 1;33(3):395-405. doi: 10.1097/EDE.0000000000001466.