• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过自举聚合进行有效的基因-环境交互作用检验。

Efficient gene-environment interaction testing through bootstrap aggregating.

机构信息

Mathematical Institute, Heinrich Heine University, Düsseldorf, Germany.

IUF - Leibniz Research Institute for Environmental Medicine, Düsseldorf, Germany.

出版信息

Sci Rep. 2023 Jan 17;13(1):937. doi: 10.1038/s41598-023-28172-4.

DOI:10.1038/s41598-023-28172-4
PMID:36650248
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9845231/
Abstract

Gene-environment (GxE) interactions are an important and sophisticated component in the manifestation of complex phenotypes. Simple univariate tests lack statistical power due to the need for multiple testing adjustment and not incorporating potential interplay between several genetic loci. Approaches based on internally constructed genetic risk scores (GRS) require the partitioning of the available sample into training and testing data sets, thus, lowering the effective sample size for testing the GxE interaction itself. To overcome these issues, we propose a statistical test that employs bagging (bootstrap aggregating) in the GRS construction step and utilizes its out-of-bag prediction mechanism. This approach has the key advantage that the full available data set can be used for both constructing the GRS and testing the GxE interaction. To also incorporate interactions between genetic loci, we, furthermore, investigate if using random forests as the GRS construction method in GxE interaction testing further increases the statistical power. In a simulation study, we show that both novel procedures lead to a higher statistical power for detecting GxE interactions, while still controlling the type I error. The random-forests-based test outperforms a bagging-based test that uses the elastic net as its base learner in most scenarios. An application of the testing procedures to a real data set from a German cohort study suggests that there might be a GxE interaction involving exposure to air pollution regarding rheumatoid arthritis.

摘要

基因-环境(GxE)相互作用是复杂表型表现的一个重要而复杂的组成部分。由于需要进行多次测试调整,并且没有纳入几个遗传位点之间的潜在相互作用,简单的单变量测试缺乏统计学效力。基于内部构建的遗传风险评分(GRS)的方法需要将可用样本划分为训练和测试数据集,从而降低了测试 GxE 相互作用本身的有效样本量。为了克服这些问题,我们提出了一种统计检验方法,该方法在 GRS 构建步骤中使用装袋(自举聚合),并利用其袋外预测机制。这种方法的主要优点是可以使用整个可用数据集来构建 GRS 和测试 GxE 相互作用。为了进一步纳入遗传位点之间的相互作用,我们还研究了在 GxE 相互作用测试中使用随机森林作为 GRS 构建方法是否会进一步提高统计效力。在一项模拟研究中,我们表明,这两种新方法都可以提高检测 GxE 相互作用的统计效力,同时仍能控制第一类错误。在大多数情况下,基于随机森林的检验优于使用弹性网络作为基础学习者的基于装袋的检验。对来自德国队列研究的真实数据集的测试程序的应用表明,在暴露于空气污染与类风湿关节炎之间可能存在 GxE 相互作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/e71f6825ba73/41598_2023_28172_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/60ac03d49b63/41598_2023_28172_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/f5f323cba347/41598_2023_28172_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/942b78945027/41598_2023_28172_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/157d88bb32da/41598_2023_28172_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/efa73358c5fd/41598_2023_28172_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/e71f6825ba73/41598_2023_28172_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/60ac03d49b63/41598_2023_28172_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/f5f323cba347/41598_2023_28172_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/942b78945027/41598_2023_28172_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/157d88bb32da/41598_2023_28172_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/efa73358c5fd/41598_2023_28172_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/595b/9845231/e71f6825ba73/41598_2023_28172_Fig6_HTML.jpg

相似文献

1
Efficient gene-environment interaction testing through bootstrap aggregating.通过自举聚合进行有效的基因-环境交互作用检验。
Sci Rep. 2023 Jan 17;13(1):937. doi: 10.1038/s41598-023-28172-4.
2
Comparison of weighting approaches for genetic risk scores in gene-environment interaction studies.基因-环境相互作用研究中遗传风险评分加权方法的比较
BMC Genet. 2017 Dec 16;18(1):115. doi: 10.1186/s12863-017-0586-3.
3
Detection of gene-environment interactions in the presence of linkage disequilibrium and noise by using genetic risk scores with internal weights from elastic net regression.通过使用具有弹性网络回归内部权重的遗传风险评分,在连锁不平衡和噪声存在的情况下检测基因-环境相互作用。
BMC Genet. 2017 Jun 12;18(1):55. doi: 10.1186/s12863-017-0519-1.
4
Evaluation of tree-based statistical learning methods for constructing genetic risk scores.基于树的统计学习方法构建遗传风险评分的评估。
BMC Bioinformatics. 2022 Mar 21;23(1):97. doi: 10.1186/s12859-022-04634-w.
5
Testing gene-environment interactions in the presence of confounders and mismeasured environmental exposures.检测混杂因素和环境暴露测量误差存在情况下的基因-环境交互作用。
G3 (Bethesda). 2021 Sep 27;11(10). doi: 10.1093/g3journal/jkab236.
6
Mitigating type 1 error inflation and power loss in GxE PRS: Genotype-environment interaction in polygenic risk score models.缓解 GxE PRS 中的 1 型错误膨胀和效力损失:多基因风险评分模型中的基因型-环境交互作用。
Genet Epidemiol. 2024 Mar;48(2):85-100. doi: 10.1002/gepi.22546. Epub 2024 Feb 1.
7
Smooth-threshold multivariate genetic prediction incorporating gene-environment interactions.纳入基因-环境相互作用的平滑阈值多元遗传预测。
G3 (Bethesda). 2021 Dec 8;11(12). doi: 10.1093/g3journal/jkab278.
8
A unified powerful set-based test for sequencing data analysis of GxE interactions.一种用于基因与环境相互作用测序数据分析的统一的基于强大集的检验。
Biostatistics. 2017 Jan;18(1):119-131. doi: 10.1093/biostatistics/kxw034. Epub 2016 Jul 28.
9
A robust and adaptive framework for interaction testing in quantitative traits between multiple genetic loci and exposure variables.用于多遗传位点与暴露变量间数量性状相互作用检测的稳健自适应框架。
PLoS Genet. 2022 Nov 16;18(11):e1010464. doi: 10.1371/journal.pgen.1010464. eCollection 2022 Nov.
10
Complete effect-profile assessment in association studies with multiple genetic and multiple environmental factors.在涉及多个遗传因素和多个环境因素的关联研究中进行完整的效应概况评估。
Genet Epidemiol. 2015 Feb;39(2):122-33. doi: 10.1002/gepi.21877. Epub 2014 Dec 23.

引用本文的文献

1
Recent advances in research on gene polymorphisms in Kawasaki disease.川崎病基因多态性的研究新进展
World J Clin Pediatr. 2025 Sep 9;14(3):106693. doi: 10.5409/wjcp.v14.i3.106693.
2
Parity modifies the effect of genetic variants associated with gestational duration and birth weight.产次会改变与妊娠期和出生体重相关的基因变异的影响。
medRxiv. 2025 Jun 17:2025.06.17.25329777. doi: 10.1101/2025.06.17.25329777.
3
Associations of Perceived Neighborhood Factors and Alzheimer's Disease Polygenic Score with Cognition: Evidence from the Health and Retirement Study.

本文引用的文献

1
Evaluation of tree-based statistical learning methods for constructing genetic risk scores.基于树的统计学习方法构建遗传风险评分的评估。
BMC Bioinformatics. 2022 Mar 21;23(1):97. doi: 10.1186/s12859-022-04634-w.
2
A two-step approach to testing overall effect of gene-environment interaction for multiple phenotypes.一种用于检验基因-环境相互作用对多种表型总体效应的两步法。
Bioinformatics. 2021 Apr 5;36(24):5640-5648. doi: 10.1093/bioinformatics/btaa1083.
3
From Basic Science to Clinical Application of Polygenic Risk Scores: A Primer.
感知邻里因素和阿尔茨海默病多基因评分与认知的关联:来自健康与退休研究的证据。
medRxiv. 2025 Mar 15:2025.03.14.25324002. doi: 10.1101/2025.03.14.25324002.
4
Gene-environment interactions in human health.人类健康中的基因-环境相互作用。
Nat Rev Genet. 2024 Nov;25(11):768-784. doi: 10.1038/s41576-024-00731-z. Epub 2024 May 28.
从多基因风险评分的基础科学到临床应用:入门指南。
JAMA Psychiatry. 2021 Jan 1;78(1):101-109. doi: 10.1001/jamapsychiatry.2020.3049.
4
Tutorial: a guide to performing polygenic risk score analyses.教程:多基因风险评分分析操作指南。
Nat Protoc. 2020 Sep;15(9):2759-2772. doi: 10.1038/s41596-020-0353-1. Epub 2020 Jul 24.
5
Using Genetic Risk Score Approaches to Infer Whether an Environmental Factor Attenuates or Exacerbates the Adverse Influence of a Candidate Gene.使用遗传风险评分方法推断环境因素是减轻还是加剧候选基因的不利影响。
Front Genet. 2020 May 8;11:331. doi: 10.3389/fgene.2020.00331. eCollection 2020.
6
Efficient Implementation of Penalized Regression for Genetic Risk Prediction.高效实现基于惩罚回归的遗传风险预测。
Genetics. 2019 May;212(1):65-74. doi: 10.1534/genetics.119.302019. Epub 2019 Feb 26.
7
Genome-Wide Gene-Environment Interaction Analysis Using Set-Based Association Tests.使用基于集合的关联测试进行全基因组基因-环境相互作用分析。
Front Genet. 2019 Jan 14;9:715. doi: 10.3389/fgene.2018.00715. eCollection 2018.
8
Polygenic Risk Scores for Prediction of Breast Cancer and Breast Cancer Subtypes.多基因风险评分在乳腺癌及乳腺癌亚型预测中的应用。
Am J Hum Genet. 2019 Jan 3;104(1):21-34. doi: 10.1016/j.ajhg.2018.11.002. Epub 2018 Dec 13.
9
An atlas of genetic associations in UK Biobank.英国生物银行中的遗传关联图谱
Nat Genet. 2018 Nov;50(11):1593-1599. doi: 10.1038/s41588-018-0248-z. Epub 2018 Oct 22.
10
Nonatopic eczema in elderly women: Effect of air pollution and genes.老年女性非特应性湿疹:空气污染和基因的影响。
J Allergy Clin Immunol. 2019 Jan;143(1):378-385.e9. doi: 10.1016/j.jaci.2018.09.031. Epub 2018 Oct 16.