Performance of model-based multifactor dimensionality reduction methods for epistasis detection by controlling population structure.

Authors

Abegaz Fentaw, Van Lishout François, Mahachie John Jestinah M, Chaichoompu Kridsadakorn, Bhardwaj Archana, Duroux Diane, Gusareva Elena S, Wei Zhi, Hakonarson Hakon, Van Steen Kristel

Affiliations

GIGA-R, Medical Genomics - BIO3, University of Liège, Liège, Belgium.

Department of Computer Science, New Jersey Institute of Technology, Newark, NJ, USA.

Publication

BioData Min. 2021 Feb 19;14(1):16. doi: 10.1186/s13040-021-00247-w.

DOI: 10.1186/s13040-021-00247-w
PMID: 33608043
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7893746/

Abstract

BACKGROUND

In genome-wide association studies, the extent and impact of confounding due to population structure are well recognized. Inadequate handling of such confounding is likely to produce spurious associations, hampering replication and the identification of causal variants. Several strategies have been developed to protect associations against confounding, the most popular of which is based on Principal Component Analysis. In contrast, the extent and impact of confounding due to population structure in gene-gene interaction (epistasis) association studies are much less investigated and understood. In particular, the role of nonlinear genetic population substructure in epistasis detection is largely under-investigated, especially outside a regression framework.
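To make the Principal Component Analysis idea concrete, here is a minimal sketch using only the Python standard library. It assumes a quantitative trait, a single leading principal component found by power iteration, and a plain linear regression. The toy genotypes, the function names `top_pc_scores` and `residualize`, and all parameter values are illustrative assumptions and are not taken from the paper.

```python
import random

def top_pc_scores(genotypes, iters=50):
    """Scores on the leading principal component, found by power iteration."""
    n, m = len(genotypes), len(genotypes[0])
    means = [sum(row[j] for row in genotypes) / n for j in range(m)]
    X = [[row[j] - means[j] for j in range(m)] for row in genotypes]  # center each SNP
    v = [1.0] * m  # arbitrary non-zero start vector
    for _ in range(iters):
        s = [sum(x[j] * v[j] for j in range(m)) for x in X]            # s = X v
        w = [sum(X[i][j] * s[i] for i in range(n)) for j in range(m)]  # w = X^T s
        norm = sum(c * c for c in w) ** 0.5
        v = [c / norm for c in w]
    return [sum(x[j] * v[j] for j in range(m)) for x in X]

def residualize(y, pc):
    """Residuals of y after simple linear regression on one PC score."""
    n = len(y)
    my, mp = sum(y) / n, sum(pc) / n
    beta = (sum((p - mp) * (t - my) for p, t in zip(pc, y))
            / sum((p - mp) ** 2 for p in pc))
    return [(t - my) - beta * (p - mp) for p, t in zip(pc, y)]

# Hypothetical toy data: two subpopulations that differ in allele
# frequency (0.2 vs 0.8) and in trait mean, with no causal SNP at all.
rng = random.Random(1)
pop = [0] * 40 + [1] * 40
freq = {0: 0.2, 1: 0.8}
genotypes = [[(rng.random() < freq[k]) + (rng.random() < freq[k]) for _ in range(30)]
             for k in pop]
y = [2.0 * k + rng.gauss(0.0, 1.0) for k in pop]

def group_gap(values):
    """Difference in mean between subpopulation 1 and subpopulation 0."""
    a = [v for v, k in zip(values, pop) if k == 1]
    b = [v for v, k in zip(values, pop) if k == 0]
    return sum(a) / len(a) - sum(b) / len(b)

resid = residualize(y, top_pc_scores(genotypes))
before, after = group_gap(y), group_gap(resid)
print(f"trait gap between subpopulations: raw {before:.2f}, PC-adjusted {after:.2f}")
```

In this toy setting the leading component mostly tracks subpopulation membership, so regressing it out removes most of the trait difference that would otherwise show up as spurious association. Real analyses typically use several components and dedicated linear-algebra libraries.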

METHODS

To identify causal variants acting in synergy and to improve the interpretability and replicability of epistasis results, we introduce three strategies based on a model-based multifactor dimensionality reduction approach for structured populations: MBMDR-PC, MBMDR-PG, and MBMDR-GC.
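The abstract does not spell out the three suffixes. Assuming that GC refers to genomic control, the generic form of that correction can be sketched as follows: estimate an inflation factor lambda as the ratio of the observed median test statistic to the median expected under the null, then divide every statistic by it. The sketch assumes 1-degree-of-freedom chi-square statistics and uses made-up values. It is not the MBMDR-GC implementation, whose test statistics may follow a different null distribution.

```python
from statistics import median

CHI2_1DF_MEDIAN = 0.4549364  # median of a chi-square distribution with 1 degree of freedom

def genomic_control(chi2_stats):
    """Return (lambda, deflated statistics); lambda is floored at 1 so nothing is inflated."""
    lam = max(1.0, median(chi2_stats) / CHI2_1DF_MEDIAN)
    return lam, [s / lam for s in chi2_stats]

observed = [0.2, 0.5, 0.9098, 1.8, 6.0]  # made-up 1-df test statistics
lam, adjusted = genomic_control(observed)
print(f"lambda = {lam:.3f}; largest adjusted statistic = {max(adjusted):.3f}")
```

Because a single factor rescales all statistics uniformly, this kind of correction cannot distinguish markers that are strongly affected by structure from those that are not.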

RESULTS

Simulation results comparing the performance of various approaches show that, in the presence of population structure, MBMDR-PC and MBMDR-PG control the type I error rate at the nominal level consistently better than MBMDR-GC. Moreover, all three proposed methods of population structure correction outperform MDR-SP in terms of statistical power.
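As a rough illustration of how such simulation summaries are usually computed, the sketch below estimates a rejection rate from per-replicate p-values. Applied to replicates simulated without any true effect it gives the empirical type I error rate, and applied to replicates with a true interaction it gives the empirical power. The function name `rejection_rate` and the p-values are synthetic stand-ins, not results from the paper.

```python
def rejection_rate(p_values, alpha=0.05):
    """Fraction of replicates rejected at level alpha, with a binomial standard error."""
    n = len(p_values)
    rate = sum(p < alpha for p in p_values) / n
    se = (rate * (1.0 - rate) / n) ** 0.5
    return rate, se

# Synthetic p-values from 1000 hypothetical null replicates.
calibrated = [(i + 0.5) / 1000 for i in range(1000)]  # evenly spread over (0, 1)
inflated = [p ** 2 for p in calibrated]               # skewed toward small values

rate_ok, se_ok = rejection_rate(calibrated)
rate_bad, se_bad = rejection_rate(inflated)
print(f"calibrated test: {rate_ok:.3f} (SE {se_ok:.3f})")
print(f"inflated test:   {rate_bad:.3f} (SE {se_bad:.3f})")
```

A method controls the type I error rate at the nominal level when its null rejection rate stays within sampling error of alpha. Power comparisons are only meaningful between methods that pass this check.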

CONCLUSION

We demonstrate through extensive simulation studies the effect of various degrees of genetic population structure and relatedness on epistasis detection and propose appropriate remedial measures based on linear and nonlinear sample genetic similarity.

Similar articles

1. Performance of model-based multifactor dimensionality reduction methods for epistasis detection by controlling population structure. BioData Min. 2021 Feb 19;14(1):16. doi: 10.1186/s13040-021-00247-w.
2. Confounding of linkage disequilibrium patterns in large scale DNA based gene-gene interaction studies. BioData Min. 2019 Jun 10;12:11. doi: 10.1186/s13040-019-0199-7. eCollection 2019.
3. A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection. BioData Min. 2013 Apr 25;6(1):9. doi: 10.1186/1756-0381-6-9.
4. mbmdr: an R package for exploring gene-gene interactions associated with binary or quantitative traits. Bioinformatics. 2010 Sep 1;26(17):2198-9. doi: 10.1093/bioinformatics/btq352. Epub 2010 Jul 1.
5. A unified model based multifactor dimensionality reduction framework for detecting gene-gene interactions. Bioinformatics. 2016 Sep 1;32(17):i605-i610. doi: 10.1093/bioinformatics/btw424.
6. An efficient algorithm to perform multiple testing in epistasis screening. BMC Bioinformatics. 2013 Apr 24;14:138. doi: 10.1186/1471-2105-14-138.
7. FAM-MDR: a flexible family-based multifactor dimensionality reduction technique to detect epistasis using related individuals. PLoS One. 2010 Apr 22;5(4):e10304. doi: 10.1371/journal.pone.0010304.
8. [Detecting gene-gene/environment interactions by model-based multifactor dimensionality reduction]. Zhonghua Liu Xing Bing Xue Za Zhi. 2015 Nov;36(11):1305-10.
9. TrioMDR: Detecting SNP interactions in trio families with model-based multifactor dimensionality reduction. Genomics. 2019 Sep;111(5):1176-1182. doi: 10.1016/j.ygeno.2018.07.014. Epub 2018 Jul 25.
10. gammaMAXT: a fast multiple-testing correction algorithm. BioData Min. 2015 Nov 20;8:36. doi: 10.1186/s13040-015-0069-x. eCollection 2015.

Cited by

1. Genome-wide epistasis analysis reveals significant epistatic signals associated with Parkinson's disease risk. Brain. 2025 Jun 3;148(6):2060-2074. doi: 10.1093/brain/awae398.
2. Considerations in the search for epistasis. Genome Biol. 2024 Nov 19;25(1):296. doi: 10.1186/s13059-024-03427-z.
3. * and DOCK1* gene-gene interactions associated with rheumatoid arthritis in the focal adhesion pathway.
