• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于结构遗传数据的惩罚线性混合模型。

Penalized linear mixed models for structured genetic data.

机构信息

Department of Biostatistics, University of Iowa, Iowa City, Iowa, USA.

出版信息

Genet Epidemiol. 2021 Jul;45(5):427-444. doi: 10.1002/gepi.22384. Epub 2021 May 16.

DOI:10.1002/gepi.22384
PMID:33998038
Abstract

Many genetic studies that aim to identify genetic variants associated with complex phenotypes are subject to unobserved confounding factors arising from environmental heterogeneity. This poses a challenge to detecting associations of interest and is known to induce spurious associations when left unaccounted for. Penalized linear mixed models (LMMs) are an attractive method to correct for unobserved confounding. These methods correct for varying levels of relatedness and population structure by modeling it as a random effect with a covariance structure estimated from observed genetic data. Despite an extensive literature on penalized regression and LMMs separately, the two are rarely discussed together. The aim of this review is to do so while examining the statistical properties of penalized LMMs in the genetic association setting. Specifically, the ability of penalized LMMs to accurately estimate genetic effects in the presence of environmental confounding has not been well studied. To clarify the important yet subtle distinction between population structure and environmental heterogeneity, we present a detailed review of relevant concepts and methods. In addition, we evaluate the performance of penalized LMMs and competing methods in terms of estimation and selection accuracy in the presence of a number of confounding structures.

摘要

许多旨在识别与复杂表型相关的遗传变异的遗传研究都受到环境异质性引起的未观察到的混杂因素的影响。这给检测感兴趣的关联带来了挑战,并且已知在未被考虑时会产生虚假关联。惩罚线性混合模型 (LMM) 是一种纠正未观察到的混杂因素的有吸引力的方法。这些方法通过将其建模为具有协方差结构的随机效应来纠正不同程度的相关性和群体结构,该协方差结构是从观察到的遗传数据中估计得出的。尽管关于惩罚回归和 LMM 的文献很多,但很少将它们放在一起讨论。本综述的目的是在检查遗传关联环境中惩罚 LMM 的统计特性的同时做到这一点。具体来说,惩罚 LMM 在存在环境混杂的情况下准确估计遗传效应的能力尚未得到很好的研究。为了澄清群体结构和环境异质性之间重要而微妙的区别,我们详细回顾了相关概念和方法。此外,我们还评估了惩罚 LMM 和竞争方法在存在多种混杂结构时在估计和选择准确性方面的性能。

相似文献

1
Penalized linear mixed models for structured genetic data.基于结构遗传数据的惩罚线性混合模型。
Genet Epidemiol. 2021 Jul;45(5):427-444. doi: 10.1002/gepi.22384. Epub 2021 May 16.
2
Efficient penalized generalized linear mixed models for variable selection and genetic risk prediction in high-dimensional data.高效惩罚广义线性混合模型在高维数据中的变量选择和遗传风险预测。
Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad063.
3
Hybrid of Restricted and Penalized Maximum Likelihood Method for Efficient Genome-Wide Association Study.基于受限极大似然和惩罚极大似然法的高效全基因组关联研究混合方法
Genes (Basel). 2020 Oct 29;11(11):1286. doi: 10.3390/genes11111286.
4
Trade-offs of Linear Mixed Models in Genome-Wide Association Studies.全基因组关联研究中线性混合模型的权衡
J Comput Biol. 2022 Mar;29(3):233-242. doi: 10.1089/cmb.2021.0157. Epub 2022 Feb 25.
5
Principal component regression and linear mixed model in association analysis of structured samples: competitors or complements?结构化样本关联分析中的主成分回归与线性混合模型:竞争对手还是互补方法?
Genet Epidemiol. 2015 Mar;39(3):149-55. doi: 10.1002/gepi.21879. Epub 2014 Dec 23.
6
Control for Population Structure and Relatedness for Binary Traits in Genetic Association Studies via Logistic Mixed Models.通过逻辑混合模型在遗传关联研究中对二元性状的群体结构和相关性进行控制。
Am J Hum Genet. 2016 Apr 7;98(4):653-66. doi: 10.1016/j.ajhg.2016.02.012. Epub 2016 Mar 24.
7
Multilocus association testing with penalized regression.基于惩罚回归的多位点关联分析。
Genet Epidemiol. 2011 Dec;35(8):755-65. doi: 10.1002/gepi.20625. Epub 2011 Sep 15.
8
Performance and robustness of penalized and unpenalized methods for genetic prediction of complex human disease.惩罚和非惩罚方法在人类复杂疾病遗传预测中的性能和稳健性。
Genet Epidemiol. 2013 Feb;37(2):184-95. doi: 10.1002/gepi.21698. Epub 2012 Nov 30.
9
A penalized linear mixed model with generalized method of moments estimators for complex phenotype prediction.一种用于复杂表型预测的带有广义矩估计器的惩罚线性混合模型。
Bioinformatics. 2022 Nov 30;38(23):5222-5228. doi: 10.1093/bioinformatics/btac659.
10
Conditional random fields for fast, large-scale genome-wide association studies.条件随机场在快速、大规模全基因组关联研究中的应用。
PLoS One. 2011;6(7):e21591. doi: 10.1371/journal.pone.0021591. Epub 2011 Jul 12.

引用本文的文献

1
Familial Oculoauriculovertebral Spectrum: A Genomic Investigation of Autosomal Dominant Inheritance.家族性眼耳脊椎综合征:常染色体显性遗传的基因组研究
Cleft Palate Craniofac J. 2025 Jan 17:10556656241306202. doi: 10.1177/10556656241306202.
2
HighDimMixedModels.jl: Robust high-dimensional mixed-effects models across omics data.HighDimMixedModels.jl:跨组学数据的稳健高维混合效应模型。
PLoS Comput Biol. 2025 Jan 13;21(1):e1012143. doi: 10.1371/journal.pcbi.1012143. eCollection 2025 Jan.
3
Whole genome sequencing of a family with autosomal dominant features within the oculoauriculovertebral spectrum.
眼耳脊椎综合征范围内具有常染色体显性特征的一个家族的全基因组测序。
medRxiv. 2024 Jul 15:2024.02.07.24301824. doi: 10.1101/2024.02.07.24301824.
4
Alzheimer's Disease Protein Relevance Analysis Using Human and Mouse Model Proteomics Data.利用人类和小鼠模型蛋白质组学数据进行阿尔茨海默病蛋白质相关性分析
Front Syst Biol. 2023;3. doi: 10.3389/fsysb.2023.1085577. Epub 2023 Jul 13.
5
Efficient penalized generalized linear mixed models for variable selection and genetic risk prediction in high-dimensional data.高效惩罚广义线性混合模型在高维数据中的变量选择和遗传风险预测。
Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad063.