• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于全基因组关联研究中样本结构的方差成分模型。

Variance component model to account for sample structure in genome-wide association studies.

机构信息

Center for Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

出版信息

Nat Genet. 2010 Apr;42(4):348-54. doi: 10.1038/ng.548. Epub 2010 Mar 7.

DOI:10.1038/ng.548
PMID:20208533
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3092069/
Abstract

Although genome-wide association studies (GWASs) have identified numerous loci associated with complex traits, imprecise modeling of the genetic relatedness within study samples may cause substantial inflation of test statistics and possibly spurious associations. Variance component approaches, such as efficient mixed-model association (EMMA), can correct for a wide range of sample structures by explicitly accounting for pairwise relatedness between individuals, using high-density markers to model the phenotype distribution; but such approaches are computationally impractical. We report here a variance component approach implemented in publicly available software, EMMA eXpedited (EMMAX), that reduces the computational time for analyzing large GWAS data sets from years to hours. We apply this method to two human GWAS data sets, performing association analysis for ten quantitative traits from the Northern Finland Birth Cohort and seven common diseases from the Wellcome Trust Case Control Consortium. We find that EMMAX outperforms both principal component analysis and genomic control in correcting for sample structure.

摘要

虽然全基因组关联研究 (GWAS) 已经确定了许多与复杂性状相关的基因座,但研究样本中遗传相关性的不精确建模可能导致检验统计量的大量膨胀,并可能产生虚假关联。方差成分方法,如有效混合模型关联 (EMMA),可以通过显式考虑个体之间的成对相关性,使用高密度标记来模拟表型分布,从而纠正广泛的样本结构;但这种方法在计算上是不切实际的。我们在这里报告了一种方差成分方法,该方法在可公开获取的软件 EMMA eXpedited (EMMAX) 中实现,该方法将分析大型 GWAS 数据集的计算时间从数年缩短到数小时。我们将该方法应用于两个人类 GWAS 数据集,对来自芬兰北部出生队列的十个定量性状和来自威康信托基金会病例对照联合会的七个常见疾病进行关联分析。我们发现,EMMAX 在纠正样本结构方面优于主成分分析和基因组控制。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bd0/3092069/ad6b4f74f639/nihms196426f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bd0/3092069/097878f5299b/nihms196426f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bd0/3092069/b5f05d89cd68/nihms196426f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bd0/3092069/02de6874dc3b/nihms196426f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bd0/3092069/34b222ed55ca/nihms196426f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bd0/3092069/ad6b4f74f639/nihms196426f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bd0/3092069/097878f5299b/nihms196426f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bd0/3092069/b5f05d89cd68/nihms196426f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bd0/3092069/02de6874dc3b/nihms196426f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bd0/3092069/34b222ed55ca/nihms196426f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bd0/3092069/ad6b4f74f639/nihms196426f5.jpg

相似文献

1
Variance component model to account for sample structure in genome-wide association studies.用于全基因组关联研究中样本结构的方差成分模型。
Nat Genet. 2010 Apr;42(4):348-54. doi: 10.1038/ng.548. Epub 2010 Mar 7.
2
Genome-wide efficient mixed-model analysis for association studies.全基因组高效混合模型关联分析。
Nat Genet. 2012 Jun 17;44(7):821-4. doi: 10.1038/ng.2310.
3
Novel probabilistic models of spatial genetic ancestry with applications to stratification correction in genome-wide association studies.用于全基因组关联研究分层校正的空间遗传血统新型概率模型。
Bioinformatics. 2017 Mar 15;33(6):879-885. doi: 10.1093/bioinformatics/btw720.
4
An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations.一种在结构群体中进行全基因组关联研究的高效多基因混合模型方法。
Nat Genet. 2012 Jun 17;44(7):825-30. doi: 10.1038/ng.2314.
5
A mixed model reduces spurious genetic associations produced by population stratification in genome-wide association studies.混合模型可减少全基因组关联研究中群体分层产生的虚假遗传关联。
Genomics. 2015 Apr;105(4):191-6. doi: 10.1016/j.ygeno.2015.01.006. Epub 2015 Jan 30.
6
Sparse principal component analysis for identifying ancestry-informative markers in genome-wide association studies.稀疏主成分分析在全基因组关联研究中识别与祖先相关的标记。
Genet Epidemiol. 2012 May;36(4):293-302. doi: 10.1002/gepi.21621. Epub 2012 Apr 16.
7
Linkage Disequilibrium and Evaluation of Genome-Wide Association Mapping Models in Tetraploid Potato.四倍体马铃薯的连锁不平衡及全基因组关联作图模型评估
G3 (Bethesda). 2018 Oct 3;8(10):3185-3202. doi: 10.1534/g3.118.200377.
8
Genome-wide association analysis of metabolic traits in a birth cohort from a founder population.对一个奠基者群体出生队列中的代谢性状进行全基因组关联分析。
Nat Genet. 2009 Jan;41(1):35-46. doi: 10.1038/ng.271. Epub 2008 Dec 7.
9
Genomic inflation factors under polygenic inheritance.多基因遗传下的基因组膨胀因子。
Eur J Hum Genet. 2011 Jul;19(7):807-12. doi: 10.1038/ejhg.2011.39. Epub 2011 Mar 16.
10
Accounting for Population Structure in Gene-by-Environment Interactions in Genome-Wide Association Studies Using Mixed Models.在全基因组关联研究中使用混合模型分析基因-环境相互作用中的群体结构
PLoS Genet. 2016 Mar 4;12(3):e1005849. doi: 10.1371/journal.pgen.1005849. eCollection 2016 Mar.

引用本文的文献

1
Investigating the role of transposable elements in shaping abdominal fat and egg production phenotypic traits in geese.研究转座元件在塑造鹅腹部脂肪和产蛋表型性状中的作用。
BMC Genomics. 2025 Sep 3;26(1):803. doi: 10.1186/s12864-025-11976-1.
2
Genetic variation in the FMO and GSTO gene clusters impacts arsenic metabolism in humans.FMO和GSTO基因簇中的遗传变异会影响人体的砷代谢。
PLoS Genet. 2025 Sep 2;21(9):e1011826. doi: 10.1371/journal.pgen.1011826. eCollection 2025 Sep.
3
Identification of new genomic loci for seed protein and oil content in the soybean pangenome using genome-wide association and haplotype analyses.

本文引用的文献

1
Finding the missing heritability of complex diseases.寻找复杂疾病中缺失的遗传力。
Nature. 2009 Oct 8;461(7265):747-53. doi: 10.1038/nature08494.
2
A kinship-based modification of the armitage trend test to address hidden population structure and small differential genotyping errors.一种基于亲属关系对阿米蒂奇趋势检验进行的修正,以解决隐藏的群体结构和微小的差异基因分型错误。
PLoS One. 2009 Jun 8;4(6):e5825. doi: 10.1371/journal.pone.0005825.
3
A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits.
利用全基因组关联分析和单倍型分析在大豆泛基因组中鉴定种子蛋白和油含量的新基因组位点。
Theor Appl Genet. 2025 Sep 1;138(9):237. doi: 10.1007/s00122-025-05020-9.
4
Transposon Dynamics Drive Genome Evolution and Regulate Genetic Mechanisms of Agronomic Traits in Cotton.转座子动态驱动棉花基因组进化并调控农艺性状的遗传机制
Plants (Basel). 2025 Aug 12;14(16):2509. doi: 10.3390/plants14162509.
5
Whole genome resequencing uncovers candidate genes related to plumage color in Yuexi frizzled feather chicken.全基因组重测序揭示了与岳西卷羽鸡羽色相关的候选基因。
Poult Sci. 2025 Aug 13;104(11):105680. doi: 10.1016/j.psj.2025.105680.
6
Genomic regions associated with bovine respiratory disease in pacific northwest Holstein cattle.与太平洋西北地区荷斯坦奶牛牛呼吸道疾病相关的基因组区域。
Front Vet Sci. 2025 Jul 31;12:1637087. doi: 10.3389/fvets.2025.1637087. eCollection 2025.
7
Pangenome analysis of transposable element insertion polymorphisms reveals features underlying cold tolerance in rice.转座元件插入多态性的泛基因组分析揭示了水稻耐寒性的潜在特征。
Nat Commun. 2025 Aug 16;16(1):7634. doi: 10.1038/s41467-025-62887-4.
8
A 25K Wheat SNP Array Revealed the Genetic Diversity and Population Structure of Durum Wheat ( subsp. ) Landraces and Cultivars.一个25K小麦单核苷酸多态性(SNP)芯片揭示了硬粒小麦(亚种)地方品种和栽培品种的遗传多样性及群体结构。
Int J Mol Sci. 2025 Jul 25;26(15):7220. doi: 10.3390/ijms26157220.
9
Association mapping and candidate gene identification for drought tolerance in sorghum.高粱耐旱性的关联作图与候选基因鉴定
Front Plant Sci. 2025 Jul 25;16:1629615. doi: 10.3389/fpls.2025.1629615. eCollection 2025.
10
Genome-wide association and linkage analysis of histidine-rich glycoprotein identifies common variants associated with plasma histidine-rich glycoprotein concentrations.富含组氨酸糖蛋白的全基因组关联和连锁分析确定了与血浆富含组氨酸糖蛋白浓度相关的常见变异。
Res Pract Thromb Haemost. 2025 Jun 24;9(5):102955. doi: 10.1016/j.rpth.2025.102955. eCollection 2025 Jul.
一项针对亚洲人群的大规模全基因组关联研究揭示了影响八个数量性状的遗传因素。
Nat Genet. 2009 May;41(5):527-34. doi: 10.1038/ng.357. Epub 2009 Apr 26.
4
Case-control association testing in the presence of unknown relationships.存在未知关系时的病例对照关联检验。
Genet Epidemiol. 2009 Dec;33(8):668-78. doi: 10.1002/gepi.20418.
5
A multistage genome-wide association study in breast cancer identifies two new risk alleles at 1p11.2 and 14q24.1 (RAD51L1).一项针对乳腺癌的多阶段全基因组关联研究在1p11.2和14q24.1(RAD51L1)发现了两个新的风险等位基因。
Nat Genet. 2009 May;41(5):579-84. doi: 10.1038/ng.353. Epub 2009 Mar 29.
6
Newly discovered breast cancer susceptibility loci on 3p24 and 17q23.2.3p24和17q23.2上新发现的乳腺癌易感基因座。
Nat Genet. 2009 May;41(5):585-90. doi: 10.1038/ng.354. Epub 2009 Mar 29.
7
Genome-wide association studies in an isolated founder population from the Pacific Island of Kosrae.对来自太平洋科斯雷岛的一个隔离奠基人群体进行全基因组关联研究。
PLoS Genet. 2009 Feb;5(2):e1000365. doi: 10.1371/journal.pgen.1000365. Epub 2009 Feb 6.
8
Genotype-based matching to correct for population stratification in large-scale case-control genetic association studies.在大规模病例对照基因关联研究中,基于基因型匹配以校正群体分层。
Genet Epidemiol. 2009 Sep;33(6):508-17. doi: 10.1002/gepi.20403.
9
The genome-wide patterns of variation expose significant substructure in a founder population.全基因组变异模式揭示了创始人群体中显著的亚结构。
Am J Hum Genet. 2008 Dec;83(6):787-94. doi: 10.1016/j.ajhg.2008.11.005.
10
Genome-wide association analysis of metabolic traits in a birth cohort from a founder population.对一个奠基者群体出生队列中的代谢性状进行全基因组关联分析。
Nat Genet. 2009 Jan;41(1):35-46. doi: 10.1038/ng.271. Epub 2008 Dec 7.