• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于基因组关系矩阵的全基因组关联研究:小麦和拟南芥的案例分析

Genome-Wide Association Studies with a Genomic Relationship Matrix: A Case Study with Wheat and Arabidopsis.

作者信息

Gianola Daniel, Fariello Maria I, Naya Hugo, Schön Chris-Carolin

机构信息

Department of Animal Sciences, University of Wisconsin-Madison, Wisconsin 53706 Department of Dairy Science, University of Wisconsin-Madison, Wisconsin 53706 Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Wisconsin 53706 Technical University of Munich School of Life Sciences Weihenstephan, Technical University of Munich, D-85354 Freising, Germany Institute for Advanced Study, Technical University of Munich, D-85748 Garching, Germany Bioinformatics Unit, Institut Pasteur de Montevideo, 11400, Uruguay

Bioinformatics Unit, Institut Pasteur de Montevideo, 11400, Uruguay Instituto de Matemática y Estadística Rafael Laguardia, Facultad de Ingeniería, Universidad de la República, 11300 Montevideo, Uruguay.

出版信息

G3 (Bethesda). 2016 Oct 13;6(10):3241-3256. doi: 10.1534/g3.116.034256.

DOI:10.1534/g3.116.034256
PMID:27520956
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5068945/
Abstract

Standard genome-wide association studies (GWAS) scan for relationships between each of p molecular markers and a continuously distributed target trait. Typically, a marker-based matrix of genomic similarities among individuals ( G: ) is constructed, to account more properly for the covariance structure in the linear regression model used. We show that the generalized least-squares estimator of the regression of phenotype on one or on m markers is invariant with respect to whether or not the marker(s) tested is(are) used for building G,: provided variance components are unaffected by exclusion of such marker(s) from G: The result is arrived at by using a matrix expression such that one can find many inverses of genomic relationship, or of phenotypic covariance matrices, stemming from removing markers tested as fixed, but carrying out a single inversion. When eigenvectors of the genomic relationship matrix are used as regressors with fixed regression coefficients, e.g., to account for population stratification, their removal from G: does matter. Removal of eigenvectors from G: can have a noticeable effect on estimates of genomic and residual variances, so caution is needed. Concepts were illustrated using genomic data on 599 wheat inbred lines, with grain yield as target trait, and on close to 200 Arabidopsis thaliana accessions.

摘要

标准的全基因组关联研究(GWAS)扫描p个分子标记中的每一个与连续分布的目标性状之间的关系。通常,构建个体间基于标记的基因组相似性矩阵(G:),以便更恰当地考虑所用线性回归模型中的协方差结构。我们表明,表型对一个或m个标记的回归的广义最小二乘估计量对于所测试的标记是否用于构建G:是不变的:前提是方差分量不受从G:中排除此类标记的影响。该结果是通过使用矩阵表达式得出的,这样一来,人们可以找到许多源于将测试标记视为固定但仅进行一次求逆的基因组关系或表型协方差矩阵的逆矩阵。当基因组关系矩阵的特征向量用作具有固定回归系数的回归变量时,例如为了考虑群体分层,将它们从G:中移除确实会产生影响。从G:中移除特征向量可能会对基因组方差和残差方差的估计产生显著影响,因此需要谨慎。使用了关于599个小麦自交系(以籽粒产量为目标性状)和近200个拟南芥种质的基因组数据来说明这些概念。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/a301fd998385/3241f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/f0336bffb39b/3241f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/df0ca43f463c/3241f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/9f38e39982ea/3241f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/89277792295c/3241f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/326105ccf794/3241f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/b57b7389bf72/3241f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/3c20acfbdefb/3241f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/ea963316cbd3/3241f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/063b6d6e8943/3241f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/17a7ff75ec15/3241f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/a301fd998385/3241f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/f0336bffb39b/3241f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/df0ca43f463c/3241f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/9f38e39982ea/3241f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/89277792295c/3241f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/326105ccf794/3241f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/b57b7389bf72/3241f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/3c20acfbdefb/3241f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/ea963316cbd3/3241f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/063b6d6e8943/3241f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/17a7ff75ec15/3241f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08bd/5068945/a301fd998385/3241f11.jpg

相似文献

1
Genome-Wide Association Studies with a Genomic Relationship Matrix: A Case Study with Wheat and Arabidopsis.基于基因组关系矩阵的全基因组关联研究:小麦和拟南芥的案例分析
G3 (Bethesda). 2016 Oct 13;6(10):3241-3256. doi: 10.1534/g3.116.034256.
2
Accounting for trait architecture in genomic predictions of US Holstein cattle using a weighted realized relationship matrix.利用加权实现关系矩阵对美国荷斯坦奶牛的基因组预测进行性状结构分析。
Genet Sel Evol. 2015 Apr 2;47(1):24. doi: 10.1186/s12711-015-0100-1.
3
Marker-based estimation of heritability in immortal populations.基于标记的永生群体遗传力估计
Genetics. 2015 Feb;199(2):379-98. doi: 10.1534/genetics.114.167916. Epub 2014 Dec 19.
4
Genomic variance estimates: With or without disequilibrium covariances?基因组方差估计:是否考虑不平衡协方差?
J Anim Breed Genet. 2017 Jun;134(3):232-241. doi: 10.1111/jbg.12268.
5
Inferring trait-specific similarity among individuals from molecular markers and phenotypes with Bayesian regression.基于贝叶斯回归推断个体的分子标记和表型特征之间的特异性相似性。
Theor Popul Biol. 2020 Apr;132:47-59. doi: 10.1016/j.tpb.2019.11.008. Epub 2019 Dec 9.
6
Genome-wide estimation of heritability and its functional components for flowering, defense, ionomics, and developmental traits in a geographically diverse population of Arabidopsis thaliana.对地理分布多样的拟南芥群体中开花、防御、离子组学和发育性状的遗传力及其功能成分进行全基因组估计。
Genome. 2017 Jul;60(7):572-580. doi: 10.1139/gen-2016-0213. Epub 2017 Mar 17.
7
Genome-Wide Association Mapping and Genomic Prediction Elucidate the Genetic Architecture of Morphological Traits in Arabidopsis.全基因组关联图谱绘制与基因组预测揭示拟南芥形态性状的遗传结构
Plant Physiol. 2016 Apr;170(4):2187-203. doi: 10.1104/pp.15.00997. Epub 2016 Feb 11.
8
Comparative analysis of the GBLUP, emBayesB, and GWAS algorithms to predict genetic values in large yellow croaker (Larimichthys crocea).GBLUP、emBayesB和GWAS算法对大黄鱼(Larimichthys crocea)遗传值预测的比较分析
BMC Genomics. 2016 Jun 14;17:460. doi: 10.1186/s12864-016-2756-5.
9
Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction.在基因组预测中不进行交叉验证的交叉验证
G3 (Bethesda). 2016 Oct 13;6(10):3107-3128. doi: 10.1534/g3.116.033381.
10
Regulatory network construction in Arabidopsis by using genome-wide gene expression quantitative trait loci.利用全基因组基因表达数量性状位点构建拟南芥调控网络
Proc Natl Acad Sci U S A. 2007 Jan 30;104(5):1708-13. doi: 10.1073/pnas.0610429104. Epub 2007 Jan 19.

引用本文的文献

1
Modeling within and between Sub-Genomes Epistasis of Synthetic Hexaploid Wheat for Genome-Enabled Prediction of Diseases.亚基因组间和亚基因组内的上位性模型构建:用于疾病的基于基因组的预测的合成六倍体小麦。
Genes (Basel). 2024 Feb 20;15(3):262. doi: 10.3390/genes15030262.
2
A novel rare variants association test for binary traits in family-based designs via copulas.基于 Copula 的家系设计中二元性状的新型罕见变异关联检验
Stat Methods Med Res. 2023 Nov;32(11):2096-2122. doi: 10.1177/09622802231197977. Epub 2023 Oct 13.
3
Inference about quantitative traits under selection: a Bayesian revisitation for the post-genomic era.

本文引用的文献

1
Why significant variables aren't automatically good predictors.为什么显著变量并非自动成为良好的预测指标。
Proc Natl Acad Sci U S A. 2015 Nov 10;112(45):13892-7. doi: 10.1073/pnas.1518285112. Epub 2015 Oct 26.
2
Comparing estimates of genetic variance across different relationship models.比较不同亲缘关系模型下的遗传方差估计值。
Theor Popul Biol. 2016 Feb;107:26-30. doi: 10.1016/j.tpb.2015.08.005. Epub 2015 Sep 2.
3
Genomic heritability: what is it?基因组遗传力:它是什么?
选择下数量性状的推断:后基因组时代的贝叶斯再探讨。
Genet Sel Evol. 2022 Dec 2;54(1):78. doi: 10.1186/s12711-022-00765-z.
4
Gene-Based Association Testing of Dichotomous Traits With Generalized Functional Linear Mixed Models Using Extended Pedigrees: Applications to Age-Related Macular Degeneration.使用扩展家系的广义功能线性混合模型对二分性状进行基于基因的关联测试:年龄相关性黄斑变性的应用。
J Am Stat Assoc. 2021;116(534):531-545. doi: 10.1080/01621459.2020.1799809. Epub 2020 Jul 28.
5
Genome-Wide Association Studies of Mineral Content in Common Bean.普通菜豆矿物质含量的全基因组关联研究
Front Plant Sci. 2021 Mar 5;12:636484. doi: 10.3389/fpls.2021.636484. eCollection 2021.
6
Improved Prediction of Bacterial Genotype-Phenotype Associations Using Interpretable Pangenome-Spanning Regressions.利用可解释的泛基因组跨越回归提高细菌基因型-表型关联的预测能力。
mBio. 2020 Jul 7;11(4):e01344-20. doi: 10.1128/mBio.01344-20.
7
Fast Algorithms for Conducting Large-Scale GWAS of Age-at-Onset Traits Using Cox Mixed-Effects Models.基于 Cox 混合效应模型的大规模发病年龄性状全基因组关联分析的快速算法。
Genetics. 2020 May;215(1):41-58. doi: 10.1534/genetics.119.302940. Epub 2020 Mar 4.
8
Association analysis of loci implied in "buffering" epistasis.“缓冲”上位性相关位点的关联分析。
J Anim Sci. 2020 Mar 1;98(3). doi: 10.1093/jas/skaa045.
9
A Multiple-Trait Bayesian Lasso for Genome-Enabled Analysis and Prediction of Complex Traits.用于基于基因组的复杂性状分析与预测的多性状贝叶斯套索法
Genetics. 2020 Feb;214(2):305-331. doi: 10.1534/genetics.119.302934. Epub 2019 Dec 26.
10
Genome-Wide Association Study for Milk Fatty Acids in Holstein Cattle Accounting for the Gene Effect.考虑基因效应的荷斯坦奶牛乳脂肪酸全基因组关联研究
Animals (Basel). 2019 Nov 19;9(11):997. doi: 10.3390/ani9110997.
PLoS Genet. 2015 May 5;11(5):e1005048. doi: 10.1371/journal.pgen.1005048. eCollection 2015 May.
4
From association to prediction: statistical methods for the dissection and selection of complex traits in plants.从关联到预测:植物复杂性状解析与选择的统计方法
Curr Opin Plant Biol. 2015 Apr;24:110-8. doi: 10.1016/j.pbi.2015.02.010. Epub 2015 Mar 17.
5
Genome-wide regression and prediction with the BGLR statistical package.使用BGLR统计软件包进行全基因组回归与预测。
Genetics. 2014 Oct;198(2):483-95. doi: 10.1534/genetics.114.164442. Epub 2014 Jul 9.
6
Recovering power in association mapping panels with variable levels of linkage disequilibrium.在具有不同连锁不平衡水平的关联作图群体中恢复效能。
Genetics. 2014 May;197(1):375-87. doi: 10.1534/genetics.113.159731. Epub 2014 Feb 14.
7
Genome-wide prediction of traits with different genetic architecture through efficient variable selection.通过有效的变量选择对具有不同遗传结构的性状进行全基因组预测。
Genetics. 2013 Oct;195(2):573-87. doi: 10.1534/genetics.113.150078. Epub 2013 Aug 9.
8
Priors in whole-genome regression: the bayesian alphabet returns.全基因组回归中的先验信息:贝叶斯字母表回归。
Genetics. 2013 Jul;194(3):573-96. doi: 10.1534/genetics.113.151753. Epub 2013 May 1.
9
Contribution of an additive locus to genetic variance when inheritance is multi-factorial with implications on interpretation of GWAS.当遗传是多因子的时,加性位点对遗传方差的贡献及其对 GWAS 解释的影响。
Theor Appl Genet. 2013 Jun;126(6):1457-72. doi: 10.1007/s00122-013-2064-2. Epub 2013 Mar 19.
10
Statistical distributions of test statistics used for quantitative trait association mapping in structured populations.结构群体中用于数量性状关联作图的检验统计量的统计分布。
Genet Sel Evol. 2012 Nov 12;44(1):32. doi: 10.1186/1297-9686-44-32.