Suppr超能文献

一种整合表型和表达数据的多基因座关联分析方法揭示了多个与野生拟南芥开花时间变异相关的新关联。

A multilocus association analysis method integrating phenotype and expression data reveals multiple novel associations to flowering time variation in wild-collected Arabidopsis thaliana.

机构信息

Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden.

出版信息

Mol Ecol Resour. 2018 Jul;18(4):798-808. doi: 10.1111/1755-0998.12757. Epub 2018 Feb 19.

Abstract

The adaptation to a new habitat often results in a confounding between genomewide genotype and beneficial alleles. When the confounding is strong, or the allelic effects is weak, it is a major statistical challenge to detect the adaptive polymorphisms. We describe a novel approach to dissect polygenic traits in natural populations. First, candidate adaptive loci are identified by screening for loci directly associated with the adaptive trait or the expression of genes known to affect it. Then, a multilocus genetic architecture is inferred using a backward elimination association analysis across all candidate loci with an adaptive false discovery rate-based threshold. Effects of population stratification are controlled by accounting for genomic kinship in both steps of the analysis and also by simultaneously testing all candidate loci in the multilocus model. We illustrate the method by exploring the polygenic basis of an important adaptive trait, flowering time in Arabidopsis thaliana, using public data from the 1,001 genomes project. We revealed associations between 33 (29) loci and flowering time at 10 (16)°C in this collection of natural accessions, where standard genomewide association analysis methods detected five (3) loci. The 33 (29) loci explained approximately 55.1 (48.7)% of the total phenotypic variance of the respective traits. Our work illustrates how the genetic basis of highly polygenic adaptive traits in natural populations can be explored in much greater detail using new multilocus mapping approaches taking advantage of prior biological information, genome and transcriptome data.

摘要

适应新栖息地通常会导致全基因组基因型与有利等位基因之间发生混淆。当混淆很强,或者等位基因效应很弱时,检测适应性多态性是一个主要的统计挑战。我们描述了一种在自然种群中剖析多基因性状的新方法。首先,通过筛选与适应性性状或已知影响其表达的基因直接相关的基因座来识别候选适应性基因座。然后,使用基于自适应假发现率的关联分析在所有候选基因座上进行反向消除关联分析,以推断多基因遗传结构。通过在分析的两个步骤中考虑基因组亲缘关系,并在多基因模型中同时测试所有候选基因座,控制群体分层的影响。我们使用来自 1001 个基因组项目的公共数据来探索拟南芥开花时间这一重要适应性性状的多基因基础,来说明该方法。我们在该自然群体中发现了 33(29)个与 10(16)°C 开花时间相关的基因座(位点),而标准的全基因组关联分析方法仅检测到 5(3)个基因座(位点)。这 33(29)个基因座解释了各自性状总表型方差的约 55.1(48.7)%。我们的工作说明了如何利用新的多基因作图方法,利用先前的生物学信息、基因组和转录组数据,更详细地探索自然种群中高度多基因适应性性状的遗传基础。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验