Linder Robert A, Majumder Arundhati, Chakraborty Mahul, Long Anthony
Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine, California 92697-2525
Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine, California 92697-2525.
Genetics. 2020 Jun;215(2):323-342. doi: 10.1534/genetics.120.303202. Epub 2020 Apr 2.
Advanced-generation multiparent populations (MPPs) are a valuable tool for dissecting complex traits, having more power than genome-wide association studies to detect rare variants and higher resolution than F linkage mapping. To extend the advantages of MPPs in budding yeast, we describe the creation and characterization of two outbred MPPs derived from 18 genetically diverse founding strains. We carried out assemblies of the genomes of the 18 founder strains, such that virtually all variation segregating between these strains is known, and represented those assemblies as Santa Cruz Genome Browser tracks. We discovered complex patterns of structural variation segregating among the founders, including a large deletion within the vacuolar ATPase , several different deletions within the osmosensor , a series of deletions and insertions at and the adjacent , as well as copy number variation at the dehydrogenase Resequenced haploid recombinant clones from the two MPPs have a median unrecombined block size of 66 kb, demonstrating that the population is highly recombined. We pool-sequenced the two MPPs to 3270× and 2226× coverage and demonstrated that we can accurately estimate local haplotype frequencies using pooled data. We further downsampled the pool-sequenced data to ∼20-40× and showed that local haplotype frequency estimates remained accurate, with median error rates 0.8 and 0.6% at 20× and 40×, respectively. Haplotypes frequencies are estimated much more accurately than SNP frequencies obtained directly from the same data. Deep sequencing of the two populations revealed that 10 or more founders are present at a detectable frequency for > 98% of the genome, validating the utility of this resource for the exploration of the role of standing variation in the architecture of complex traits.
先进世代的多亲本群体(MPPs)是剖析复杂性状的宝贵工具,在检测罕见变异方面比全基因组关联研究更具效力,在分辨率上比F连锁图谱更高。为了扩展MPPs在芽殖酵母中的优势,我们描述了从18个遗传多样的奠基菌株衍生而来的两个远交MPPs的创建和特征。我们对18个奠基菌株的基因组进行了组装,使得这些菌株之间几乎所有分离的变异都是已知的,并将这些组装表示为圣克鲁斯基因组浏览器轨迹。我们发现了奠基菌株之间分离的复杂结构变异模式,包括液泡ATP酶内的一个大缺失、渗透感受器内的几个不同缺失、在[具体位置1]及其相邻位置的一系列缺失和插入,以及脱氢酶处的拷贝数变异。对来自两个MPPs的单倍体重组克隆进行重测序,未重组块大小的中位数为66 kb,表明该群体具有高度重组性。我们对两个MPPs进行了池测序,覆盖度分别为3270×和2226×,并证明我们可以使用池数据准确估计局部单倍型频率。我们进一步将池测序数据下采样至约20 - 40×,结果表明局部单倍型频率估计仍然准确,在20×和40×时的中位数错误率分别为0.8%和0.6%。单倍型频率的估计比直接从同一数据获得的SNP频率准确得多。对这两个群体的深度测序表明,在可检测频率下,> 98%的基因组中存在10个或更多的奠基菌株,验证了该资源在探索复杂性状结构中现存变异作用方面的实用性。