利用混合样本进行亲子基因型填充以实现植物育种中经济高效的基因分型

Parent-progeny imputation from pooled samples for cost-efficient genotyping in plant breeding.

作者信息

Technow Frank, Gerke Justin

机构信息

Maize Product Development/Systems and Innovation for Breeding and Seed Products, DuPont Pioneer, Tavistock, Ontario, Canada.

Systems and Innovation for Breeding and Seed Products, DuPont Pioneer, Johnston, Iowa, United States of America.

出版信息

PLoS One. 2017 Dec 22;12(12):e0190271. doi: 10.1371/journal.pone.0190271. eCollection 2017.

DOI:10.1371/journal.pone.0190271

PMID:29272307

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5741258/

Abstract

The increased usage of whole-genome selection (WGS) and other molecular evaluation methods in plant breeding relies on the ability to genotype a very large number of untested individuals in each breeding cycle. Many plant breeding programs evaluate large biparental populations of homozygous individuals derived from homozygous parent inbred lines. This structure lends itself to parent-progeny imputation, which transfers the genotype scores of the parents to progeny individuals that are genotyped for a much smaller number of loci. Here we introduce a parent-progeny imputation method that infers individual genotypes from non-barcoded pooled samples of DNA of multiple individuals using a Hidden Markov Model (HMM). We demonstrate the method for pools of simulated maize double haploids (DH) from biparental populations, genotyped using a genotyping by sequencing (GBS) approach for 3,000 loci at 0.125x to 4x coverage. We observed high concordance between true and imputed marker scores and the HMM produced well-calibrated genotype probabilities that correctly reflected the uncertainty of the imputed scores. Genomic estimated breeding values (GEBV) calculated from the imputed scores closely matched GEBV calculated from the true marker scores. The within-population correlation between these sets of GEBV approached 0.95 at 1x and 4x coverage when pooling two or four individuals, respectively. Our approach can reduce the genotyping cost per individual by a factor up to the number of pooled individuals in GBS applications without the need for extra sequencing coverage, thereby enabling cost-effective large scale genotyping for applications such as WGS in plant breeding.

摘要

全基因组选择（WGS）和其他分子评估方法在植物育种中的使用增加，这依赖于在每个育种周期中对大量未经测试的个体进行基因分型的能力。许多植物育种计划评估来自纯合亲本自交系的纯合个体的大型双亲群体。这种结构适合亲子代基因型填充，即将亲本的基因型分数转移到仅对少数位点进行基因分型的子代个体上。在这里，我们介绍一种亲子代基因型填充方法，该方法使用隐马尔可夫模型（HMM）从多个个体的非条形码DNA混合样本中推断个体基因型。我们展示了该方法在来自双亲群体的模拟玉米双单倍体（DH）样本池中的应用，这些样本池使用测序基因分型（GBS）方法在0.125x至4x覆盖度下对3000个位点进行基因分型。我们观察到真实标记分数与填充标记分数之间具有高度一致性，并且HMM产生了校准良好的基因型概率，正确反映了填充分数的不确定性。根据填充分数计算的基因组估计育种值（GEBV）与根据真实标记分数计算的GEBV紧密匹配。当分别合并两个或四个个体时，在1x和4x覆盖度下，这两组GEBV之间的群体内相关性分别接近0.95。我们的方法可以将每个个体的基因分型成本降低多达GBS应用中合并个体数量的倍数，而无需额外的测序覆盖度，从而能够在植物育种中的WGS等应用中进行具有成本效益的大规模基因分型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/122b/5741258/2111a21cd2d5/pone.0190271.g001.jpg

相似文献

Parent-progeny imputation from pooled samples for cost-efficient genotyping in plant breeding.

PLoS One. 2017 Dec 22;12(12):e0190271. doi: 10.1371/journal.pone.0190271. eCollection 2017.

Practical implementation of cost-effective genomic selection in commercial pig breeding using imputation.

J Anim Sci. 2013 Aug;91(8):3583-92. doi: 10.2527/jas.2013-6270. Epub 2013 Jun 4.

Imputation of non-genotyped F1 dams to improve genetic gain in swine crossbreeding programs.

J Anim Sci. 2022 May 1;100(5). doi: 10.1093/jas/skac148.

Low-depth genotyping-by-sequencing (GBS) in a bovine population: strategies to maximize the selection of high quality genotypes and the accuracy of imputation.

BMC Genet. 2017 Apr 5;18(1):32. doi: 10.1186/s12863-017-0501-y.

Imputation of non-genotyped sheep from the genotypes of their mates and resulting progeny.

Animal. 2018 Feb;12(2):191-198. doi: 10.1017/S1751731117001653. Epub 2017 Jul 17.

Applications of genotyping-by-sequencing (GBS) in maize genetics and breeding.

Sci Rep. 2020 Oct 1;10(1):16308. doi: 10.1038/s41598-020-73321-8.

Whole-genome characterization in pedigreed non-human primates using genotyping-by-sequencing (GBS) and imputation.

BMC Genomics. 2016 Aug 24;17(1):676. doi: 10.1186/s12864-016-2966-x.

A joint use of pooling and imputation for genotyping SNPs.

BMC Bioinformatics. 2022 Oct 13;23(1):421. doi: 10.1186/s12859-022-04974-7.

Genotype Imputation To Improve the Cost-Efficiency of Genomic Selection in Farmed Atlantic Salmon.

G3 (Bethesda). 2017 Apr 3;7(4):1377-1383. doi: 10.1534/g3.117.040717.

Genotype Imputation to Improve the Cost-Efficiency of Genomic Selection in Rabbits.

Animals (Basel). 2021 Mar 13;11(3):803. doi: 10.3390/ani11030803.

引用本文的文献

A Virome Scanning of Saffron ( L.) at the National Scale in Iran Using High-Throughput Sequencing Technologies.

Viruses. 2025 Aug 4;17(8):1079. doi: 10.3390/v17081079.

Genotyping of SNPs in bread wheat at reduced cost from pooled experiments and imputation.

Theor Appl Genet. 2024 Jan 19;137(1):26. doi: 10.1007/s00122-023-04533-5.

A joint use of pooling and imputation for genotyping SNPs.

BMC Bioinformatics. 2022 Oct 13;23(1):421. doi: 10.1186/s12859-022-04974-7.

Integrated Approach in Genomic Selection to Accelerate Genetic Gain in Sugarcane.

Plants (Basel). 2022 Aug 17;11(16):2139. doi: 10.3390/plants11162139.

High-throughput characterization, correlation, and mapping of leaf photosynthetic and functional traits in the soybean (Glycine max) nested association mapping population.

Genetics. 2022 May 31;221(2). doi: 10.1093/genetics/iyac065.

Characterization and Mapping of , and Broad-Spectrum Resistances to Turnip Mosaic Virus in , and the Development of Robust Methods for Utilizing Recalcitrant Genotyping Data.

Front Plant Sci. 2022 Jan 12;12:787354. doi: 10.3389/fpls.2021.787354. eCollection 2021.

Can we harness digital technologies and physiology to hasten genetic gain in US maize breeding?

Plant Physiol. 2022 Feb 4;188(2):1141-1157. doi: 10.1093/plphys/kiab527.

Use of F2 Bulks in Training Sets for Genomic Prediction of Combining Ability and Hybrid Performance.

G3 (Bethesda). 2019 May 7;9(5):1557-1569. doi: 10.1534/g3.118.200994.

本文引用的文献

Marker Imputation Before Genomewide Selection in Biparental Maize Populations.

Plant Genome. 2015 Jul;8(2):eplantgenome2014.10.0078. doi: 10.3835/plantgenome2014.10.0078.

Accuracy of Genomic Prediction in Synthetic Populations Depending on the Number of Parents, Relatedness, and Ancestral Linkage Disequilibrium.

Genetics. 2017 Jan;205(1):441-454. doi: 10.1534/genetics.116.193243. Epub 2016 Nov 9.

Genomic selection in wheat: optimum allocation of test resources and comparison of breeding strategies for line and hybrid breeding.

Theor Appl Genet. 2015 Jul;128(7):1297-306. doi: 10.1007/s00122-015-2505-1. Epub 2015 Apr 16.

Shrinkage estimation of the genomic relationship matrix can improve genomic estimated breeding values in the training set.

Theor Appl Genet. 2015 Apr;128(4):693-703. doi: 10.1007/s00122-015-2464-6. Epub 2015 Mar 4.

Identification of key ancestors of modern germplasm in a breeding program of maize.

Theor Appl Genet. 2014 Dec;127(12):2545-53. doi: 10.1007/s00122-014-2396-6. Epub 2014 Sep 11.

Genome properties and prospects of genomic prediction of hybrid performance in a breeding program of maize.

Genetics. 2014 Aug;197(4):1343-55. doi: 10.1534/genetics.114.165860. Epub 2014 May 21.

Population-genetic inference from pooled-sequencing data.

Genome Biol Evol. 2014 Apr 30;6(5):1210-8. doi: 10.1093/gbe/evu085.

A general approach for haplotype phasing across the full spectrum of relatedness.

PLoS Genet. 2014 Apr 17;10(4):e1004234. doi: 10.1371/journal.pgen.1004234. eCollection 2014 Apr.

Validation of SNP allele frequencies determined by pooled next-generation sequencing in natural populations of a non-model plant species.

PLoS One. 2013 Nov 7;8(11):e80422. doi: 10.1371/journal.pone.0080422. eCollection 2013.

Field high-throughput phenotyping: the new crop breeding frontier.

Trends Plant Sci. 2014 Jan;19(1):52-61. doi: 10.1016/j.tplants.2013.09.008. Epub 2013 Oct 16.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用混合样本进行亲子基因型填充以实现植物育种中经济高效的基因分型

Parent-progeny imputation from pooled samples for cost-efficient genotyping in plant breeding.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献