• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无序标记的推断及其对基因组选择准确性的影响。

Imputation of unordered markers and the impact on genomic selection accuracy.

机构信息

Department of Plant Breeding and Genetics, Cornell University, Ithaca New York 14853, USA.

出版信息

G3 (Bethesda). 2013 Mar;3(3):427-39. doi: 10.1534/g3.112.005363. Epub 2013 Mar 1.

DOI:10.1534/g3.112.005363
PMID:23449944
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3583451/
Abstract

Genomic selection, a breeding method that promises to accelerate rates of genetic gain, requires dense, genome-wide marker data. Genotyping-by-sequencing can generate a large number of de novo markers. However, without a reference genome, these markers are unordered and typically have a large proportion of missing data. Because marker imputation algorithms were developed for species with a reference genome, algorithms suited for unordered markers have not been rigorously evaluated. Using four empirical datasets, we evaluate and characterize four such imputation methods, referred to as k-nearest neighbors, singular value decomposition, random forest regression, and expectation maximization imputation, in terms of their imputation accuracies and the factors affecting accuracy. The effect of imputation method on the genomic selection accuracy is assessed in comparison with mean imputation. The effect of excluding markers with a large proportion of missing data on the genomic selection accuracy is also examined. Our results show that imputation of unordered markers can be accurate, especially when linkage disequilibrium between markers is high and genotyped individuals are related. Of the methods evaluated, random forest regression imputation produced superior accuracy. In comparison with mean imputation, all four imputation methods we evaluated led to greater genomic selection accuracies when the level of missing data was high. Including rather than excluding markers with a large proportion of missing data nearly always led to greater GS accuracies. We conclude that high levels of missing data in dense marker sets is not a major obstacle for genomic selection, even when marker order is not known.

摘要

基因组选择是一种有望加速遗传增益速度的育种方法,它需要密集的、全基因组标记数据。测序基因分型可以产生大量的从头标记。然而,没有参考基因组,这些标记是无序的,通常有很大比例的缺失数据。由于标记估计算法是为有参考基因组的物种开发的,因此尚未对适合无序标记的算法进行严格评估。我们使用四个经验数据集,从估计准确性和影响准确性的因素两个方面,评估并描述了四种这样的估计方法,分别称为 k-最近邻、奇异值分解、随机森林回归和期望最大化估计,这些方法是针对无序标记的。与均值估计相比,评估了估计方法对基因组选择准确性的影响。还检查了排除具有大量缺失数据的标记对基因组选择准确性的影响。我们的结果表明,无序标记的估计可以是准确的,尤其是当标记之间的连锁不平衡较高且基因型个体相关时。在所评估的方法中,随机森林回归估计产生了较高的准确性。与均值估计相比,当缺失数据水平较高时,我们评估的所有四种估计方法都导致了更高的基因组选择准确性。我们得出结论,即使不知道标记顺序,在密集标记集中存在高水平的缺失数据也不是基因组选择的主要障碍。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/90fac197b553/427f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/825d5de15180/427f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/c467d6334d35/427f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/19468f0ad508/427f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/e1405d69c3da/427f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/195b7f6e537b/427f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/90fac197b553/427f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/825d5de15180/427f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/c467d6334d35/427f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/19468f0ad508/427f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/e1405d69c3da/427f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/195b7f6e537b/427f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ae0/3583451/90fac197b553/427f6.jpg

相似文献

1
Imputation of unordered markers and the impact on genomic selection accuracy.无序标记的推断及其对基因组选择准确性的影响。
G3 (Bethesda). 2013 Mar;3(3):427-39. doi: 10.1534/g3.112.005363. Epub 2013 Mar 1.
2
Optimizing Low-Cost Genotyping and Imputation Strategies for Genomic Selection in Atlantic Salmon.优化大西洋鲑基因组选择中的低成本基因型分型和基因分型策略。
G3 (Bethesda). 2020 Feb 6;10(2):581-590. doi: 10.1534/g3.119.400800.
3
LinkImpute: Fast and Accurate Genotype Imputation for Nonmodel Organisms.LinkImpute:非模式生物的快速准确基因型填充
G3 (Bethesda). 2015 Sep 15;5(11):2383-90. doi: 10.1534/g3.115.021667.
4
Genomic Prediction for Grain Yield and Yield-Related Traits in Chinese Winter Wheat.中国冬小麦产量及产量相关性状的基因组预测。
Int J Mol Sci. 2020 Feb 17;21(4):1342. doi: 10.3390/ijms21041342.
5
Whole-genome characterization in pedigreed non-human primates using genotyping-by-sequencing (GBS) and imputation.利用简化基因组测序(GBS)和填充技术对圈养非人灵长类动物进行全基因组特征分析。
BMC Genomics. 2016 Aug 24;17(1):676. doi: 10.1186/s12864-016-2966-x.
6
Genotyping by sequencing for genomic prediction in a soybean breeding population.大豆育种群体中用于基因组预测的测序基因分型
BMC Genomics. 2014 Aug 29;15(1):740. doi: 10.1186/1471-2164-15-740.
7
Resource allocation for maximizing prediction accuracy and genetic gain of genomic selection in plant breeding: a simulation experiment.最大化植物育种中基因组选择预测准确性和遗传增益的资源分配:模拟实验。
G3 (Bethesda). 2013 Mar;3(3):481-91. doi: 10.1534/g3.112.004911. Epub 2013 Mar 1.
8
Genetic diversity analysis of highly incomplete SNP genotype data with imputations: an empirical assessment.基于插补的高度不完整SNP基因型数据的遗传多样性分析:实证评估
G3 (Bethesda). 2014 Mar 13;4(5):891-900. doi: 10.1534/g3.114.010942.
9
Saturated linkage map construction in Rubus idaeus using genotyping by sequencing and genome-independent imputation.利用测序基因型和与基因组无关的推断构建树莓饱和连锁图谱。
BMC Genomics. 2013 Jan 16;14:2. doi: 10.1186/1471-2164-14-2.
10
Enlarging a training set for genomic selection by imputation of un-genotyped animals in populations of varying genetic architecture.通过在遗传结构不同的群体中对未基因型动物进行推断来扩大基因组选择的训练集。
Genet Sel Evol. 2013 Apr 26;45(1):12. doi: 10.1186/1297-9686-45-12.

引用本文的文献

1
Genome-wide association mapping for early maturity in kintoki bean ( L.).近畿锦豆(L.)早熟性的全基因组关联图谱分析
Breed Sci. 2025 Apr;75(2):119-128. doi: 10.1270/jsbbs.24054. Epub 2025 Mar 27.
2
Enhancing prediction accuracy of key biomass partitioning traits in wheat using multi-kernel genomic prediction models integrating secondary traits and environmental covariates.利用整合次要性状和环境协变量的多核基因组预测模型提高小麦关键生物量分配性状的预测准确性。
Plant Genome. 2025 Jun;18(2):e70052. doi: 10.1002/tpg2.70052.
3
Evaluation of Low-Coverage Sequencing Strategies for Whole-Genome Imputation in Pacific Abalone .

本文引用的文献

1
Effectiveness of genomic prediction of maize hybrid performance in different breeding populations and environments.基因组预测在不同育种群体和环境下对玉米杂交种表现的有效性。
G3 (Bethesda). 2012 Nov;2(11):1427-36. doi: 10.1534/g3.112.003699. Epub 2012 Nov 1.
2
Imputation of genotypes with low-density chips and its effect on reliability of direct genomic values in Dutch Holstein cattle.使用低密度芯片进行基因型推断及其对荷兰荷斯坦奶牛直接基因组值可靠性的影响。
J Dairy Sci. 2012 Feb;95(2):876-89. doi: 10.3168/jds.2011-4490.
3
Accuracy of genomic selection in European maize elite breeding populations.
太平洋鲍鱼全基因组插补的低覆盖度测序策略评估
Int J Mol Sci. 2025 May 11;26(10):4598. doi: 10.3390/ijms26104598.
4
Genomics-assisted breeding for designing salinity-smart future crops.通过基因组学辅助育种设计适应盐渍环境的未来作物。
Plant Biotechnol J. 2025 Aug;23(8):3119-3151. doi: 10.1111/pbi.70104. Epub 2025 May 20.
5
Comparison of ddRADseq and EUChip60K SNP genotyping systems for population genetics and genomic selection in (Maiden).用于(梅登)群体遗传学和基因组选择的ddRADseq和EUChip60K单核苷酸多态性基因分型系统的比较
Front Genet. 2024 Mar 26;15:1361418. doi: 10.3389/fgene.2024.1361418. eCollection 2024.
6
Genomic selection for growth characteristics in Korean red pine ( Seibold & Zucc.).韩国赤松(Siebold & Zucc.)生长特性的基因组选择
Front Plant Sci. 2024 Jan 23;15:1285094. doi: 10.3389/fpls.2024.1285094. eCollection 2024.
7
An international wheat diversity panel reveals novel sources of genetic resistance to tan spot in Australia.国际小麦多样性小组揭示了澳大利亚小麦叶枯病的新型遗传抗性来源。
Theor Appl Genet. 2023 Mar 13;136(3):61. doi: 10.1007/s00122-023-04332-y.
8
Unraveling Genomic Regions Controlling Root Traits as a Function of Nitrogen Availability in the MAGIC Wheat Population WM-800.解析MAGIC小麦群体WM - 800中作为氮素有效性函数的控制根系性状的基因组区域。
Plants (Basel). 2022 Dec 14;11(24):3520. doi: 10.3390/plants11243520.
9
A Haplotype-Based GWAS Identified Trait-Improving QTL Alleles Controlling Agronomic Traits under Contrasting Nitrogen Fertilization Treatments in the MAGIC Wheat Population WM-800.基于单倍型的全基因组关联研究在多亲本高级世代互交小麦群体WM-800中鉴定出在不同氮肥处理下控制农艺性状的性状改良数量性状位点等位基因。
Plants (Basel). 2022 Dec 14;11(24):3508. doi: 10.3390/plants11243508.
10
Integrated Approach in Genomic Selection to Accelerate Genetic Gain in Sugarcane.基因组选择中的综合方法以加速甘蔗的遗传增益
Plants (Basel). 2022 Aug 17;11(16):2139. doi: 10.3390/plants11162139.
基因组选择在欧洲玉米精英育种群体中的准确性。
Theor Appl Genet. 2012 Mar;124(4):769-76. doi: 10.1007/s00122-011-1745-y. Epub 2011 Nov 11.
4
MissForest--non-parametric missing value imputation for mixed-type data.MissForest--用于混合类型数据的非参数缺失值插补。
Bioinformatics. 2012 Jan 1;28(1):112-8. doi: 10.1093/bioinformatics/btr597. Epub 2011 Oct 28.
5
Performance of genotype imputation for rare variants identified in exons and flanking regions of genes.外显子和基因侧翼区域中鉴定出的稀有变异的基因型推断性能。
PLoS One. 2011;6(9):e24945. doi: 10.1371/journal.pone.0024945. Epub 2011 Sep 19.
6
Effect of imputing markers from a low-density chip on the reliability of genomic breeding values in Holstein populations.低密度芯片标记信息导入对荷斯坦牛群体基因组育种值可靠性的影响。
J Dairy Sci. 2011 Jul;94(7):3679-86. doi: 10.3168/jds.2011-4299.
7
A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species.一种用于高多样性物种的稳健、简单的测序分型(GBS)方法。
PLoS One. 2011 May 4;6(5):e19379. doi: 10.1371/journal.pone.0019379.
8
MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes.MaCH:利用序列和基因型数据来估计单倍型和未观测基因型。
Genet Epidemiol. 2010 Dec;34(8):816-34. doi: 10.1002/gepi.20533.
9
Marker imputation with low-density marker panels in Dutch Holstein cattle.荷兰荷斯坦奶牛的低密度标记面板标记导入。
J Dairy Sci. 2010 Nov;93(11):5487-94. doi: 10.3168/jds.2010-3501.
10
Imputation of genotypes from different single nucleotide polymorphism panels in dairy cattle.从奶牛不同单核苷酸多态性面板中推断基因型。
J Dairy Sci. 2010 Nov;93(11):5443-54. doi: 10.3168/jds.2010-3255.