• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从黑杨育种群体中的低密度单核苷酸多态性面板进行序列推断。

Sequence imputation from low density single nucleotide polymorphism panel in a black poplar breeding population.

机构信息

BioForA, INRA, ONF, 45075, Orléans, France, 2163 Avenue de la Pomme de Pin CS 40001 ARDON, Orléans Cedex 2, 45075, France.

Etude du Polymorphisme des Génomes Végétaux (EPGV), INRA, Université Paris-Saclay, 91000, 2 rue Gaston Crémieux, Evry, 9100, France.

出版信息

BMC Genomics. 2019 Apr 18;20(1):302. doi: 10.1186/s12864-019-5660-y.

DOI:10.1186/s12864-019-5660-y
PMID:30999856
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6471894/
Abstract

BACKGROUND

Genomic selection accuracy increases with the use of high SNP (single nucleotide polymorphism) coverage. However, such gains in coverage come at high costs, preventing their prompt operational implementation by breeders. Low density panels imputed to higher densities offer a cheaper alternative during the first stages of genomic resources development. Our study is the first to explore the imputation in a tree species: black poplar. About 1000 pure-breed Populus nigra trees from a breeding population were selected and genotyped with a 12K custom Infinium Bead-Chip. Forty-three of those individuals corresponding to nodal trees in the pedigree were fully sequenced (reference), while the remaining majority (target) was imputed from 8K to 1.4 million SNPs using FImpute. Each SNP and individual was evaluated for imputation errors by leave-one-out cross validation in the training sample of 43 sequenced trees. Some summary statistics such as Hardy-Weinberg Equilibrium exact test p-value, quality of sequencing, depth of sequencing per site and per individual, minor allele frequency, marker density ratio or SNP information redundancy were calculated. Principal component and Boruta analyses were used on all these parameters to rank the factors affecting the quality of imputation. Additionally, we characterize the impact of the relatedness between reference population and target population.

RESULTS

During the imputation process, we used 7540 SNPs from the chip to impute 1,438,827 SNPs from sequences. At the individual level, imputation accuracy was high with a proportion of SNPs correctly imputed between 0.84 and 0.99. The variation in accuracies was mostly due to differences in relatedness between individuals. At a SNP level, the imputation quality depended on genotyped SNP density and on the original minor allele frequency. The imputation did not appear to result in an increase of linkage disequilibrium. The genotype densification not only brought a better distribution of markers all along the genome, but also we did not detect any substantial bias in annotation categories.

CONCLUSIONS

This study shows that it is possible to impute low-density marker panels to whole genome sequence with good accuracy under certain conditions that could be common to many breeding populations.

摘要

背景

基因组选择的准确性随着 SNP(单核苷酸多态性)覆盖率的提高而提高。然而,这种覆盖范围的增加需要付出高昂的成本,这使得育种者无法迅速实施。在基因组资源开发的早期阶段,低密度面板被推断为更高密度的面板提供了一种更便宜的选择。我们的研究首次探索了树木物种中的推断:黑杨。从一个育种群体中选择了大约 1000 棵纯黑杨,并使用 12K 定制的 Infinium Bead-Chip 进行了基因型分析。其中 43 棵对应于系谱中的节点树的个体被完全测序(参考),而其余的大多数(目标)则通过 FImpute 从 8K 推断到 140 万 SNPs。每个 SNP 和个体都在 43 棵测序树的训练样本中通过留一法交叉验证来评估推断错误。通过 Hardy-Weinberg 平衡精确检验 p 值、测序质量、每个位点和个体的测序深度、次要等位基因频率、标记密度比或 SNP 信息冗余等一些汇总统计数据来计算。对所有这些参数进行主成分和 Boruta 分析,以对影响推断质量的因素进行排名。此外,我们还描述了参考群体和目标群体之间的亲缘关系对推断的影响。

结果

在推断过程中,我们使用芯片上的 7540 个 SNP 来推断来自序列的 1438827 个 SNP。在个体水平上,SNP 的正确推断比例在 0.84 到 0.99 之间,推断精度较高。准确性的变化主要是由于个体之间的亲缘关系不同。在 SNP 水平上,推断质量取决于基因分型 SNP 的密度和原始的次要等位基因频率。推断似乎没有导致连锁不平衡的增加。基因型密集化不仅使标记在整个基因组上的分布更好,而且我们没有检测到注释类别有任何实质性的偏差。

结论

本研究表明,在某些条件下,有可能以较高的准确性将低密度标记面板推断为全基因组序列,这些条件可能在许多育种群体中都很常见。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/2c86c403712b/12864_2019_5660_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/0a49af1b85d3/12864_2019_5660_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/c9c870155107/12864_2019_5660_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/5fdaec3b1c33/12864_2019_5660_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/1d7c4d034433/12864_2019_5660_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/d57aac3d3e12/12864_2019_5660_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/2c86c403712b/12864_2019_5660_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/0a49af1b85d3/12864_2019_5660_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/c9c870155107/12864_2019_5660_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/5fdaec3b1c33/12864_2019_5660_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/1d7c4d034433/12864_2019_5660_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/d57aac3d3e12/12864_2019_5660_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5354/6471894/2c86c403712b/12864_2019_5660_Fig6_HTML.jpg

相似文献

1
Sequence imputation from low density single nucleotide polymorphism panel in a black poplar breeding population.从黑杨育种群体中的低密度单核苷酸多态性面板进行序列推断。
BMC Genomics. 2019 Apr 18;20(1):302. doi: 10.1186/s12864-019-5660-y.
2
Design of a low-density SNP chip for the main Australian sheep breeds and its effect on imputation and genomic prediction accuracy.用于澳大利亚主要绵羊品种的低密度单核苷酸多态性(SNP)芯片设计及其对填充和基因组预测准确性的影响。
Anim Genet. 2015 Oct;46(5):544-56. doi: 10.1111/age.12340. Epub 2015 Sep 11.
3
High imputation accuracy from informative low-to-medium density single nucleotide polymorphism genotypes is achievable in sheep1.在绵羊中,信息量较低且中等密度的单核苷酸多态性基因型也能实现高的插补准确性。1
J Anim Sci. 2019 Apr 3;97(4):1550-1567. doi: 10.1093/jas/skz043.
4
Accuracy of genotype imputation in sheep breeds.绵羊品种基因型推断的准确性。
Anim Genet. 2012 Feb;43(1):72-80. doi: 10.1111/j.1365-2052.2011.02208.x. Epub 2011 May 27.
5
The feasibility of using low-density marker panels for genotype imputation and genomic prediction of crossbred dairy cattle of East Africa.利用低密度标记面板对东非杂交奶牛进行基因型推断和基因组预测的可行性。
J Dairy Sci. 2018 Oct;101(10):9108-9127. doi: 10.3168/jds.2018-14621. Epub 2018 Aug 1.
6
Genotype imputation from various low-density SNP panels and its impact on accuracy of genomic breeding values in pigs.从各种低密度 SNP 面板进行基因型推断及其对猪基因组育种值准确性的影响。
Animal. 2018 Nov;12(11):2235-2245. doi: 10.1017/S175173111800085X. Epub 2018 Apr 30.
7
Multi-generational imputation of single nucleotide polymorphism marker genotypes and accuracy of genomic selection.多世代单核苷酸多态性标记基因型的推断及其基因组选择的准确性。
Animal. 2016 Jul;10(7):1077-85. doi: 10.1017/S1751731115002906. Epub 2016 Jan 6.
8
Accuracy of genotype imputation in Nelore cattle.内洛尔牛基因型填充的准确性。
Genet Sel Evol. 2014 Oct 10;46(1):69. doi: 10.1186/s12711-014-0069-1.
9
Extent of linkage disequilibrium, consistency of gametic phase, and imputation accuracy within and across Canadian dairy breeds.加拿大奶牛品种内和品种间的连锁不平衡程度、配子相位一致性及填充准确性。
J Dairy Sci. 2014 May;97(5):3128-41. doi: 10.3168/jds.2013-6826. Epub 2014 Feb 26.
10
Strategies for single nucleotide polymorphism (SNP) genotyping to enhance genotype imputation in Gyr (Bos indicus) dairy cattle: Comparison of commercially available SNP chips.提高吉尔(印度瘤牛)奶牛单核苷酸多态性(SNP)基因分型以增强基因型填充的策略:市售SNP芯片的比较
J Dairy Sci. 2015 Jul;98(7):4969-89. doi: 10.3168/jds.2014-9213. Epub 2015 May 7.

引用本文的文献

1
Genome-wide genotyping data renew knowledge on genetic diversity of a worldwide alfalfa collection and give insights on genetic control of phenology traits.全基因组基因分型数据更新了对全球苜蓿种质资源遗传多样性的认识,并为物候性状的遗传控制提供了见解。
Front Plant Sci. 2023 Jul 5;14:1196134. doi: 10.3389/fpls.2023.1196134. eCollection 2023.
2
How to achieve a higher selection plateau in forest tree breeding? Fostering heterozygote × homozygote relationships in optimal contribution selection in the case study of .如何在林木育种中实现更高的选择平台期?在……的案例研究中,在最优贡献选择中促进杂合子×纯合子关系。
Evol Appl. 2021 Sep 21;14(11):2635-2646. doi: 10.1111/eva.13300. eCollection 2021 Nov.
3

本文引用的文献

1
Genomic evaluation of feed efficiency component traits in Duroc pigs using 80K, 650K and whole-genome sequence variants.利用 80K、650K 和全基因组序列变异对杜洛克猪饲料效率组成性状进行基因组评估。
Genet Sel Evol. 2018 Apr 6;50(1):14. doi: 10.1186/s12711-018-0387-9.
2
Imputation from SNP chip to sequence: a case study in a Chinese indigenous chicken population.从单核苷酸多态性(SNP)芯片到序列的填充:以一个中国地方鸡群体为例的研究
J Anim Sci Biotechnol. 2018 Mar 21;9:30. doi: 10.1186/s40104-018-0241-5. eCollection 2018.
3
Comparing strategies for selection of low-density SNPs for imputation-mediated genomic prediction in U. S. Holsteins.
The apple REFPOP-a reference population for genomics-assisted breeding in apple.
苹果REFPOP——苹果基因组辅助育种的参考群体。
Hortic Res. 2020 Nov 1;7(1):189. doi: 10.1038/s41438-020-00408-8.
4
Favorable Conditions for Genomic Evaluation to Outperform Classical Pedigree Evaluation Highlighted by a Proof-of-Concept Study in Poplar.杨树概念验证研究凸显基因组评估优于传统系谱评估的有利条件
Front Plant Sci. 2020 Oct 28;11:581954. doi: 10.3389/fpls.2020.581954. eCollection 2020.
比较美国荷斯坦奶牛中用于插补介导的基因组预测的低密度单核苷酸多态性(SNP)选择策略。
Genetica. 2018 Apr;146(2):137-149. doi: 10.1007/s10709-017-0004-9. Epub 2017 Dec 14.
4
Short communication: Genomic prediction using imputed whole-genome sequence variants in Brown Swiss Cattle.短讯:利用导入的全基因组序列变异对瑞士褐牛进行基因组预测。
J Dairy Sci. 2018 Feb;101(2):1292-1296. doi: 10.3168/jds.2017-12890. Epub 2017 Nov 15.
5
Imputation of non-genotyped sheep from the genotypes of their mates and resulting progeny.从配偶和后代的基因型推断未基因型绵羊。
Animal. 2018 Feb;12(2):191-198. doi: 10.1017/S1751731117001653. Epub 2017 Jul 17.
6
Reevaluation of SNP heritability in complex human traits.复杂人类性状中SNP遗传力的重新评估。
Nat Genet. 2017 Jul;49(7):986-992. doi: 10.1038/ng.3865. Epub 2017 May 22.
7
The impact of multi-generational genotype imputation strategies on imputation accuracy and subsequent genomic predictions.多代基因型填充策略对填充准确性及后续基因组预测的影响。
J Anim Sci. 2017 Apr;95(4):1489-1501. doi: 10.2527/jas.2016.1212.
8
Genotype Imputation To Improve the Cost-Efficiency of Genomic Selection in Farmed Atlantic Salmon.基因型填充以提高养殖大西洋鲑基因组选择的成本效益
G3 (Bethesda). 2017 Apr 3;7(4):1377-1383. doi: 10.1534/g3.117.040717.
9
Multi-generational imputation of single nucleotide polymorphism marker genotypes and accuracy of genomic selection.多世代单核苷酸多态性标记基因型的推断及其基因组选择的准确性。
Animal. 2016 Jul;10(7):1077-85. doi: 10.1017/S1751731115002906. Epub 2016 Jan 6.
10
Assessing and comparison of different machine learning methods in parent-offspring trios for genotype imputation.评估和比较亲子三联体中不同机器学习方法用于基因型填充的情况。
J Theor Biol. 2016 Jun 21;399:148-58. doi: 10.1016/j.jtbi.2016.03.035. Epub 2016 Apr 2.