• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

获取和修剪用于基因组预测的插补全基因组序列数据的策略。

Strategies for Obtaining and Pruning Imputed Whole-Genome Sequence Data for Genomic Prediction.

作者信息

Ye Shaopan, Gao Ning, Zheng Rongrong, Chen Zitao, Teng Jinyan, Yuan Xiaolong, Zhang Hao, Chen Zanmou, Zhang Xiquan, Li Jiaqi, Zhang Zhe

机构信息

Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China.

State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China.

出版信息

Front Genet. 2019 Jul 17;10:673. doi: 10.3389/fgene.2019.00673. eCollection 2019.

DOI:10.3389/fgene.2019.00673
PMID:31379929
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6650575/
Abstract

Genomic prediction with imputed whole-genome sequencing (WGS) data is an attractive approach to improve predictive ability with low cost. However, high accuracy has not been realized using this method in livestock. In this study, we imputed 435 individuals from 600K single nucleotide polymorphism (SNP) chip data to WGS data using different reference panels. We also investigated the prediction accuracy of genomic best linear unbiased prediction (GBLUP) using imputed WGS data from different reference panels, linkage disequilibrium (LD)-based marker pruning, and pre-selected variants based on Genome-wide association society (GWAS) results. Results showed that the imputation accuracies from 600K to WGS data were 0.873 ± 0.038, 0.906 ± 0.036, and 0.979 ± 0.010 for the internal, external, and combined reference panels, respectively. In most traits of chickens, the prediction accuracy of imputed WGS data obtained from the internal reference panel was greater than or equal to that of the combined reference panel; the external reference panel had the lowest prediction accuracy. Compared with 600K chip data, GBLUP with imputed WGS data had only a small increase (1-3%) in prediction accuracy. Using only variants selected from imputed WGS data based on GWAS results resulted in almost no increase for most traits and even increased the bias of the regression coefficient. The impact of the degree of LD of selected and remaining variants on prediction accuracy was different. For average daily gain (ADG), residual feed intake (RFI), intestine length (IL), and body weight in 91 days (BW91), the accuracy of GBLUP increased as the degree of LD of selected variants decreased, but the opposite relationship occurred for the remaining variants. But for breast muscle weight (BMW) and average daily feed intake (ADFI), the accuracy of GBLUP increased as the degree of LD of selected variants increased, and the degree of LD of remaining variants had a small effect on prediction accuracy. Overall, the optimal imputation strategy to obtain WGS data for genomic prediction should consider the relationship between selected individuals and target population individuals to avoid heterogeneity of imputation. LD-based marker pruning can be used to improve the accuracy of genomic prediction using imputed WGS data.

摘要

利用推算的全基因组测序(WGS)数据进行基因组预测是一种低成本提高预测能力的有吸引力的方法。然而,在牲畜中使用这种方法尚未实现高精度。在本研究中,我们使用不同的参考面板将600K单核苷酸多态性(SNP)芯片数据中的435个个体推算为WGS数据。我们还研究了使用来自不同参考面板的推算WGS数据、基于连锁不平衡(LD)的标记筛选以及基于全基因组关联协会(GWAS)结果预先选择的变异进行基因组最佳线性无偏预测(GBLUP)的预测准确性。结果表明,对于内部、外部和组合参考面板,从600K到WGS数据的推算准确性分别为0.873±0.038、0.906±0.036和0.979±0.010。在鸡的大多数性状中,从内部参考面板获得的推算WGS数据的预测准确性大于或等于组合参考面板;外部参考面板的预测准确性最低。与600K芯片数据相比,使用推算WGS数据的GBLUP在预测准确性上仅略有提高(1 - 3%)。仅使用基于GWAS结果从推算WGS数据中选择的变异,对于大多数性状几乎没有提高,甚至增加了回归系数的偏差。所选和剩余变异的LD程度对预测准确性的影响不同。对于平均日增重(ADG)、剩余采食量(RFI)、肠长度(IL)和91日龄体重(BW91),GBLUP的准确性随着所选变异LD程度的降低而增加,但对于剩余变异则呈现相反的关系。但对于胸肌重(BMW)和平均日采食量(ADFI),GBLUP的准确性随着所选变异LD程度的增加而增加,剩余变异的LD程度对预测准确性影响较小。总体而言,为进行基因组预测获取WGS数据的最佳推算策略应考虑所选个体与目标群体个体之间的关系,以避免推算的异质性。基于LD的标记筛选可用于提高使用推算WGS数据进行基因组预测的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0664/6650575/e932ae27d45c/fgene-10-00673-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0664/6650575/2672bc277d16/fgene-10-00673-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0664/6650575/06fdc1495704/fgene-10-00673-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0664/6650575/5a414ffc3e65/fgene-10-00673-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0664/6650575/e932ae27d45c/fgene-10-00673-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0664/6650575/2672bc277d16/fgene-10-00673-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0664/6650575/06fdc1495704/fgene-10-00673-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0664/6650575/5a414ffc3e65/fgene-10-00673-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0664/6650575/e932ae27d45c/fgene-10-00673-g004.jpg

相似文献

1
Strategies for Obtaining and Pruning Imputed Whole-Genome Sequence Data for Genomic Prediction.获取和修剪用于基因组预测的插补全基因组序列数据的策略。
Front Genet. 2019 Jul 17;10:673. doi: 10.3389/fgene.2019.00673. eCollection 2019.
2
Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs.利用基于插补的全基因组测序数据提高猪的组合群体基因组预测的准确性。
Genet Sel Evol. 2019 Oct 21;51(1):58. doi: 10.1186/s12711-019-0500-8.
3
Genomic Prediction Based on SNP Functional Annotation Using Imputed Whole-Genome Sequence Data in Korean Hanwoo Cattle.基于插补全基因组序列数据的单核苷酸多态性功能注释在韩牛中的基因组预测
Front Genet. 2021 Jan 21;11:603822. doi: 10.3389/fgene.2020.603822. eCollection 2020.
4
Pre-selecting markers based on fixation index scores improved the power of genomic evaluations in a combined Yorkshire pig population.基于固定指数得分预先选择标记,提高了约克夏猪混合群体基因组评估的效能。
Animal. 2020;14(8):1555-1564. doi: 10.1017/S1751731120000506. Epub 2020 Mar 25.
5
Use of whole-genome sequence data and novel genomic selection strategies to improve selection for age at puberty in tropically-adapted beef heifers.利用全基因组序列数据和新型基因组选择策略提高热带适应性肉牛小母牛初情期的选择效果。
Genet Sel Evol. 2020 May 27;52(1):28. doi: 10.1186/s12711-020-00547-5.
6
Genomic prediction based on selected variants from imputed whole-genome sequence data in Australian sheep populations.基于澳大利亚绵羊群体中推断的全基因组序列数据中选择的变体的基因组预测。
Genet Sel Evol. 2019 Dec 5;51(1):72. doi: 10.1186/s12711-019-0514-2.
7
Comparison of genotype imputation strategies using a combined reference panel for chicken population.利用鸡群体的组合参考面板比较基因型推断策略。
Animal. 2019 Jun;13(6):1119-1126. doi: 10.1017/S1751731118002860. Epub 2018 Oct 29.
8
Genomic prediction based on preselected single-nucleotide polymorphisms from genome-wide association study and imputed whole-genome sequence data annotation for growth traits in Duroc pigs.基于全基因组关联研究中预先选择的单核苷酸多态性和杜洛克猪生长性状的全基因组序列数据注释进行基因组预测。
Evol Appl. 2024 Feb 15;17(2):e13651. doi: 10.1111/eva.13651. eCollection 2024 Feb.
9
Reliabilities of Genomic Prediction for Young Stock Survival Traits Using 54K SNP Chip Augmented With Additional Single-Nucleotide Polymorphisms Selected From Imputed Whole-Genome Sequencing Data.使用从推算全基因组测序数据中选择的额外单核苷酸多态性增强的54K SNP芯片对幼畜存活性状进行基因组预测的可靠性
Front Genet. 2021 Jul 19;12:667300. doi: 10.3389/fgene.2021.667300. eCollection 2021.
10
Increased accuracy of genomic predictions for growth under chronic thermal stress in rainbow trout by prioritizing variants from GWAS using imputed sequence data.通过使用推算序列数据对全基因组关联研究(GWAS)中的变异进行优先级排序,提高虹鳟在慢性热应激下生长的基因组预测准确性。
Evol Appl. 2021 May 18;15(4):537-552. doi: 10.1111/eva.13240. eCollection 2022 Apr.

引用本文的文献

1
Genetic parameters and genomic prediction of egg production traits in ducks.鸭产蛋性状的遗传参数与基因组预测
Poult Sci. 2025 Jul 3;104(10):105510. doi: 10.1016/j.psj.2025.105510.
2
Genetic structure and demographic history of house mice in western Europe inferred using whole-genome sequences.利用全基因组序列推断西欧家鼠的遗传结构和种群历史。
Proc Biol Sci. 2025 Apr;292(2045):20242709. doi: 10.1098/rspb.2024.2709. Epub 2025 Apr 16.
3
Improvement of the accuracy of breeding value prediction for egg production traits in Muscovy duck using low-coverage whole-genome sequence data.

本文引用的文献

1
SumHer better estimates the SNP heritability of complex traits from summary statistics.SumHer 可以更好地从汇总统计数据估计复杂性状的 SNP 遗传力。
Nat Genet. 2019 Feb;51(2):277-284. doi: 10.1038/s41588-018-0279-5. Epub 2018 Dec 3.
2
Comparison of genotype imputation strategies using a combined reference panel for chicken population.利用鸡群体的组合参考面板比较基因型推断策略。
Animal. 2019 Jun;13(6):1119-1126. doi: 10.1017/S1751731118002860. Epub 2018 Oct 29.
3
Genomic prediction for numerically small breeds, using models with pre-selected and differentially weighted markers.
利用低覆盖度全基因组序列数据提高番鸭产蛋性状育种值预测的准确性
Poult Sci. 2025 Feb;104(2):104812. doi: 10.1016/j.psj.2025.104812. Epub 2025 Jan 15.
4
Explainable artificial intelligence for genotype-to-phenotype prediction in plant breeding: a case study with a dataset from an almond germplasm collection.用于植物育种中基因型到表型预测的可解释人工智能:以杏仁种质资源库数据集为例的研究
Front Plant Sci. 2024 Sep 9;15:1434229. doi: 10.3389/fpls.2024.1434229. eCollection 2024.
5
GWAS Enhances Genomic Prediction Accuracy of Caviar Yield, Caviar Color and Body Weight Traits in Sturgeons Using Whole-Genome Sequencing Data.GWAS 通过全基因组测序数据提高了鲟鱼鱼子酱产量、鱼子酱颜色和体重性状的基因组预测准确性。
Int J Mol Sci. 2024 Sep 9;25(17):9756. doi: 10.3390/ijms25179756.
6
Landscape genomics reveals regions associated with adaptive phenotypic and genetic variation in Ethiopian indigenous chickens.景观基因组学揭示了与埃塞俄比亚本土鸡适应表型和遗传变异相关的区域。
BMC Genomics. 2024 Mar 18;25(1):284. doi: 10.1186/s12864-024-10193-6.
7
Genomic prediction based on preselected single-nucleotide polymorphisms from genome-wide association study and imputed whole-genome sequence data annotation for growth traits in Duroc pigs.基于全基因组关联研究中预先选择的单核苷酸多态性和杜洛克猪生长性状的全基因组序列数据注释进行基因组预测。
Evol Appl. 2024 Feb 15;17(2):e13651. doi: 10.1111/eva.13651. eCollection 2024 Feb.
8
Increased Accuracy of Genomic Prediction Using Preselected SNPs from GWAS with Imputed Whole-Genome Sequence Data in Pigs.在猪中使用来自全基因组关联研究(GWAS)的预选择单核苷酸多态性(SNP)和推算的全基因组序列数据提高基因组预测的准确性。
Animals (Basel). 2023 Dec 15;13(24):3871. doi: 10.3390/ani13243871.
9
Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets.用于基因组预测的单倍型模块:在多个作物数据集中的比较评估
Front Plant Sci. 2023 Sep 5;14:1217589. doi: 10.3389/fpls.2023.1217589. eCollection 2023.
10
Improving the accuracy of genomic prediction for meat quality traits using whole genome sequence data in pigs.利用猪的全基因组序列数据提高肉质性状基因组预测的准确性。
J Anim Sci Biotechnol. 2023 May 10;14(1):67. doi: 10.1186/s40104-023-00863-y.
基于预选择和差异化加权标记模型的数值较小品种的基因组预测。
Genet Sel Evol. 2018 Oct 10;50(1):49. doi: 10.1186/s12711-018-0419-5.
4
Genomic evaluation of feed efficiency component traits in Duroc pigs using 80K, 650K and whole-genome sequence variants.利用 80K、650K 和全基因组序列变异对杜洛克猪饲料效率组成性状进行基因组评估。
Genet Sel Evol. 2018 Apr 6;50(1):14. doi: 10.1186/s12711-018-0387-9.
5
Imputation from SNP chip to sequence: a case study in a Chinese indigenous chicken population.从单核苷酸多态性(SNP)芯片到序列的填充:以一个中国地方鸡群体为例的研究
J Anim Sci Biotechnol. 2018 Mar 21;9:30. doi: 10.1186/s40104-018-0241-5. eCollection 2018.
6
Use of meta-analyses and joint analyses to select variants in whole genome sequences for genomic evaluation: An application in milk production of French dairy cattle breeds.使用荟萃分析和联合分析选择全基因组序列中的变异进行基因组评估:在法国奶牛品种的牛奶生产中的应用。
J Dairy Sci. 2018 Apr;101(4):3126-3139. doi: 10.3168/jds.2017-13587. Epub 2018 Feb 7.
7
Short communication: Genomic prediction using imputed whole-genome sequence variants in Brown Swiss Cattle.短讯:利用导入的全基因组序列变异对瑞士褐牛进行基因组预测。
J Dairy Sci. 2018 Feb;101(2):1292-1296. doi: 10.3168/jds.2017-12890. Epub 2017 Nov 15.
8
Improved imputation accuracy of rare and low-frequency variants using population-specific high-coverage WGS-based imputation reference panel.使用基于全基因组测序(WGS)的特定人群高覆盖度插补参考面板提高罕见和低频变异的插补准确性。
Eur J Hum Genet. 2017 Jun;25(7):869-876. doi: 10.1038/ejhg.2017.51. Epub 2017 Apr 12.
9
Selecting sequence variants to improve genomic predictions for dairy cattle.选择序列变异以改善奶牛的基因组预测。
Genet Sel Evol. 2017 Mar 7;49(1):32. doi: 10.1186/s12711-017-0307-4.
10
Whole genomic prediction of growth and carcass traits in a Chinese quality chicken population.中国优质鸡群体生长和胴体性状的全基因组预测
J Anim Sci. 2017 Jan;95(1):72-80. doi: 10.2527/jas.2016.0823.