
A fast genomic selection approach for large genomic data.

Authors

Liu Hailan, Chen Guo-Bo

Affiliations

Maize Research Institute, Sichuan Agricultural University, Chengdu, Sichuan Province, 611130, China.

Evergreen Landscape and Architecture Studio, Xixi Road 562, Hangzhou, Zhejiang Province, 310026, China.

Publication information

Theor Appl Genet. 2017 Jun;130(6):1277-1284. doi: 10.1007/s00122-017-2887-3. Epub 2017 Apr 7.

DOI: 10.1007/s00122-017-2887-3
PMID: 28389770
Abstract

We propose HE-BLP, a novel computational method for genomic selection that combines identical-by-state (IBS)-based Haseman-Elston (HE) regression with best linear prediction (BLP). Genomic best linear unbiased prediction (GBLUP) has been widely used in whole-genome prediction for breeding programs. To determine the total genetic variance of a training population, GBLUP requires solving a linear mixed model (LMM) via restricted maximum likelihood (REML), whose computational complexity is cubic in the sample size. With HE-BLP, the total genetic variance is instead estimated by solving a simple HE linear regression, whose computational complexity is quadratic in the sample size. The method is therefore suitable for large-scale genomic data, except for data in which environmental effects need to be estimated simultaneously, because it does not allow for this estimation. In Monte Carlo simulation studies, the estimated heritability based on HE was identical to that based on REML, and the prediction accuracy of HE-BLP and traditional GBLUP was also quite similar when quantitative trait loci (QTLs) were randomly distributed along the genome and their effects followed a normal distribution. In addition, the kernel row number (KRN) trait in a maize IBM population was used to evaluate the performance of the two methods; the results showed similar prediction accuracy of breeding values despite slightly different heritability estimates from HE and REML, probably due to the underlying genetic architecture. HE-BLP can be a future genomic selection method of choice for even larger sets of genomic data in the special cases where environmental effects can be ignored. The software for HE regression and the simulation program are available online in the Genetic Analysis Repository (GEAR; https://github.com/gc5k/GEAR/wiki).
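The two steps described in the abstract can be illustrated with a minimal NumPy sketch. It is not the GEAR implementation. It assumes a common textbook form of HE regression (squared phenotypic differences regressed on pairwise marker-based relationships) and a standard ridge-type best linear predictor. The function names, the simulated data, and the clipping of the heritability estimate are all hypothetical choices made for illustration.

```python
import numpy as np


def genomic_relationship(Z):
    """Marker-based relationship matrix Z Z' / m for standardized genotypes Z (n x m)."""
    return Z @ Z.T / Z.shape[1]


def he_heritability(G, y):
    """HE regression: regress (y_i - y_j)^2 on G_ij over all pairs i < j.

    Assuming E[(y_i - y_j)^2] = 2*var_p - 2*var_g*G_ij, the intercept estimates
    2*var_p and the slope estimates -2*var_g, so h2 = -slope / intercept.
    The cost is one pass over the n(n-1)/2 pairs.
    """
    i, j = np.triu_indices(len(y), k=1)
    slope, intercept = np.polyfit(G[i, j], (y[i] - y[j]) ** 2, 1)
    return -slope / intercept


def blp_predict(G_train, G_test_train, y_train, h2):
    """Best linear prediction of test phenotypes given a heritability estimate.

    The clipping is an illustrative safeguard (assumption), keeping the
    variance ratio lambda = (1 - h2) / h2 finite and positive.
    """
    h2 = float(np.clip(h2, 1e-3, 1.0 - 1e-3))
    lam = (1.0 - h2) / h2
    mu = y_train.mean()
    # Single linear solve for the prediction step; no iterative REML fit.
    alpha = np.linalg.solve(G_train + lam * np.eye(len(y_train)), y_train - mu)
    return mu + G_test_train @ alpha


def simulate_demo(n_train=500, n_test=100, m=500, h2=0.5, seed=0):
    """Toy simulation (hypothetical settings): independent markers, normal effects."""
    rng = np.random.default_rng(seed)
    n = n_train + n_test
    freq = rng.uniform(0.1, 0.9, size=m)
    X = rng.binomial(2, freq, size=(n, m)).astype(float)
    Z = (X - X.mean(axis=0)) / X.std(axis=0)          # standardize each marker
    beta = rng.normal(0.0, np.sqrt(h2 / m), size=m)   # normally distributed QTL effects
    g = Z @ beta                                      # true genetic values
    y = g + rng.normal(0.0, np.sqrt(1.0 - h2), size=n)
    G = genomic_relationship(Z)
    tr, te = slice(0, n_train), slice(n_train, n)
    h2_hat = he_heritability(G[tr, tr], y[tr])
    pred = blp_predict(G[tr, tr], G[te, tr], y[tr], h2_hat)
    accuracy = np.corrcoef(pred, g[te])[0, 1]         # correlation with true genetic values
    return h2_hat, accuracy


if __name__ == "__main__":
    h2_hat, accuracy = simulate_demo()
    print(f"HE heritability estimate: {h2_hat:.3f}, prediction accuracy: {accuracy:.3f}")
```

In this sketch the variance-component step touches each pair of individuals once, which is where the quadratic cost comes from. The prediction step still solves one linear system, and the abstract does not state how the paper handles that step. The model has no fixed effects beyond a mean, consistent with the stated limitation that environmental effects cannot be estimated simultaneously.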

