• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

混合模型的预测残差平方和:在基因组预测中的应用

Predicted Residual Error Sum of Squares of Mixed Models: An Application for Genomic Prediction.

作者信息

Xu Shizhong

机构信息

Department of Botany and Plant Sciences, University of California, Riverside, California 92521

出版信息

G3 (Bethesda). 2017 Mar 10;7(3):895-909. doi: 10.1534/g3.116.038059.

DOI:10.1534/g3.116.038059
PMID:28108552
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5345720/
Abstract

Genomic prediction is a statistical method to predict phenotypes of polygenic traits using high-throughput genomic data. Most diseases and behaviors in humans and animals are polygenic traits. The majority of agronomic traits in crops are also polygenic. Accurate prediction of these traits can help medical professionals diagnose acute diseases and breeders to increase food products, and therefore significantly contribute to human health and global food security. The best linear unbiased prediction (BLUP) is an important tool to analyze high-throughput genomic data for prediction. However, to judge the efficacy of the BLUP model with a particular set of predictors for a given trait, one has to provide an unbiased mechanism to evaluate the predictability. Cross-validation (CV) is an essential tool to achieve this goal, where a sample is partitioned into parts of roughly equal size, one part is predicted using parameters estimated from the remaining - 1 parts, and eventually every part is predicted using a sample excluding that part. Such a CV is called the K-fold CV. Unfortunately, CV presents a substantial increase in computational burden. We developed an alternative method, the HAT method, to replace CV. The new method corrects the estimated residual errors from the whole sample analysis using the leverage values of a hat matrix of the random effects to achieve the predicted residual errors. Properties of the HAT method were investigated using seven agronomic and 1000 metabolomic traits of an inbred rice population. Results showed that the HAT method is a very good approximation of the CV method. The method was also applied to 10 traits in 1495 hybrid rice with 1.6 million SNPs, and to human height of 6161 subjects with roughly 0.5 million SNPs of the Framingham heart study data. Predictabilities of the HAT and CV methods were all similar. The HAT method allows us to easily evaluate the predictabilities of genomic prediction for large numbers of traits in very large populations.

摘要

基因组预测是一种利用高通量基因组数据预测多基因性状表型的统计方法。人类和动物的大多数疾病和行为都是多基因性状。作物的大多数农艺性状也是多基因的。准确预测这些性状有助于医学专业人员诊断急性疾病,帮助育种者提高粮食产量,从而对人类健康和全球粮食安全做出重大贡献。最佳线性无偏预测(BLUP)是分析高通量基因组数据进行预测的重要工具。然而,要判断针对给定性状的特定预测变量集的BLUP模型的有效性,必须提供一种无偏机制来评估可预测性。交叉验证(CV)是实现这一目标的重要工具,即将一个样本分成大致相等大小的 份,用其余 - 1份估计的参数预测其中一份,最终用排除该部分的样本预测每一部分。这种CV称为K折交叉验证。不幸的是,交叉验证会使计算负担大幅增加。我们开发了一种替代方法,即HAT方法,来取代交叉验证。该新方法利用随机效应的帽子矩阵的杠杆值校正全样本分析估计的残差,以获得预测残差。利用一个自交水稻群体的7个农艺性状和1000个代谢组学性状研究了HAT方法的特性。结果表明,HAT方法是交叉验证方法的一个很好的近似。该方法还应用于1495份具有160万个单核苷酸多态性(SNP)的杂交水稻的10个性状,以及弗雷明汉心脏研究数据中约50万个SNP的6161名受试者的身高。HAT方法和交叉验证方法的可预测性都相似。HAT方法使我们能够轻松评估非常大群体中大量性状的基因组预测的可预测性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20e2/5345720/afb95dffe40c/895f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20e2/5345720/28e2714ef357/895f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20e2/5345720/7636477fffd8/895f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20e2/5345720/63af5c44a589/895f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20e2/5345720/831b52ce897d/895f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20e2/5345720/afb95dffe40c/895f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20e2/5345720/28e2714ef357/895f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20e2/5345720/7636477fffd8/895f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20e2/5345720/63af5c44a589/895f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20e2/5345720/831b52ce897d/895f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20e2/5345720/afb95dffe40c/895f5.jpg

相似文献

1
Predicted Residual Error Sum of Squares of Mixed Models: An Application for Genomic Prediction.混合模型的预测残差平方和:在基因组预测中的应用
G3 (Bethesda). 2017 Mar 10;7(3):895-909. doi: 10.1534/g3.116.038059.
2
Metabolomic prediction of yield in hybrid rice.杂交水稻产量的代谢组学预测
Plant J. 2016 Oct;88(2):219-227. doi: 10.1111/tpj.13242. Epub 2016 Aug 29.
3
Boosting predictabilities of agronomic traits in rice using bivariate genomic selection.利用双变量基因组选择提高水稻农艺性状的可预测性。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa103.
4
Hybrid breeding of rice via genomic selection.利用基因组选择进行水稻杂种优势的杂交育种。
Plant Biotechnol J. 2020 Jan;18(1):57-67. doi: 10.1111/pbi.13170. Epub 2019 Jun 26.
5
Incorporation of parental phenotypic data into multi-omic models improves prediction of yield-related traits in hybrid rice.将父母表型数据纳入多组学模型可提高杂交水稻产量相关性状的预测能力。
Plant Biotechnol J. 2021 Feb;19(2):261-272. doi: 10.1111/pbi.13458. Epub 2020 Sep 2.
6
Whole-genome sequence-based genomic prediction in laying chickens with different genomic relationship matrices to account for genetic architecture.利用不同基因组关系矩阵在蛋鸡中基于全基因组序列进行基因组预测以考虑遗传结构。
Genet Sel Evol. 2017 Jan 16;49(1):8. doi: 10.1186/s12711-016-0277-y.
7
Genomic prediction of reproduction traits for Merino sheep.美利奴羊繁殖性状的基因组预测
Anim Genet. 2017 Jun;48(3):338-348. doi: 10.1111/age.12541. Epub 2017 Feb 17.
8
Predicting hybrid performance in rice using genomic best linear unbiased prediction.利用基因组最佳线性无偏预测法预测水稻杂种表现
Proc Natl Acad Sci U S A. 2014 Aug 26;111(34):12456-61. doi: 10.1073/pnas.1413750111. Epub 2014 Aug 11.
9
Genomic prediction of breeding values using previously estimated SNP variances.利用先前估计的单核苷酸多态性(SNP)方差进行育种值的基因组预测。
Genet Sel Evol. 2014 Sep 25;46(1):52. doi: 10.1186/s12711-014-0052-x.
10
Genomic selection using principal component regression.基于主成分回归的基因组选择。
Heredity (Edinb). 2018 Jul;121(1):12-23. doi: 10.1038/s41437-018-0078-x. Epub 2018 May 1.

引用本文的文献

1
Impact of different genomic relationship matrix construction methods on the accuracy of genomic prediction in different species.不同基因组关系矩阵构建方法对不同物种基因组预测准确性的影响。
Front Genet. 2025 May 2;16:1576248. doi: 10.3389/fgene.2025.1576248. eCollection 2025.
2
GA-GBLUP: leveraging the genetic algorithm to improve the predictability of genomic selection.GA-GBLUP:利用遗传算法提高基因组选择的预测能力。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae385.
3
An expression-directed linear mixed model discovering low-effect genetic variants.

本文引用的文献

1
Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction.在基因组预测中不进行交叉验证的交叉验证
G3 (Bethesda). 2016 Oct 13;6(10):3107-3128. doi: 10.1534/g3.116.033381.
2
Genomic heritability: what is it?基因组遗传力:它是什么?
PLoS Genet. 2015 May 5;11(5):e1005048. doi: 10.1371/journal.pgen.1005048. eCollection 2015 May.
3
Genomic analysis of hybrid rice varieties reveals numerous superior alleles that contribute to heterosis.杂交水稻品种的基因组分析揭示了众多有助于杂种优势的优良等位基因。
基于表达量的线性混合效应模型发现低效应遗传变异。
Genetics. 2024 Apr 3;226(4). doi: 10.1093/genetics/iyae018.
4
Total Reflection X-ray Fluorescence Spectrometric Analysis of Ten Lanthanides at the Ultratrace Level Having a High Degree of Overlap in the Emission Lines.发射线高度重叠的十种镧系元素超痕量水平的全反射X射线荧光光谱分析
ACS Omega. 2023 Oct 24;8(44):41402-41410. doi: 10.1021/acsomega.3c05139. eCollection 2023 Nov 7.
5
Integrating genome-wide association study into genomic selection for the prediction of agronomic traits in rice ( L.).将全基因组关联研究整合到基因组选择中以预测水稻(Oryza sativa L.)的农艺性状
Mol Breed. 2023 Nov 13;43(11):81. doi: 10.1007/s11032-023-01423-y. eCollection 2023 Nov.
6
Genome-wide association study and genomic prediction for yield and grain quality traits of hybrid rice.杂交水稻产量和稻米品质性状的全基因组关联研究及基因组预测
Mol Breed. 2022 Mar 18;42(4):16. doi: 10.1007/s11032-022-01289-6. eCollection 2022 Apr.
7
Genomic Prediction: Progress and Perspectives for Rice Improvement.基因组预测:水稻改良的进展与展望
Methods Mol Biol. 2022;2467:569-617. doi: 10.1007/978-1-0716-2205-6_21.
8
Genome-wide association mapping and genomic prediction of yield-related traits and starch pasting properties in cassava.木薯产量相关性状和淀粉糊化特性的全基因组关联图谱绘制及基因组预测
Theor Appl Genet. 2022 Jan;135(1):145-171. doi: 10.1007/s00122-021-03956-2. Epub 2021 Oct 18.
9
Fabrication of Weak C-Axis Preferred AlN Thin Film for Temperature Measurement.用于温度测量的弱c轴择优取向AlN薄膜的制备
Sensors (Basel). 2021 Aug 8;21(16):5345. doi: 10.3390/s21165345.
10
Intelligent Trajectory Tracking Behavior of a Multi-Joint Robotic Arm via Genetic-Swarm Optimization for the Inverse Kinematic Solution.基于遗传-群智能优化的逆运动学求解的多关节机械臂智能轨迹跟踪行为。
Sensors (Basel). 2021 May 3;21(9):3171. doi: 10.3390/s21093171.
Nat Commun. 2015 Feb 5;6:6258. doi: 10.1038/ncomms7258.
4
Predicting hybrid performance in rice using genomic best linear unbiased prediction.利用基因组最佳线性无偏预测法预测水稻杂种表现
Proc Natl Acad Sci U S A. 2014 Aug 26;111(34):12456-61. doi: 10.1073/pnas.1413750111. Epub 2014 Aug 11.
5
Genetic analysis of the metabolome exemplified using a rice population.利用水稻群体进行代谢组学的遗传分析。
Proc Natl Acad Sci U S A. 2013 Dec 10;110(50):20320-5. doi: 10.1073/pnas.1319681110. Epub 2013 Nov 20.
6
Mapping quantitative trait loci by controlling polygenic background effects.通过控制多基因背景效应进行数量性状基因座定位。
Genetics. 2013 Dec;195(4):1209-22. doi: 10.1534/genetics.113.157032. Epub 2013 Sep 27.
7
Prediction of complex human traits using the genomic best linear unbiased predictor.利用基因组最佳线性无偏预测器预测复杂人类特征。
PLoS Genet. 2013;9(7):e1003608. doi: 10.1371/journal.pgen.1003608. Epub 2013 Jul 11.
8
Priors in whole-genome regression: the bayesian alphabet returns.全基因组回归中的先验信息:贝叶斯字母表回归。
Genetics. 2013 Jul;194(3):573-96. doi: 10.1534/genetics.113.151753. Epub 2013 May 1.
9
A comprehensive genetic approach for improving prediction of skin cancer risk in humans.一种全面的遗传方法,用于提高人类皮肤癌风险预测的准确性。
Genetics. 2012 Dec;192(4):1493-502. doi: 10.1534/genetics.112.141705. Epub 2012 Oct 10.
10
Whole-genome regression and prediction methods applied to plant and animal breeding.全基因组回归和预测方法在动植物育种中的应用。
Genetics. 2013 Feb;193(2):327-45. doi: 10.1534/genetics.112.143313. Epub 2012 Jun 28.