A robust DF-REML framework for variance components estimation in genetic studies.

Affiliations

Centre for Mathematics and Applications (CMA) and Department of Mathematics, FCT - NOVA University of Lisbon, Lisbon, Portugal.

Department of Statistics, Federal University of Bahia, Bahia, Brazil.

Publication information

Bioinformatics. 2017 Nov 15;33(22):3584-3594. doi: 10.1093/bioinformatics/btx457.

DOI:10.1093/bioinformatics/btx457
PMID:29036274
Abstract

MOTIVATION

In genetic association studies, linear mixed models (LMMs) are used to test for associations between phenotypes and candidate single nucleotide polymorphisms (SNPs). These same models are also used to estimate heritability, which is central not only to evolutionary biology but also to the prediction of the response to selection in plant and animal breeding, as well as the prediction of disease risk in humans. However, when one or more of the underlying assumptions are violated, the estimation of variance components may be compromised and therefore so may the estimates of heritability and any other functions of these. Considering that datasets obtained from real life experiments are prone to several sources of contamination, which usually induce the violation of the assumption of the normality of the errors, a robust derivative-free restricted-maximum likelihood framework (DF-REML) together with a robust coefficient of determination are proposed for the LMM in the context of genetic studies of continuous traits.
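For orientation, the quantities named above can be written in the standard textbook form of the LMM. The notation is an assumption for illustration ($K$ as a marker-based relationship matrix scaled so its mean diagonal is about 1, $\sigma_g^2$ and $\sigma_e^2$ as the genetic and residual variance components) and is not necessarily the notation used in the article.

```latex
\begin{aligned}
  y &= X\beta + Zu + \varepsilon, \qquad
  u \sim N\!\left(0,\ \sigma_g^2 K\right), \qquad
  \varepsilon \sim N\!\left(0,\ \sigma_e^2 I_n\right), \\
  \operatorname{Var}(y) &= V = \sigma_g^2\, Z K Z^{\top} + \sigma_e^2 I_n, \\
  h^2 &= \frac{\sigma_g^2}{\sigma_g^2 + \sigma_e^2}.
\end{aligned}
```

Because $h^2$ is a function of the two variance components, any bias in their estimates (for example, from non-normal errors) carries over directly to the heritability estimate.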

RESULTS

The proposed approach, in addition to the robust estimation of variance components and robust computation of the coefficient of determination, allows in particular for the robust estimation of SNP-based heritability by reducing the bias and increasing the precision of its estimates. The performance of both classical and robust DF-REML approaches is compared via a Monte Carlo simulation study. Additionally, three examples of application of the methodologies to real datasets are given in order to validate the usefulness of the proposed robust approach. Although the main focus of this article is on plant breeding applications, the proposed methodology is applicable to both human and animal genetic studies.
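As a point of reference, the classical (non-robust) derivative-free REML idea can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions: a single random genetic effect with Z = I, a marker-based relationship matrix `G`, and Nelder-Mead as the derivative-free optimizer. The function names (`simulate_data`, `neg_reml_loglik`, `fit_df_reml`) are hypothetical, and the code does not reproduce the authors' robust estimator or their R implementation in the Supplementary Material.

```python
import numpy as np
from scipy.optimize import minimize


def simulate_data(n=300, m=500, h2=0.5, seed=0):
    """Simulate a phenotype with SNP-based heritability h2 (total variance 1)."""
    rng = np.random.default_rng(seed)
    M = rng.binomial(2, 0.5, size=(n, m)).astype(float)  # genotypes coded 0/1/2
    M = (M - M.mean(axis=0)) / M.std(axis=0)             # standardize each SNP
    G = M @ M.T / m                                      # relationship matrix
    a = rng.standard_normal(m)                           # SNP effects
    u = M @ a * np.sqrt(h2 / m)                          # Var(u) = h2 * G
    e = rng.standard_normal(n) * np.sqrt(1.0 - h2)       # normal residuals
    X = np.ones((n, 1))                                  # intercept only
    y = 1.0 + u + e
    return y, X, G


def neg_reml_loglik(log_params, y, X, G):
    """Negative restricted log-likelihood (up to a constant).

    Variances are parameterized on the log scale so they stay positive.
    """
    sigma_g2, sigma_e2 = np.exp(log_params)
    n = y.shape[0]
    V = sigma_g2 * G + sigma_e2 * np.eye(n)
    chol = np.linalg.cholesky(V)
    logdet_V = 2.0 * np.sum(np.log(np.diag(chol)))
    Vinv_X = np.linalg.solve(V, X)
    Vinv_y = np.linalg.solve(V, y)
    XtVinvX = X.T @ Vinv_X
    beta = np.linalg.solve(XtVinvX, X.T @ Vinv_y)        # GLS fixed effects
    resid = y - X @ beta
    quad = resid @ np.linalg.solve(V, resid)             # equals y' P y
    _, logdet_XtVinvX = np.linalg.slogdet(XtVinvX)
    return 0.5 * (logdet_V + logdet_XtVinvX + quad)


def fit_df_reml(y, X, G):
    """Maximize the restricted likelihood without derivatives (Nelder-Mead)."""
    start = np.log([0.5 * np.var(y), 0.5 * np.var(y)])
    res = minimize(neg_reml_loglik, start, args=(y, X, G), method="Nelder-Mead")
    sigma_g2, sigma_e2 = np.exp(res.x)
    return sigma_g2, sigma_e2, sigma_g2 / (sigma_g2 + sigma_e2)


if __name__ == "__main__":
    y, X, G = simulate_data()
    sigma_g2, sigma_e2, h2_hat = fit_df_reml(y, X, G)
    print(f"sigma_g2={sigma_g2:.3f}  sigma_e2={sigma_e2:.3f}  h2={h2_hat:.3f}")
```

To see the kind of sensitivity the abstract describes, one could contaminate a few values of `y` with large outliers and refit. The classical estimates would be expected to shift, which is the situation a robust variant is meant to handle.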

AVAILABILITY AND IMPLEMENTATION

Source code implemented in R is available in the Supplementary Material.

CONTACT

vmml@fct.unl.pt.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
