• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 MINQUE 和批量训练的快速遗传力估计。

Fast heritability estimation based on MINQUE and batch training.

机构信息

Division of Health Statistics, School of Public Health, Shanxi Medical University, No.56 Xin jian South Road, 030001 Shanxi, China.

Department of Biostatistics, University of Florida, 2004 Mowry Road, 32611 FL, USA.

出版信息

Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac115.

DOI:10.1093/bib/bbac115
PMID:35383355
Abstract

Heritability, the proportion of phenotypic variance explained by genome-wide single nucleotide polymorphisms (SNPs) in unrelated individuals, is an important measure of the genetic contribution to human diseases and plays a critical role in studying the genetic architecture of human diseases. Linear mixed model (LMM) has been widely used for SNP heritability estimation, where variance component parameters are commonly estimated by using a restricted maximum likelihood (REML) method. REML is an iterative optimization algorithm, which is computationally intensive when applied to large-scale datasets (e.g. UK Biobank). To facilitate the heritability analysis of large-scale genetic datasets, we develop a fast approach, minimum norm quadratic unbiased estimator (MINQUE) with batch training, to estimate variance components from LMM (LMM.MNQ.BCH). In LMM.MNQ.BCH, the parameters are estimated by MINQUE, which has a closed-form solution for fast computation and has no convergence issue. Batch training has also been adopted in LMM.MNQ.BCH to accelerate the computation for large-scale genetic datasets. Through simulations and real data analysis, we demonstrate that LMM.MNQ.BCH is much faster than two existing approaches, GCTA and BOLT-REML.

摘要

遗传力是指在无亲缘关系的个体中,由全基因组单核苷酸多态性(SNPs)解释的表型方差的比例,是衡量人类疾病遗传贡献的重要指标,在研究人类疾病的遗传结构中起着关键作用。线性混合模型(LMM)已广泛用于 SNP 遗传力估计,其中方差分量参数通常通过使用限制最大似然(REML)方法进行估计。REML 是一种迭代优化算法,当应用于大规模数据集(例如 UK Biobank)时,计算量很大。为了便于对大规模遗传数据集进行遗传力分析,我们开发了一种快速方法,即使用批量训练的最小范数二次无偏估计器(MINQUE),从 LMM 中估计方差分量(LMM.MNQ.BCH)。在 LMM.MNQ.BCH 中,参数通过 MINQUE 进行估计,MINQUE 具有快速计算的闭式解,不存在收敛问题。批量训练也已应用于 LMM.MNQ.BCH 中,以加速大规模遗传数据集的计算。通过模拟和真实数据分析,我们证明 LMM.MNQ.BCH 比现有的两种方法 GCTA 和 BOLT-REML 快得多。

相似文献

1
Fast heritability estimation based on MINQUE and batch training.基于 MINQUE 和批量训练的快速遗传力估计。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac115.
2
Estimating SNP heritability in presence of population substructure in biobank-scale datasets.在生物库规模数据集存在群体亚结构的情况下估计 SNP 遗传力。
Genetics. 2022 Apr 4;220(4). doi: 10.1093/genetics/iyac015.
3
Fast and Accurate Construction of Confidence Intervals for Heritability.快速准确地构建遗传力的置信区间
Am J Hum Genet. 2016 Jun 2;98(6):1181-1192. doi: 10.1016/j.ajhg.2016.04.016.
4
Efficient estimation of SNP heritability using Gaussian predictive process in large scale cohort studies.利用高斯预测过程在大规模队列研究中高效估计 SNP 遗传力。
PLoS Genet. 2022 Apr 20;18(4):e1010151. doi: 10.1371/journal.pgen.1010151. eCollection 2022 Apr.
5
A scalable estimator of SNP heritability for biobank-scale data.用于生物库规模数据的 SNP 遗传力可扩展估计器。
Bioinformatics. 2018 Jul 1;34(13):i187-i194. doi: 10.1093/bioinformatics/bty253.
6
A UNIFIED FRAMEWORK FOR VARIANCE COMPONENT ESTIMATION WITH SUMMARY STATISTICS IN GENOME-WIDE ASSOCIATION STUDIES.全基因组关联研究中基于汇总统计量的方差分量估计统一框架
Ann Appl Stat. 2017 Dec;11(4):2027-2051. doi: 10.1214/17-AOAS1052. Epub 2017 Dec 28.
7
Mixed model approaches for diallel analysis based on a bio-model.基于生物模型的双列分析混合模型方法。
Genet Res. 1996 Dec;68(3):233-40. doi: 10.1017/s0016672300034200.
8
Hybrid of Restricted and Penalized Maximum Likelihood Method for Efficient Genome-Wide Association Study.基于受限极大似然和惩罚极大似然法的高效全基因组关联研究混合方法
Genes (Basel). 2020 Oct 29;11(11):1286. doi: 10.3390/genes11111286.
9
Methodological Considerations in Estimation of Phenotype Heritability Using Genome-Wide SNP Data, Illustrated by an Analysis of the Heritability of Height in a Large Sample of African Ancestry Adults.利用全基因组SNP数据估计表型遗传力的方法学考量,以对大量非洲裔成年人身高遗传力的分析为例
PLoS One. 2015 Jun 30;10(6):e0131106. doi: 10.1371/journal.pone.0131106. eCollection 2015.
10
A robust DF-REML framework for variance components estimation in genetic studies.一种稳健的 DF-REML 框架,用于遗传研究中的方差分量估计。
Bioinformatics. 2017 Nov 15;33(22):3584-3594. doi: 10.1093/bioinformatics/btx457.

引用本文的文献

1
AIGen: an artificial intelligence software for complex genetic data analysis.AIGen:用于复杂基因数据分析的人工智能软件。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae566.