• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于半参数多层次混合模型的基于集合的遗传关联推断的高效测试和效应大小估计。

Efficient testing and effect size estimation for set-based genetic association inference via semiparametric multilevel mixture modeling.

机构信息

Center for Spatial Information Science, The University of Tokyo, Chiba, Japan.

Research Center for Medical and Health Data Science, The Institute of Statistical Mathematics, Tokyo, Japan.

出版信息

Biom J. 2022 Aug;64(6):1142-1152. doi: 10.1002/bimj.202100234. Epub 2022 May 11.

DOI:10.1002/bimj.202100234
PMID:35543501
Abstract

In genetic association studies, rare variants with extremely low allele frequencies play a crucial role in complex traits. Therefore, set-based testing methods that jointly assess the effects of groups of single nucleotide polymorphisms (SNPs) were developed to increase the powers of the association tests. However, these powers are still insufficient, and precise estimations of the effect sizes of individual SNPs are largely impossible. In this article, we provide an efficient set-based statistical inference framework that addresses both of these important issues simultaneously using an empirical Bayes method with semiparametric multilevel mixture modeling. We propose to utilize the hierarchical model that incorporates variations in set-specific effects and to apply the optimal discovery procedure (ODP) that achieves the largest overall power in multiple significance testing. In addition, we provide an optimal "set-based" estimator of the empirical distribution of effect sizes. The efficiency of the proposed methods is demonstrated through application to a genome-wide association study of coronary artery disease and through simulation studies. The results demonstrated numerous rare variants with large effect sizes for coronary artery disease, and the number of significant sets detected by the ODP was much greater than those identified by existing methods.

摘要

在遗传关联研究中,具有极低等位基因频率的罕见变异在复杂性状中起着至关重要的作用。因此,开发了基于集合的测试方法,这些方法联合评估一组单核苷酸多态性 (SNP) 的影响,以提高关联测试的功效。然而,这些功效仍然不足,而且个体 SNP 效应大小的精确估计在很大程度上是不可能的。在本文中,我们提供了一个有效的基于集合的统计推断框架,该框架使用具有半参数多层次混合建模的经验贝叶斯方法同时解决了这两个重要问题。我们建议利用包含集合特定效应变化的分层模型,并应用最优发现程序 (ODP),以在多次显著性检验中实现最大的总体功效。此外,我们还提供了效应大小的经验分布的最优“基于集合”估计量。通过对冠状动脉疾病的全基因组关联研究的应用和模拟研究,证明了所提出方法的效率。结果表明,冠状动脉疾病存在许多具有较大效应大小的罕见变异,并且 ODP 检测到的显著集合数量远远超过了现有方法。

相似文献

1
Efficient testing and effect size estimation for set-based genetic association inference via semiparametric multilevel mixture modeling.基于半参数多层次混合模型的基于集合的遗传关联推断的高效测试和效应大小估计。
Biom J. 2022 Aug;64(6):1142-1152. doi: 10.1002/bimj.202100234. Epub 2022 May 11.
2
Exploration of empirical Bayes hierarchical modeling for the analysis of genome-wide association study data.探索经验贝叶斯层次模型在全基因组关联研究数据分析中的应用。
Biostatistics. 2011 Jul;12(3):445-61. doi: 10.1093/biostatistics/kxq072. Epub 2011 Jan 20.
3
The optimal discovery procedure in multiple significance testing: an empirical Bayes approach.多重检验中最优的发现程序:经验贝叶斯方法。
Stat Med. 2012 Jan 30;31(2):165-76. doi: 10.1002/sim.4375. Epub 2011 Oct 4.
4
Application of the optimal discovery procedure to genetic case-control studies: comparison with p values and asymptotic Bayes factors.最优发现程序在基因病例对照研究中的应用:与p值和渐近贝叶斯因子的比较
Hum Hered. 2011;71(1):37-49. doi: 10.1159/000323518. Epub 2011 Mar 10.
5
An Empirical Bayes Mixture Model for Effect Size Distributions in Genome-Wide Association Studies.全基因组关联研究中效应大小分布的经验贝叶斯混合模型。
PLoS Genet. 2015 Dec 29;11(12):e1005717. doi: 10.1371/journal.pgen.1005717. eCollection 2015 Dec.
6
An empirical Bayes optimal discovery procedure based on semiparametric hierarchical mixture models.基于半参数层次混合模型的经验贝叶斯最优发现程序。
Comput Math Methods Med. 2013;2013:568480. doi: 10.1155/2013/568480. Epub 2013 Apr 10.
7
Parametric estimation of the local false discovery rate for identifying genetic associations.基于参数估计的局部假发现率在识别遗传关联中的应用
IEEE/ACM Trans Comput Biol Bioinform. 2013 Jan-Feb;10(1):98-108. doi: 10.1109/TCBB.2012.140.
8
Structured Genome-Wide Association Studies with Bayesian Hierarchical Variable Selection.基于贝叶斯分层变量选择的结构全基因组关联研究。
Genetics. 2019 Jun;212(2):397-415. doi: 10.1534/genetics.119.301906. Epub 2019 Apr 22.
9
A simple Bayesian mixture model with a hybrid procedure for genome-wide association studies.一种简单的贝叶斯混合模型,结合了混合算法用于全基因组关联研究。
Eur J Hum Genet. 2010 Aug;18(8):942-7. doi: 10.1038/ejhg.2010.51. Epub 2010 Apr 21.
10
A novel Bayesian semiparametric algorithm for inferring population structure and adjusting for case-control association tests.一种用于推断群体结构并对病例对照关联检验进行校正的新型贝叶斯半参数算法。
Biometrics. 2013 Mar;69(1):164-73. doi: 10.1111/biom.12004. Epub 2013 Feb 21.