• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于全基因组关联研究中推断汇总统计数据的统一框架。

A Unifying Framework for Imputing Summary Statistics in Genome-Wide Association Studies.

机构信息

Department of Computer Science, University of California, Los Angeles, Los Angeles.

Department of Human Genetics, and University of California, Los Angeles, Los Angeles.

出版信息

J Comput Biol. 2020 Mar;27(3):418-428. doi: 10.1089/cmb.2019.0449. Epub 2020 Feb 13.

DOI:10.1089/cmb.2019.0449
Abstract

Methods to impute missing data are routinely used to increase power in genome-wide association studies. There are two broad classes of imputation methods. The first class imputes genotypes at the untyped variants, given those at the typed variants, and then performs a statistical test of association at the imputed variants. The second class, summary statistic imputation (SSI), directly imputes association statistics at the untyped variants, given the association statistics observed at the typed variants. The second class is appealing as it tends to be computationally efficient while only requiring the summary statistics from a study, while the former class requires access to individual-level data that can be difficult to obtain. The statistical properties of these two classes of imputation methods have not been fully understood. In this study, we show that the two classes of imputation methods yield association statistics with similar distributions for sufficiently large sample sizes. Using this relationship, we can understand the effect of the imputation method on power. We show that a commonly used approach to SSI that we term SSI with variance reweighting generally leads to a loss in power. On the contrary, our proposed method for SSI that does not perform variance reweighting fully accounts for imputation uncertainty, while achieving better power.

摘要

方法来填补缺失的数据通常用于提高全基因组关联研究的功效。有两种广泛的填补方法。第一类填补方法在给定已分型变异的情况下,对未分型变异的基因型进行填补,然后在填补的变异体上进行关联统计检验。第二类,汇总统计量填补(SSI),直接在未分型变异体上进行关联统计量的填补,给定在分型变异体上观察到的关联统计量。第二类方法很有吸引力,因为它在只需要研究的汇总统计量的情况下往往计算效率高,而第一类方法则需要访问个体水平的数据,这可能很难获得。这两类填补方法的统计特性尚未完全理解。在这项研究中,我们表明,在足够大的样本量下,这两类填补方法产生的关联统计量具有相似的分布。利用这种关系,我们可以了解填补方法对功效的影响。我们表明,我们称之为方差重新加权的 SSI 的常用 SSI 方法通常会导致功效降低。相反,我们提出的不进行方差重新加权的 SSI 方法完全考虑了填补不确定性,同时实现了更好的功效。

相似文献

1
A Unifying Framework for Imputing Summary Statistics in Genome-Wide Association Studies.用于全基因组关联研究中推断汇总统计数据的统一框架。
J Comput Biol. 2020 Mar;27(3):418-428. doi: 10.1089/cmb.2019.0449. Epub 2020 Feb 13.
2
Evaluation and application of summary statistic imputation to discover new height-associated loci.评估和应用汇总统计推断发现新的身高相关位点。
PLoS Genet. 2018 May 21;14(5):e1007371. doi: 10.1371/journal.pgen.1007371. eCollection 2018 May.
3
DISTMIX: direct imputation of summary statistics for unmeasured SNPs from mixed ethnicity cohorts.DISTMIX:从混合种族队列中直接推算未测量单核苷酸多态性的汇总统计量。
Bioinformatics. 2015 Oct 1;31(19):3099-104. doi: 10.1093/bioinformatics/btv348. Epub 2015 Jun 9.
4
DIST: direct imputation of summary statistics for unmeasured SNPs.直接对未测量的 SNP 进行汇总统计的推断。
Bioinformatics. 2013 Nov 15;29(22):2925-7. doi: 10.1093/bioinformatics/btt500. Epub 2013 Aug 28.
5
Fast and accurate imputation of summary statistics enhances evidence of functional enrichment.快速准确地推断汇总统计数据可增强功能富集的证据。
Bioinformatics. 2014 Oct 15;30(20):2906-14. doi: 10.1093/bioinformatics/btu416. Epub 2014 Jul 1.
6
Effect of genome-wide genotyping and reference panels on rare variants imputation.全基因组基因分型和参考面板对稀有变异体推断的影响。
J Genet Genomics. 2012 Oct 20;39(10):545-50. doi: 10.1016/j.jgg.2012.07.002. Epub 2012 Jul 24.
7
FAPI: Fast and accurate P-value Imputation for genome-wide association study.FAPI:用于全基因组关联研究的快速准确P值估算
Eur J Hum Genet. 2016 May;24(5):761-6. doi: 10.1038/ejhg.2015.190. Epub 2015 Aug 26.
8
Improving Imputation Accuracy by Inferring Causal Variants in Genetic Studies.通过推断基因研究中的因果变异提高插补准确性。
J Comput Biol. 2019 Nov;26(11):1203-1213. doi: 10.1089/cmb.2018.0139. Epub 2018 Oct 1.
9
Genotype imputation in genome-wide association studies.全基因组关联研究中的基因型填充
Curr Protoc Hum Genet. 2013 Jul;Chapter 1:Unit 1.25. doi: 10.1002/0471142905.hg0125s78.
10
Analysis of untyped SNPs: maximum likelihood and imputation methods.非分型单核苷酸多态性分析:最大似然法和推断方法。
Genet Epidemiol. 2010 Dec;34(8):803-15. doi: 10.1002/gepi.20527.

引用本文的文献

1
CADET: Enhanced transcriptome-wide association analyses in admixed samples using eQTL summary data.学员:使用eQTL汇总数据对混合样本进行增强的全转录组关联分析。
Am J Hum Genet. 2025 Jul 3;112(7):1580-1596. doi: 10.1016/j.ajhg.2025.05.010. Epub 2025 Jun 13.
2
Animal-SNPAtlas: a comprehensive SNP database for multiple animals.动物-SNPAtlas:一个综合性的多动物 SNP 数据库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D816-D826. doi: 10.1093/nar/gkac954.
3
Plant-ImputeDB: an integrated multiple plant reference panel database for genotype imputation.

本文引用的文献

1
Colocalization of GWAS and eQTL Signals Detects Target Genes.全基因组关联研究(GWAS)与表达数量性状基因座(eQTL)信号的共定位可检测目标基因。
Am J Hum Genet. 2016 Dec 1;99(6):1245-1260. doi: 10.1016/j.ajhg.2016.10.003. Epub 2016 Nov 17.
2
Identification of causal genes for complex traits.复杂性状因果基因的鉴定。
Bioinformatics. 2015 Jun 15;31(12):i206-13. doi: 10.1093/bioinformatics/btv240.
3
Identifying causal variants at loci with multiple signals of association.在具有多个关联信号的基因座上识别因果变异。
植物 imputeDB:一个集成的多植物参考面板数据库,用于基因型推断。
Nucleic Acids Res. 2021 Jan 8;49(D1):D1480-D1488. doi: 10.1093/nar/gkaa953.
Genetics. 2014 Oct;198(2):497-508. doi: 10.1534/genetics.114.167908. Epub 2014 Aug 7.
4
Fast and accurate imputation of summary statistics enhances evidence of functional enrichment.快速准确地推断汇总统计数据可增强功能富集的证据。
Bioinformatics. 2014 Oct 15;30(20):2906-14. doi: 10.1093/bioinformatics/btu416. Epub 2014 Jul 1.
5
DIST: direct imputation of summary statistics for unmeasured SNPs.直接对未测量的 SNP 进行汇总统计的推断。
Bioinformatics. 2013 Nov 15;29(22):2925-7. doi: 10.1093/bioinformatics/btt500. Epub 2013 Aug 28.
6
Genome-wide association analysis identifies 13 new risk loci for schizophrenia.全基因组关联分析确定了 13 个精神分裂症的新风险位点。
Nat Genet. 2013 Oct;45(10):1150-9. doi: 10.1038/ng.2742. Epub 2013 Aug 25.
7
Genome-wide association analyses identify multiple loci associated with central corneal thickness and keratoconus.全基因组关联分析鉴定出多个与中央角膜厚度和圆锥角膜相关的位点。
Nat Genet. 2013 Feb;45(2):155-63. doi: 10.1038/ng.2506. Epub 2013 Jan 6.
8
Genome-wide association analyses identify 18 new loci associated with serum urate concentrations.全基因组关联分析鉴定出 18 个与血清尿酸浓度相关的新位点。
Nat Genet. 2013 Feb;45(2):145-54. doi: 10.1038/ng.2500. Epub 2012 Dec 23.
9
Fast and accurate genotype imputation in genome-wide association studies through pre-phasing.通过预分组实现全基因组关联研究中的快速准确基因型推断。
Nat Genet. 2012 Jul 22;44(8):955-9. doi: 10.1038/ng.2354.
10
Genome partitioning of genetic variation for complex traits using common SNPs.利用常见 SNP 对复杂性状的遗传变异进行基因组分区。
Nat Genet. 2011 Jun;43(6):519-25. doi: 10.1038/ng.823. Epub 2011 May 8.