• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有基因型不确定性的大规模 GWAS 的有效方法。

Efficient approaches for large-scale GWAS with genotype uncertainty.

机构信息

Department of Biology, The Bioinformatics Centre, University of Copenhagen, 2200 Copenhagen N, Denmark.

Faculty of Health and Medical Sciences, Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, 2200 Copenhagen N, Denmark.

出版信息

G3 (Bethesda). 2022 Jan 4;12(1). doi: 10.1093/g3journal/jkab385.

DOI:10.1093/g3journal/jkab385
PMID:34865001
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8727990/
Abstract

Association studies using genetic data from SNP-chip-based imputation or low-depth sequencing data provide a cost-efficient design for large-scale association studies. We explore methods for performing association studies applicable to such genetic data and investigate how using different priors when estimating genotype probabilities affects the association results. Our proposed method, ANGSD-asso's latent model, models the unobserved genotype as a latent variable in a generalized linear model framework. The software is implemented in C/C++ and can be run multi-threaded. ANGSD-asso is based on genotype probabilities, which can be estimated using either the sample allele frequency or the individual allele frequencies as a prior. We explore through simulations how genotype probability-based methods compare with using genetic dosages. Our simulations show that in a structured population using the individual allele frequency prior has better power than the sample allele frequency. In scenarios with sequencing depth and phenotype correlation ANGSD-asso's latent model has higher statistical power and less bias than using dosages. Adding additional covariates to the linear model of ANGSD-asso's latent model has higher statistical power and less bias than other methods that accommodate genotype uncertainty, while also being much faster. This is shown with imputed data from UK Biobank and simulations.

摘要

利用基于 SNP 芯片的 imputation 或低深度测序数据的遗传关联研究为大规模关联研究提供了一种具有成本效益的设计。我们探索了适用于此类遗传数据的关联研究方法,并研究了当估计基因型概率时使用不同先验如何影响关联结果。我们提出的方法 ANGSD-asso 的潜在模型,将未观察到的基因型建模为广义线性模型框架中的一个潜在变量。该软件是用 C/C++ 实现的,可以多线程运行。ANGSD-asso 基于基因型概率,可以使用样本等位基因频率或个体等位基因频率作为先验来估计。我们通过模拟来探索基于基因型概率的方法与使用遗传剂量相比的情况。我们的模拟表明,在结构群体中,使用个体等位基因频率作为先验比使用样本等位基因频率具有更高的功效。在测序深度和表型相关性的情况下,ANGSD-asso 的潜在模型比使用剂量具有更高的统计功效和更小的偏差。在 ANGSD-asso 的潜在模型的线性模型中添加额外的协变量比其他适应基因型不确定性的方法具有更高的统计功效和更小的偏差,同时也快得多。这是用英国生物库的 imputed 数据和模拟来证明的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655d/8727990/f1f4f586fdec/jkab385f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655d/8727990/b7261607b0e8/jkab385f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655d/8727990/a32b8b8dcf5e/jkab385f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655d/8727990/4375cb984fc1/jkab385f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655d/8727990/1ffdbd3d722c/jkab385f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655d/8727990/f1f4f586fdec/jkab385f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655d/8727990/b7261607b0e8/jkab385f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655d/8727990/a32b8b8dcf5e/jkab385f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655d/8727990/4375cb984fc1/jkab385f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655d/8727990/1ffdbd3d722c/jkab385f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655d/8727990/f1f4f586fdec/jkab385f5.jpg

相似文献

1
Efficient approaches for large-scale GWAS with genotype uncertainty.具有基因型不确定性的大规模 GWAS 的有效方法。
G3 (Bethesda). 2022 Jan 4;12(1). doi: 10.1093/g3journal/jkab385.
2
Comparison of multiple imputation and other methods for the analysis of imputed genotypes.比较多种插补方法和其他方法在分析插补基因型中的应用。
BMC Genomics. 2023 Jun 6;24(1):303. doi: 10.1186/s12864-023-09415-0.
3
Enhanced statistical tests for GWAS in admixed populations: assessment using African Americans from CARe and a Breast Cancer Consortium.混合人群 GWAS 的增强统计检验:使用来自 CARe 和乳腺癌联盟的非裔美国人进行评估。
PLoS Genet. 2011 Apr;7(4):e1001371. doi: 10.1371/journal.pgen.1001371. Epub 2011 Apr 21.
4
Evaluation and application of summary statistic imputation to discover new height-associated loci.评估和应用汇总统计推断发现新的身高相关位点。
PLoS Genet. 2018 May 21;14(5):e1007371. doi: 10.1371/journal.pgen.1007371. eCollection 2018 May.
5
A unified approach for allele frequency estimation, SNP detection and association studies based on pooled sequencing data using EM algorithms.基于 EM 算法的基于测序数据的等位基因频率估计、SNP 检测和关联研究的统一方法。
BMC Genomics. 2013;14 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2164-14-S1-S1. Epub 2013 Jan 21.
6
Accuracy of genome-wide imputation of untyped markers and impacts on statistical power for association studies.未分型标记的全基因组推断准确性及其对关联研究统计效能的影响。
BMC Genet. 2009 Jun 16;10:27. doi: 10.1186/1471-2156-10-27.
7
ANGSD: Analysis of Next Generation Sequencing Data.ANGSD:下一代测序数据分析
BMC Bioinformatics. 2014 Nov 25;15(1):356. doi: 10.1186/s12859-014-0356-4.
8
Estimation of allele frequency and association mapping using next-generation sequencing data.利用下一代测序数据进行等位基因频率估计和关联作图。
BMC Bioinformatics. 2011 Jun 11;12:231. doi: 10.1186/1471-2105-12-231.
9
Imputation strategies for low-coverage whole-genome sequencing data and their effects on genomic prediction and genome-wide association studies in pigs.低覆盖度全基因组测序数据的插补策略及其对猪基因组预测和全基因组关联研究的影响。
Animal. 2024 Sep;18(9):101258. doi: 10.1016/j.animal.2024.101258. Epub 2024 Jul 25.
10
A comparison of approaches to account for uncertainty in analysis of imputed genotypes.比较分析推断基因型时考虑不确定性的方法。
Genet Epidemiol. 2011 Feb;35(2):102-10. doi: 10.1002/gepi.20552.

引用本文的文献

1
Biobanks in GENETICS and G3: tackling the statistical challenges.遗传学领域及《G3:基因与基因组学》中的生物样本库:应对统计学挑战
Genetics. 2025 Apr 17;229(4). doi: 10.1093/genetics/iyaf046.
2
Biobanks in GENETICS and G3: tackling the statistical challenges.遗传学领域及《G3:基因与基因组学》中的生物样本库:应对统计学挑战
G3 (Bethesda). 2025 Apr 17;15(4). doi: 10.1093/g3journal/jkaf060.
3
Variable parallelism in the genomic basis of age at maturity across spatial scales in Atlantic Salmon.大西洋鲑鱼成熟年龄基因组基础在空间尺度上的可变平行性。
Ecol Evol. 2024 Apr 5;14(4):e11068. doi: 10.1002/ece3.11068. eCollection 2024 Apr.
4
Convergent genomic signatures of local adaptation across a continental-scale environmental gradient.大陆尺度环境梯度上的局部适应的趋同基因组特征。
Sci Adv. 2023 May 19;9(20):eadd0560. doi: 10.1126/sciadv.add0560.
5
Ancestry-related distribution of Runs of homozygosity and functional variants in Qatari population.卡塔尔人群中与血统相关的纯合子区域和功能变异的分布。
BMC Genom Data. 2022 Sep 21;23(1):73. doi: 10.1186/s12863-022-01087-1.
6
Locally Adaptive Inversions Modulate Genetic Variation at Different Geographic Scales in a Seaweed Fly.局部适应性反转在海藻蝇的不同地理尺度上调节遗传变异。
Mol Biol Evol. 2021 Aug 23;38(9):3953-3971. doi: 10.1093/molbev/msab143.