利用英国生物库作为全球人群的全球参考：从 GWAS 汇总统计数据衡量祖先多样性的应用。

Using the UK Biobank as a global reference of worldwide populations: application to measuring ancestry diversity from GWAS summary statistics.

机构信息

National Centre for Register-based Research, Aarhus University, Aarhus 8210, Denmark.

出版信息

Bioinformatics. 2022 Jun 27;38(13):3477-3480. doi: 10.1093/bioinformatics/btac348.

DOI:10.1093/bioinformatics/btac348

PMID:35604078

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9237724/

Abstract

MOTIVATION

Measuring genetic diversity is an important problem because increasing genetic diversity is a key to making new genetic discoveries, while also being a major source of confounding to be aware of in genetics studies.

RESULTS

Using the UK Biobank data, a prospective cohort study with deep genetic and phenotypic data collected on almost 500 000 individuals from across the UK, we carefully define 21 distinct ancestry groups from all four corners of the world. These ancestry groups can serve as a global reference of worldwide populations, with a handful of applications. Here, we develop a method that uses allele frequencies and principal components derived from these ancestry groups to effectively measure ancestry proportions from allele frequencies of any genetic dataset.

AVAILABILITY AND IMPLEMENTATION

This method is implemented in function snp_ancestry_summary of R package bigsnpr.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

衡量遗传多样性是一个重要的问题，因为增加遗传多样性是做出新的遗传发现的关键，同时也是遗传学研究中需要注意的主要混杂来源。

结果

利用英国生物库（UK Biobank）的数据，这是一项前瞻性队列研究，对来自英国各地的近 50 万人进行了深入的遗传和表型数据收集，我们从世界的四个角落仔细定义了 21 个不同的祖先群体。这些祖先群体可以作为全球人口的全球参考，具有多种应用。在这里，我们开发了一种方法，该方法使用来自这些祖先群体的等位基因频率和主成分，从任何遗传数据集的等位基因频率中有效地测量祖先比例。

可用性和实现

该方法在 R 包 bigsnpr 的函数 snp_ancestry_summary 中实现。

补充信息

补充数据可在“Bioinformatics”在线获得。

相似文献

Using the UK Biobank as a global reference of worldwide populations: application to measuring ancestry diversity from GWAS summary statistics.

Bioinformatics. 2022 Jun 27;38(13):3477-3480. doi: 10.1093/bioinformatics/btac348.

An atlas of genetic associations in UK Biobank.

Nat Genet. 2018 Nov;50(11):1593-1599. doi: 10.1038/s41588-018-0248-z. Epub 2018 Oct 22.

Novel probabilistic models of spatial genetic ancestry with applications to stratification correction in genome-wide association studies.

Bioinformatics. 2017 Mar 15;33(6):879-885. doi: 10.1093/bioinformatics/btw720.

Improved ancestry inference using weights from external reference panels.

Bioinformatics. 2013 Jun 1;29(11):1399-406. doi: 10.1093/bioinformatics/btt144. Epub 2013 Mar 28.

A Fast and Accurate Method for Genome-wide Scale Phenome-wide G × E Analysis and Its Application to UK Biobank.

Am J Hum Genet. 2019 Dec 5;105(6):1182-1192. doi: 10.1016/j.ajhg.2019.10.008. Epub 2019 Nov 14.

Analyses of biomarker traits in diverse UK biobank participants identify associations missed by European-centric analysis strategies.

J Hum Genet. 2022 Feb;67(2):87-93. doi: 10.1038/s10038-021-00968-0. Epub 2021 Aug 11.

A fast and scalable framework for large-scale and ultrahigh-dimensional sparse regression with application to the UK Biobank.

PLoS Genet. 2020 Oct 23;16(10):e1009141. doi: 10.1371/journal.pgen.1009141. eCollection 2020 Oct.

Cross-ancestry genome-wide association studies identified heterogeneous loci associated with differences of allele frequency and regulome tagging between participants of European descent and other ancestry groups from the UK Biobank.

Hum Mol Genet. 2021 Jul 9;30(15):1457-1467. doi: 10.1093/hmg/ddab114.

Deriving GWAS summary estimates for paternal smoking in UK biobank: a GWAS by subtraction.

BMC Res Notes. 2023 Jul 30;16(1):159. doi: 10.1186/s13104-023-06438-4.

Portability of 245 polygenic scores when derived from the UK Biobank and applied to 9 ancestry groups from the same cohort.

Am J Hum Genet. 2022 Jan 6;109(1):12-23. doi: 10.1016/j.ajhg.2021.11.008.

引用本文的文献

scAI-SNP: a method for inferring ancestry from single-cell data.

BMC Methods. 2025;2(1):10. doi: 10.1186/s44330-025-00029-4. Epub 2025 May 19.

Association between plausible genetic factors and weight loss from GLP1-RA and bariatric surgery.

Nat Med. 2025 Apr 18. doi: 10.1038/s41591-025-03645-3.

Polygenic Scores and Mood Disorder Onsets in the Context of Family History and Early Psychopathology.

JAMA Netw Open. 2025 Apr 1;8(4):e255331. doi: 10.1001/jamanetworkopen.2025.5331.

Characterizing substructure via mixture modeling in large-scale genetic summary statistics.

Am J Hum Genet. 2025 Feb 6;112(2):235-253. doi: 10.1016/j.ajhg.2024.12.007. Epub 2025 Jan 16.

Adjusting for principal components can induce collider bias in genome-wide association studies.

PLoS Genet. 2024 Dec 16;20(12):e1011242. doi: 10.1371/journal.pgen.1011242. eCollection 2024 Dec.

Use of Estonian Biobank data and participant recall to improve Wilson's disease management.

Eur J Hum Genet. 2024 Dec 14. doi: 10.1038/s41431-024-01767-9.

Biobanking with genetics shapes precision medicine and global health.

Nat Rev Genet. 2025 Mar;26(3):191-202. doi: 10.1038/s41576-024-00794-y. Epub 2024 Nov 20.

A novel method for cell deconvolution using DNA methylation in PCA space.

BMC Genomics. 2024 Aug 23;25(1):798. doi: 10.1186/s12864-024-10652-0.

scAI-SNP: a method for inferring ancestry from single-cell data.

bioRxiv. 2024 May 17:2024.05.14.594208. doi: 10.1101/2024.05.14.594208.

Characterizing substructure via mixture modeling in large-scale genetic summary statistics.

bioRxiv. 2024 May 13:2024.01.29.577805. doi: 10.1101/2024.01.29.577805.

本文引用的文献

Portability of 245 polygenic scores when derived from the UK Biobank and applied to 9 ancestry groups from the same cohort.

Am J Hum Genet. 2022 Jan 6;109(1):12-23. doi: 10.1016/j.ajhg.2021.11.008.

A cross-population atlas of genetic associations for 220 human phenotypes.

Nat Genet. 2021 Oct;53(10):1415-1424. doi: 10.1038/s41588-021-00931-x. Epub 2021 Sep 30.

Mapping the human genetic architecture of COVID-19.

Nature. 2021 Dec;600(7889):472-477. doi: 10.1038/s41586-021-03767-x. Epub 2021 Jul 8.

Summix: A method for detecting and adjusting for population structure in genetic summary data.

Am J Hum Genet. 2021 Jul 1;108(7):1270-1282. doi: 10.1016/j.ajhg.2021.05.016. Epub 2021 Jun 21.

Whole genome sequencing in the Middle Eastern Qatari population identifies genetic associations with 45 clinically relevant traits.

Nat Commun. 2021 Feb 23;12(1):1250. doi: 10.1038/s41467-021-21381-3.

A positively selected FBN1 missense variant reduces height in Peruvian individuals.

Nature. 2020 Jun;582(7811):234-239. doi: 10.1038/s41586-020-2302-0. Epub 2020 May 13.

Efficient toolkit implementing best practices for principal component analysis of population genetic data.

Bioinformatics. 2020 Aug 15;36(16):4449-4457. doi: 10.1093/bioinformatics/btaa520.

Insights into human genetic variation and population history from 929 diverse genomes.

Science. 2020 Mar 20;367(6484). doi: 10.1126/science.aay5012.

Target genes, variants, tissues and transcriptional pathways influencing human serum urate levels.

Nat Genet. 2019 Oct;51(10):1459-1474. doi: 10.1038/s41588-019-0504-x. Epub 2019 Oct 2.

Genetic analyses of diverse populations improves discovery for complex traits.

Nature. 2019 Jun;570(7762):514-518. doi: 10.1038/s41586-019-1310-4. Epub 2019 Jun 19.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用英国生物库作为全球人群的全球参考：从 GWAS 汇总统计数据衡量祖先多样性的应用。

Using the UK Biobank as a global reference of worldwide populations: application to measuring ancestry diversity from GWAS summary statistics.

机构信息

National Centre for Register-based Research, Aarhus University, Aarhus 8210, Denmark.

出版信息

Bioinformatics. 2022 Jun 27;38(13):3477-3480. doi: 10.1093/bioinformatics/btac348.

DOI:10.1093/bioinformatics/btac348

PMID:35604078

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9237724/

Abstract

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

This method is implemented in function snp_ancestry_summary of R package bigsnpr.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

衡量遗传多样性是一个重要的问题，因为增加遗传多样性是做出新的遗传发现的关键，同时也是遗传学研究中需要注意的主要混杂来源。

利用英国生物库作为全球人群的全球参考：从 GWAS 汇总统计数据衡量祖先多样性的应用。

Using the UK Biobank as a global reference of worldwide populations: application to measuring ancestry diversity from GWAS summary statistics.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

SUPPLEMENTARY INFORMATION

动机

结果

可用性和实现

补充信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

利用英国生物库作为全球人群的全球参考：从 GWAS 汇总统计数据衡量祖先多样性的应用。

Using the UK Biobank as a global reference of worldwide populations: application to measuring ancestry diversity from GWAS summary statistics.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

SUPPLEMENTARY INFORMATION

动机

结果

可用性和实现

补充信息