• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用特定人群变体的祖先谱系分析

Ancestral Spectrum Analysis With Population-Specific Variants.

作者信息

Shi Gang, Kuang Qingmin

机构信息

State Key Laboratory of Integrated Services Networks, Xidian University, Xi'an, China.

出版信息

Front Genet. 2021 Sep 27;12:724638. doi: 10.3389/fgene.2021.724638. eCollection 2021.

DOI:10.3389/fgene.2021.724638
PMID:34646302
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8503515/
Abstract

With the advance of sequencing technology, an increasing number of populations have been sequenced to study the histories of worldwide populations, including their divergence, admixtures, migration, and effective sizes. The variants detected in sequencing studies are largely rare and mostly population specific. Population-specific variants are often recent mutations and are informative for revealing substructures and admixtures in populations; however, computational methods and tools to analyze them are still lacking. In this work, we propose using reference populations and single nucleotide polymorphisms (SNPs) specific to the reference populations. Ancestral information, the best linear unbiased estimator (BLUE) of the ancestral proportion, is proposed, which can be used to infer ancestral proportions in recently admixed target populations and measure the extent to which reference populations serve as good proxies for the admixing sources. Based on the same panel of SNPs, the ancestral information is comparable across samples from different studies and is not affected by genetic outliers, related samples, or the sample sizes of the admixed target populations. In addition, ancestral spectrum is useful for detecting genetic outliers or exploring co-ancestry between study samples and the reference populations. The methods are implemented in a program, Ancestral Spectrum Analyzer (ASA), and are applied in analyzing high-coverage sequencing data from the 1000 Genomes Project and the Human Genome Diversity Project (HGDP). In the analyses of American populations from the 1000 Genomes Project, we demonstrate that recent admixtures can be dissected from ancient admixtures by comparing ancestral spectra with and without indigenous Americans being included in the reference populations.

摘要

随着测序技术的发展,越来越多的人群被测序,以研究全球人群的历史,包括他们的分化、混合、迁移和有效群体大小。测序研究中检测到的变异大多是罕见的,且大多具有群体特异性。群体特异性变异通常是近期突变,有助于揭示群体中的亚结构和混合情况;然而,分析这些变异的计算方法和工具仍然缺乏。在这项工作中,我们提出使用参考群体以及特定于参考群体的单核苷酸多态性(SNP)。我们提出了祖先信息,即祖先比例的最佳线性无偏估计(BLUE),它可用于推断近期混合的目标群体中的祖先比例,并衡量参考群体作为混合源的良好代理的程度。基于同一组SNP,不同研究样本中的祖先信息具有可比性,并且不受遗传异常值、相关样本或混合目标群体样本大小的影响。此外,祖先谱有助于检测遗传异常值或探索研究样本与参考群体之间的共同祖先关系。这些方法在一个名为祖先谱分析仪(ASA)的程序中实现,并应用于分析来自千人基因组计划和人类基因组多样性计划(HGDP)的高覆盖度测序数据。在对千人基因组计划中美国人群的分析中,我们证明,通过比较参考群体中包含和不包含美洲原住民时的祖先谱,可以将近期混合与古代混合区分开来。

相似文献

1
Ancestral Spectrum Analysis With Population-Specific Variants.使用特定人群变体的祖先谱系分析
Front Genet. 2021 Sep 27;12:724638. doi: 10.3389/fgene.2021.724638. eCollection 2021.
2
Local Ancestry Inference Based on Population-Specific Single-Nucleotide Polymorphisms-A Study of Admixed Populations in the 1000 Genomes Project.基于群体特异单核苷酸多态性的局域亲缘关系推断——以 1000 基因组计划中的混合人群为例。
Genes (Basel). 2024 Aug 21;15(8):1099. doi: 10.3390/genes15081099.
3
An ancestry informative marker panel design for individual ancestry estimation of Hispanic population using whole exome sequencing data.基于全外显子组测序数据的西班牙裔个体祖籍信息标记面板设计用于个体祖籍估计。
BMC Genomics. 2019 Dec 30;20(Suppl 12):1007. doi: 10.1186/s12864-019-6333-6.
4
MI-MAAP: marker informativeness for multi-ancestry admixed populations.MI-MAAP:多祖混合人群的标记信息量。
BMC Bioinformatics. 2020 Apr 3;21(1):131. doi: 10.1186/s12859-020-3462-5.
5
Inference of locus-specific ancestry in closely related populations.密切相关群体中特定基因座祖先的推断。
Bioinformatics. 2009 Jun 15;25(12):i213-21. doi: 10.1093/bioinformatics/btp197.
6
Genetic ancestry, admixture and health determinants in Latin America.拉丁美洲的遗传起源、混合和健康决定因素。
BMC Genomics. 2018 Dec 11;19(Suppl 8):861. doi: 10.1186/s12864-018-5195-7.
7
Massively parallel sequencing of 165 ancestry-informative SNPs and forensic biogeographical ancestry inference in three southern Chinese Sinitic/Tai-Kadai populations.对 165 个具有族群遗传信息的 SNP 进行大规模平行测序,并对中国南方三个汉藏语系/台语族群进行法医学生物地理族群推断。
Forensic Sci Int Genet. 2021 May;52:102475. doi: 10.1016/j.fsigen.2021.102475. Epub 2021 Feb 2.
8
Ancestral components of admixed genomes in a Mexican cohort.墨西哥队列中混合基因组的祖先成分。
PLoS Genet. 2011 Dec;7(12):e1002410. doi: 10.1371/journal.pgen.1002410. Epub 2011 Dec 15.
9
Development and Evaluation of the Ancestry Informative Marker Panel of the VISAGE Basic Tool.发展和评估 VISAGE 基本工具的祖源信息标记面板。
Genes (Basel). 2021 Aug 22;12(8):1284. doi: 10.3390/genes12081284.
10
Straightforward inference of ancestry and admixture proportions through ancestry-informative insertion deletion multiplexing.通过基于祖先信息的插入缺失多重PCR 直接推断祖先和混合比例。
PLoS One. 2012;7(1):e29684. doi: 10.1371/journal.pone.0029684. Epub 2012 Jan 17.

引用本文的文献

1
Fine-mapping in admixed populations using CARMA-X, with applications to Latin American studies.使用CARMA-X在混合人群中进行精细定位及其在拉丁美洲研究中的应用。
Am J Hum Genet. 2025 May 1;112(5):1215-1232. doi: 10.1016/j.ajhg.2025.02.020. Epub 2025 Mar 26.
2
Local Ancestry Inference Based on Population-Specific Single-Nucleotide Polymorphisms-A Study of Admixed Populations in the 1000 Genomes Project.基于群体特异单核苷酸多态性的局域亲缘关系推断——以 1000 基因组计划中的混合人群为例。
Genes (Basel). 2024 Aug 21;15(8):1099. doi: 10.3390/genes15081099.
3
SNVstory: inferring genetic ancestry from genome sequencing data.

本文引用的文献

1
High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios.对扩展的 1000 基因组项目队列进行高覆盖率全基因组测序,包括 602 个三核苷酸重复序列。
Cell. 2022 Sep 1;185(18):3426-3440.e19. doi: 10.1016/j.cell.2022.08.004.
2
Genetic Consequences of the Transatlantic Slave Trade in the Americas.美洲跨大西洋奴隶贸易的遗传后果。
Am J Hum Genet. 2020 Aug 6;107(2):265-277. doi: 10.1016/j.ajhg.2020.06.012. Epub 2020 Jul 23.
3
Fast and robust ancestry prediction using principal component analysis.利用主成分分析进行快速稳健的祖源预测。
SNVstory:从基因组测序数据推断遗传起源。
BMC Bioinformatics. 2024 Feb 20;25(1):76. doi: 10.1186/s12859-024-05703-y.
Bioinformatics. 2020 Jun 1;36(11):3439-3446. doi: 10.1093/bioinformatics/btaa152.
4
Insights into human genetic variation and population history from 929 diverse genomes.从 929 个不同的基因组中深入了解人类遗传变异和人口历史。
Science. 2020 Mar 20;367(6484). doi: 10.1126/science.aay5012.
5
UMAP reveals cryptic population structure and phenotype heterogeneity in large genomic cohorts.UMAP 揭示了大型基因组队列中的隐藏种群结构和表型异质性。
PLoS Genet. 2019 Nov 1;15(11):e1008432. doi: 10.1371/journal.pgen.1008432. eCollection 2019 Nov.
6
Evaluation of methods for adjusting population stratification in genome-wide association studies: Standard versus categorical principal component analysis.全基因组关联研究中调整群体分层方法的评估:标准主成分分析与分类主成分分析
Ann Hum Genet. 2019 Nov;83(6):454-464. doi: 10.1111/ahg.12339. Epub 2019 Jul 19.
7
A tutorial on how not to over-interpret STRUCTURE and ADMIXTURE bar plots.关于如何不过度解读 STRUCTURE 和 ADMIXTURE 条形图的教程。
Nat Commun. 2018 Aug 14;9(1):3258. doi: 10.1038/s41467-018-05257-7.
8
Tracing the peopling of the world through genomics.通过基因组学探寻人类的全球迁徙历程。
Nature. 2017 Jan 18;541(7637):302-310. doi: 10.1038/nature21347.
9
The Simons Genome Diversity Project: 300 genomes from 142 diverse populations.西蒙斯基因组多样性项目:来自142个不同群体的300个基因组。
Nature. 2016 Oct 13;538(7624):201-206. doi: 10.1038/nature18964. Epub 2016 Sep 21.
10
Genomic analyses inform on migration events during the peopling of Eurasia.基因组分析为欧亚大陆人类迁徙过程中的迁徙事件提供了信息。
Nature. 2016 Oct 13;538(7624):238-242. doi: 10.1038/nature19792. Epub 2016 Sep 21.