• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从下一代测序数据估计个体混合比例。

Estimating individual admixture proportions from next generation sequencing data.

机构信息

The Bioinformatics Centre, Department of Biology, University of Copenhagen, DK-2200 Copenhagen N.

出版信息

Genetics. 2013 Nov;195(3):693-702. doi: 10.1534/genetics.113.154138. Epub 2013 Sep 11.

DOI:10.1534/genetics.113.154138
PMID:24026093
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3813857/
Abstract

Inference of population structure and individual ancestry is important both for population genetics and for association studies. With next generation sequencing technologies it is possible to obtain genetic data for all accessible genetic variations in the genome. Existing methods for admixture analysis rely on known genotypes. However, individual genotypes cannot be inferred from low-depth sequencing data without introducing errors. This article presents a new method for inferring an individual's ancestry that takes the uncertainty introduced in next generation sequencing data into account. This is achieved by working directly with genotype likelihoods that contain all relevant information of the unobserved genotypes. Using simulations as well as publicly available sequencing data, we demonstrate that the presented method has great accuracy even for very low-depth data. At the same time, we demonstrate that applying existing methods to genotypes called from the same data can introduce severe biases. The presented method is implemented in the NGSadmix software available at http://www.popgen.dk/software.

摘要

推断群体结构和个体祖先对于群体遗传学和关联研究都很重要。随着下一代测序技术的发展,有可能获得基因组中所有可及遗传变异的遗传数据。现有的混合分析方法依赖于已知的基因型。然而,如果不引入错误,从低深度测序数据中无法推断个体基因型。本文提出了一种新的方法,可以从下一个考虑到不确定性世代测序数据中推断个体的祖先直接处理包含未观察到基因型所有相关信息的基因型似然。使用模拟以及公开可用的测序数据,我们证明了即使在深度非常低的数据情况下,所提出的方法也具有很高的准确性。同时,我们证明了将现有方法应用于从同一数据中调用的基因型可能会引入严重的偏差。所提出的方法在可从 http://www.popgen.dk/software 获得的 NGSadmix 软件中实现。

相似文献

1
Estimating individual admixture proportions from next generation sequencing data.从下一代测序数据估计个体混合比例。
Genetics. 2013 Nov;195(3):693-702. doi: 10.1534/genetics.113.154138. Epub 2013 Sep 11.
2
Inferring Population Structure and Admixture Proportions in Low-Depth NGS Data.在低深度 NGS 数据中推断群体结构和混合比例。
Genetics. 2018 Oct;210(2):719-731. doi: 10.1534/genetics.118.301336. Epub 2018 Aug 21.
3
Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations.利用多个群体的等位基因频率从DNA序列数据中快速推断个体祖先。
BMC Bioinformatics. 2015 Jan 16;16:4. doi: 10.1186/s12859-014-0418-7.
4
fastNGSadmix: admixture proportions and principal component analysis of a single NGS sample.fastNGSadmix:单个 NGS 样本的混合比例和主成分分析。
Bioinformatics. 2017 Oct 1;33(19):3148-3150. doi: 10.1093/bioinformatics/btx474.
5
NgsRelate: a software tool for estimating pairwise relatedness from next-generation sequencing data.NgsRelate:一种用于从下一代测序数据估计成对亲缘关系的软件工具。
Bioinformatics. 2015 Dec 15;31(24):4009-11. doi: 10.1093/bioinformatics/btv509. Epub 2015 Aug 30.
6
Quantifying population genetic differentiation from next-generation sequencing data.从下一代测序数据中定量群体遗传分化。
Genetics. 2013 Nov;195(3):979-92. doi: 10.1534/genetics.113.154740. Epub 2013 Aug 26.
7
NGSremix: a software tool for estimating pairwise relatedness between admixed individuals from next-generation sequencing data.NGSremix:一种用于从下一代测序数据估算混合个体之间成对亲缘关系的软件工具。
G3 (Bethesda). 2021 Aug 7;11(8). doi: 10.1093/g3journal/jkab174.
8
ANGSD: Analysis of Next Generation Sequencing Data.ANGSD:下一代测序数据分析
BMC Bioinformatics. 2014 Nov 25;15(1):356. doi: 10.1186/s12859-014-0356-4.
9
Fast admixture analysis and population tree estimation for SNP and NGS data.快速混合分析和 SNP 及 NGS 数据的群体树估计。
Bioinformatics. 2017 Jul 15;33(14):2148-2155. doi: 10.1093/bioinformatics/btx098.
10
Powerful Inference with the D-Statistic on Low-Coverage Whole-Genome Data.利用D统计量对低覆盖率全基因组数据进行有力推断。
G3 (Bethesda). 2018 Feb 2;8(2):551-566. doi: 10.1534/g3.117.300192.

引用本文的文献

1
Population Genomics Reveals Panmixia in Pacific Sardine () of the North Pacific.群体基因组学揭示了北太平洋太平洋沙丁鱼()的随机交配。
Evol Appl. 2025 Sep 4;18(9):e70154. doi: 10.1111/eva.70154. eCollection 2025 Sep.
2
Genome sequence analysis provides evidence that a boreal crustacean colonised Svalbard well before the ongoing Atlantification of the Arctic.基因组序列分析提供的证据表明,一种北方甲壳类动物在北极地区当前正在进行的大西洋化之前很久就已在斯瓦尔巴群岛定居。
Heredity (Edinb). 2025 Aug 23. doi: 10.1038/s41437-025-00793-7.
3
Late Iron Age and Roman equine breeding north of the Alps: Genetic insights and cultural implications.阿尔卑斯山以北铁器时代晚期和罗马时期的马匹繁育:遗传学见解与文化影响
iScience. 2025 Aug 13;28(9):113224. doi: 10.1016/j.isci.2025.113224. eCollection 2025 Sep 19.
4
Postglacial Echoes: Parasite Genomics Uncover Environmental Changes in Postglacial European Lakes.冰期后的回响:寄生虫基因组学揭示欧洲冰期后湖泊的环境变化
Mol Ecol. 2025 Sep;34(17):e70039. doi: 10.1111/mec.70039. Epub 2025 Jul 16.
5
Museomics unravels cryptic diversity in an endemic group of New Guinean songbirds.博物馆组学揭示了新几内亚特有鸣禽群体中的隐秘多样性。
Biol Lett. 2025 Jul;21(7):20240611. doi: 10.1098/rsbl.2024.0611. Epub 2025 Jul 16.
6
Species Limits and Hybridization in Andean Leaf-Eared Mice ().安第斯叶耳鼠的物种界限与杂交()。
Ecol Evol. 2025 Jul 13;15(7):e71783. doi: 10.1002/ece3.71783. eCollection 2025 Jul.
7
Evolutionary insights into Na/K-ATPase-mediated toxin resistance in the Crested Serpent-eagle preying on introduced cane toads in Okinawa, Japan.对日本冲绳捕食引进的甘蔗蟾蜍的凤头鹰中钠钾ATP酶介导的毒素抗性的进化见解。
BMC Ecol Evol. 2025 Jul 14;25(1):70. doi: 10.1186/s12862-025-02412-9.
8
The blueprint for survival: the blue dasher dragonfly as a model for urban adaptation.生存蓝图:蓝胸蟌作为城市适应的典范
BMC Ecol Evol. 2025 Jul 8;25(1):67. doi: 10.1186/s12862-025-02399-3.
9
Haploblocks contribute to parallel climate adaptation following global invasion of a cosmopolitan plant.单倍型块在一种世界性植物全球入侵后促成了平行的气候适应。
Nat Ecol Evol. 2025 Jul 8. doi: 10.1038/s41559-025-02751-2.
10
Environmental Drivers of Genetic Divergence in Two Corals From the Florida Keys.佛罗里达群岛两种珊瑚遗传分化的环境驱动因素
Evol Appl. 2025 Jun 29;18(7):e70126. doi: 10.1111/eva.70126. eCollection 2025 Jul.

本文引用的文献

1
An integrated map of genetic variation from 1,092 human genomes.1092 个人类基因组遗传变异的综合图谱。
Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.
2
Extremely low-coverage sequencing and imputation increases power for genome-wide association studies.极低覆盖度测序和模拟提高了全基因组关联研究的效能。
Nat Genet. 2012 May 20;44(6):631-5. doi: 10.1038/ng.2283.
3
Association testing for next-generation sequencing data using score statistics.基于评分统计量的下一代测序数据关联分析。
Genet Epidemiol. 2012 Jul;36(5):430-7. doi: 10.1002/gepi.21636. Epub 2012 May 8.
4
A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.一种用于从测序数据中进行 SNP 调用、突变发现、关联映射和群体遗传参数估计的统计框架。
Bioinformatics. 2011 Nov 1;27(21):2987-93. doi: 10.1093/bioinformatics/btr509. Epub 2011 Sep 8.
5
Structurama: bayesian inference of population structure.Structurama:群体结构的贝叶斯推断。
Evol Bioinform Online. 2011;7:55-9. doi: 10.4137/EBO.S6761. Epub 2011 Jun 2.
6
Estimation of allele frequency and association mapping using next-generation sequencing data.利用下一代测序数据进行等位基因频率估计和关联作图。
BMC Bioinformatics. 2011 Jun 11;12:231. doi: 10.1186/1471-2105-12-231.
7
Genotype and SNP calling from next-generation sequencing data.从下一代测序数据中进行基因型和单核苷酸多态性(SNP)的调用。
Nat Rev Genet. 2011 Jun;12(6):443-51. doi: 10.1038/nrg2986.
8
A map of human genome variation from population-scale sequencing.人类基因组变异的图谱来自于基于人群的测序。
Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.
9
Integrating common and rare genetic variation in diverse human populations.整合不同人类群体中的常见和罕见遗传变异。
Nature. 2010 Sep 2;467(7311):52-8. doi: 10.1038/nature09298.
10
The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.基因组分析工具包:一种用于分析下一代 DNA 测序数据的 MapReduce 框架。
Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.