Suppr超能文献

从高覆盖度基因组测序项目中估计等位基因频率。

Estimation of allele frequencies from high-coverage genome-sequencing projects.

作者信息

Lynch Michael

机构信息

Department of Biology, Indiana University, Bloomington, Indiana 47405, USA.

出版信息

Genetics. 2009 May;182(1):295-301. doi: 10.1534/genetics.109.100479. Epub 2009 Mar 16.

Abstract

A new generation of high-throughput sequencing strategies will soon lead to the acquisition of high-coverage genomic profiles of hundreds to thousands of individuals within species, generating unprecedented levels of information on the frequencies of nucleotides segregating at individual sites. However, because these new technologies are error prone and yield uneven coverage of alleles in diploid individuals, they also introduce the need for novel methods for analyzing the raw read data. A maximum-likelihood method for the estimation of allele frequencies is developed, eliminating both the need to arbitrarily discard individuals with low coverage and the requirement for an extrinsic measure of the sequence error rate. The resultant estimates are nearly unbiased with asymptotically minimal sampling variance, thereby defining the limits to our ability to estimate population-genetic parameters and providing a logical basis for the optimal design of population-genomic surveys.

摘要

新一代高通量测序策略很快将带来物种内数百至数千个体的高覆盖基因组图谱的获取,产生关于单个位点核苷酸分离频率的前所未有的信息量。然而,由于这些新技术容易出错且在二倍体个体中产生等位基因覆盖不均的情况,它们也带来了对分析原始读取数据的新方法的需求。开发了一种用于估计等位基因频率的最大似然方法,既消除了任意舍弃低覆盖个体的需要,也消除了对序列错误率进行外部测量的要求。所得估计几乎无偏差,渐近采样方差最小,从而确定了我们估计群体遗传参数能力的极限,并为群体基因组调查的优化设计提供了逻辑基础。

相似文献

引用本文的文献

1
Speciation with gene flow in an island endemic hummingbird.岛屿特有蜂鸟中伴随基因流动的物种形成
PNAS Nexus. 2025 Apr 15;4(4):pgaf095. doi: 10.1093/pnasnexus/pgaf095. eCollection 2025 Apr.
9
Genotype-Frequency Estimation from High-Throughput Sequencing Data.高通量测序数据的基因型频率估计。
Genetics. 2015 Oct;201(2):473-86. doi: 10.1534/genetics.115.179077. Epub 2015 Jul 29.

本文引用的文献

1
Population genetic inference from resequencing data.基于重测序数据的群体遗传推断。
Genetics. 2009 Jan;181(1):187-97. doi: 10.1534/genetics.107.080630. Epub 2008 Nov 3.
7
Patterns of damage in genomic DNA sequences from a Neandertal.来自尼安德特人的基因组DNA序列中的损伤模式。
Proc Natl Acad Sci U S A. 2007 Sep 11;104(37):14616-21. doi: 10.1073/pnas.0704665104. Epub 2007 Aug 21.
10
The structure of linkage disequilibrium around a selective sweep.选择性清除周围的连锁不平衡结构。
Genetics. 2007 Mar;175(3):1395-406. doi: 10.1534/genetics.106.062828. Epub 2006 Dec 28.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验