Suppr超能文献

“逆向生态学”与群体基因组学的力量。

"Reverse ecology" and the power of population genomics.

作者信息

Li Yong Fuga, Costello James C, Holloway Alisha K, Hahn Matthew W

机构信息

School of Informatics, Indiana University, Bloomington, IN, USA.

出版信息

Evolution. 2008 Dec;62(12):2984-94. doi: 10.1111/j.1558-5646.2008.00486.x. Epub 2008 Aug 26.

Abstract

Rapid and inexpensive sequencing technologies are making it possible to collect whole genome sequence data on multiple individuals from a population. This type of data can be used to quickly identify genes that control important ecological and evolutionary phenotypes by finding the targets of adaptive natural selection, and we therefore refer to such approaches as "reverse ecology." To quantify the power gained in detecting positive selection using population genomic data, we compare three statistical methods for identifying targets of selection: the McDonald-Kreitman test, the mkprf method, and a likelihood implementation for detecting d(N)/d(S) > 1. Because the first two methods use polymorphism data we expect them to have more power to detect selection. However, when applied to population genomic datasets from human, fly, and yeast, the tests using polymorphism data were actually weaker in two of the three datasets. We explore reasons why the simpler comparative method has identified more genes under selection, and suggest that the different methods may really be detecting different signals from the same sequence data. Finally, we find several statistical anomalies associated with the mkprf method, including an almost linear dependence between the number of positively selected genes identified and the prior distributions used. We conclude that interpreting the results produced by this method should be done with some caution.

摘要

快速且低成本的测序技术使得从一个群体中收集多个个体的全基因组序列数据成为可能。这类数据可用于通过寻找适应性自然选择的靶点,快速识别控制重要生态和进化表型的基因,因此我们将此类方法称为“反向生态学”。为了量化利用群体基因组数据检测正选择时所获得的功效,我们比较了三种用于识别选择靶点的统计方法:麦克唐纳 - 克里特曼检验、mkprf方法以及一种用于检测d(N)/d(S)>1的似然法。由于前两种方法使用多态性数据,我们预期它们在检测选择方面具有更强的功效。然而,当应用于来自人类、果蝇和酵母的群体基因组数据集时,在三个数据集中有两个数据集里,使用多态性数据的检验实际上功效更弱。我们探究了为何更简单的比较方法能识别出更多处于选择状态的基因,并提出不同方法可能实际上是从相同序列数据中检测到了不同信号。最后,我们发现了与mkprf方法相关的几个统计异常情况,包括所识别出的正选择基因数量与所用先验分布之间几乎呈线性依赖关系。我们得出结论,在解释该方法产生的结果时应谨慎行事。

相似文献

1
"Reverse ecology" and the power of population genomics.“逆向生态学”与群体基因组学的力量。
Evolution. 2008 Dec;62(12):2984-94. doi: 10.1111/j.1558-5646.2008.00486.x. Epub 2008 Aug 26.
4
iMKT: the integrative McDonald and Kreitman test.iMKT:综合麦当劳-克里坦门测试。
Nucleic Acids Res. 2019 Jul 2;47(W1):W283-W288. doi: 10.1093/nar/gkz372.

引用本文的文献

本文引用的文献

1
ANALYZING TABLES OF STATISTICAL TESTS.分析统计检验表
Evolution. 1989 Jan;43(1):223-225. doi: 10.1111/j.1558-5646.1989.tb04220.x.
2
The polymorphism frequency spectrum of finitely many sites under selection.选择作用下有限多个位点的多态性频谱。
Genetics. 2008 Dec;180(4):2175-91. doi: 10.1534/genetics.108.087361. Epub 2008 Oct 14.
3
Toward a selection theory of molecular evolution.迈向分子进化的选择理论。
Evolution. 2008 Feb;62(2):255-65. doi: 10.1111/j.1558-5646.2007.00308.x.
5
The McDonald-Kreitman test and slightly deleterious mutations.麦克唐纳-克赖特曼检验与轻度有害突变
Mol Biol Evol. 2008 Jun;25(6):1007-15. doi: 10.1093/molbev/msn005. Epub 2008 Jan 14.
9
Localizing recent adaptive evolution in the human genome.定位人类基因组中近期的适应性进化
PLoS Genet. 2007 Jun;3(6):e90. doi: 10.1371/journal.pgen.0030090. Epub 2007 Apr 20.
10
PAML 4: phylogenetic analysis by maximum likelihood.PAML 4:基于最大似然法的系统发育分析。
Mol Biol Evol. 2007 Aug;24(8):1586-91. doi: 10.1093/molbev/msm088. Epub 2007 May 4.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验