“逆向生态学”与群体基因组学的力量。

"Reverse ecology" and the power of population genomics.

作者信息

Li Yong Fuga, Costello James C, Holloway Alisha K, Hahn Matthew W

机构信息

School of Informatics, Indiana University, Bloomington, IN, USA.

出版信息

Evolution. 2008 Dec;62(12):2984-94. doi: 10.1111/j.1558-5646.2008.00486.x. Epub 2008 Aug 26.

DOI:10.1111/j.1558-5646.2008.00486.x

PMID:18752601

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2626434/

Abstract

Rapid and inexpensive sequencing technologies are making it possible to collect whole genome sequence data on multiple individuals from a population. This type of data can be used to quickly identify genes that control important ecological and evolutionary phenotypes by finding the targets of adaptive natural selection, and we therefore refer to such approaches as "reverse ecology." To quantify the power gained in detecting positive selection using population genomic data, we compare three statistical methods for identifying targets of selection: the McDonald-Kreitman test, the mkprf method, and a likelihood implementation for detecting d(N)/d(S) > 1. Because the first two methods use polymorphism data we expect them to have more power to detect selection. However, when applied to population genomic datasets from human, fly, and yeast, the tests using polymorphism data were actually weaker in two of the three datasets. We explore reasons why the simpler comparative method has identified more genes under selection, and suggest that the different methods may really be detecting different signals from the same sequence data. Finally, we find several statistical anomalies associated with the mkprf method, including an almost linear dependence between the number of positively selected genes identified and the prior distributions used. We conclude that interpreting the results produced by this method should be done with some caution.

摘要

快速且低成本的测序技术使得从一个群体中收集多个个体的全基因组序列数据成为可能。这类数据可用于通过寻找适应性自然选择的靶点，快速识别控制重要生态和进化表型的基因，因此我们将此类方法称为“反向生态学”。为了量化利用群体基因组数据检测正选择时所获得的功效，我们比较了三种用于识别选择靶点的统计方法：麦克唐纳 - 克里特曼检验、mkprf方法以及一种用于检测d(N)/d(S)>1的似然法。由于前两种方法使用多态性数据，我们预期它们在检测选择方面具有更强的功效。然而，当应用于来自人类、果蝇和酵母的群体基因组数据集时，在三个数据集中有两个数据集里，使用多态性数据的检验实际上功效更弱。我们探究了为何更简单的比较方法能识别出更多处于选择状态的基因，并提出不同方法可能实际上是从相同序列数据中检测到了不同信号。最后，我们发现了与mkprf方法相关的几个统计异常情况，包括所识别出的正选择基因数量与所用先验分布之间几乎呈线性依赖关系。我们得出结论，在解释该方法产生的结果时应谨慎行事。

相似文献

"Reverse ecology" and the power of population genomics.“逆向生态学”与群体基因组学的力量。

Evolution. 2008 Dec;62(12):2984-94. doi: 10.1111/j.1558-5646.2008.00486.x. Epub 2008 Aug 26.

impMKT: the imputed McDonald and Kreitman test, a straightforward correction that significantly increases the evidence of positive selection of the McDonald and Kreitman test at the gene level.impMKT：推断的 McDonald-Kreitman 检验，一种简单直接的校正方法，可极大地增强基因水平 McDonald-Kreitman 检验阳性选择的证据。

G3 (Bethesda). 2022 Sep 30;12(10). doi: 10.1093/g3journal/jkac206.

A Composite-Likelihood Method for Detecting Incomplete Selective Sweep from Population Genomic Data.一种用于从群体基因组数据中检测不完全选择性清除的复合似然方法。

Genetics. 2015 Jun;200(2):633-49. doi: 10.1534/genetics.115.175380. Epub 2015 Apr 24.

iMKT: the integrative McDonald and Kreitman test.iMKT：综合麦当劳-克里坦门测试。

Nucleic Acids Res. 2019 Jul 2;47(W1):W283-W288. doi: 10.1093/nar/gkz372.

Genomic resources and their influence on the detection of the signal of positive selection in genome scans.基因组资源及其对基因组扫描中正向选择信号检测的影响。

Mol Ecol. 2016 Jan;25(1):170-84. doi: 10.1111/mec.13468. Epub 2015 Dec 17.

An investigation of the statistical power of neutrality tests based on comparative and population genetic data.基于比较和群体遗传数据的中性检验统计功效研究。

Mol Biol Evol. 2009 Feb;26(2):273-83. doi: 10.1093/molbev/msn231. Epub 2008 Oct 14.

Comparative genomics and the study of evolution by natural selection.比较基因组学与自然选择驱动的进化研究。

Mol Ecol. 2008 Nov;17(21):4586-96. doi: 10.1111/j.1365-294X.2008.03954.x.

Machine-Learning Prospects for Detecting Selection Signatures Using Population Genomics Data.利用群体基因组学数据检测选择信号的机器学习前景。

J Comput Biol. 2022 Sep;29(9):943-960. doi: 10.1089/cmb.2021.0447. Epub 2022 May 30.

The influence of demography and weak selection on the McDonald-Kreitman test: an empirical study in Drosophila.人口统计学和弱选择对麦克唐纳-克赖特曼检验的影响：果蝇的实证研究

Mol Biol Evol. 2009 Mar;26(3):691-8. doi: 10.1093/molbev/msn297. Epub 2009 Jan 6.

Proceedings of the SMBE Tri-National Young Investigators' Workshop 2005. Accurate inference and estimation in population genomics.2005年SMBE三国青年研究者研讨会会议记录。群体基因组学中的准确推断与估计。

Mol Biol Evol. 2006 May;23(5):911-8. doi: 10.1093/molbev/msj094. Epub 2006 Jan 11.

引用本文的文献

Context matters: assessing the impacts of genomic background and ecology on microbial biosynthetic gene cluster evolution.背景很重要：评估基因组背景和生态学对微生物生物合成基因簇进化的影响。

mSystems. 2025 Mar 18;10(3):e0153824. doi: 10.1128/msystems.01538-24. Epub 2025 Feb 24.

Beyond genes-for-behaviour: The potential for genomics to resolve long-standing questions in avian brood parasitism.超越行为基因：基因组学解决鸟类巢寄生长期存在问题的潜力

Ecol Evol. 2024 Nov 17;14(11):e70335. doi: 10.1002/ece3.70335. eCollection 2024 Nov.

Rapid Adaptation and Interspecific Introgression in the North American Crop Pest Helicoverpa zea.北美作物害虫棉铃虫的快速适应和种间基因渐渗。

Mol Biol Evol. 2024 Jul 3;41(7). doi: 10.1093/molbev/msae129.

Functional genomic diversity is correlated with neutral genomic diversity in populations of an endangered rattlesnake.功能基因组多样性与濒危响尾蛇种群中性基因组多样性相关。

Proc Natl Acad Sci U S A. 2023 Oct 24;120(43):e2303043120. doi: 10.1073/pnas.2303043120. Epub 2023 Oct 16.

Parasite manipulation of host phenotypes inferred from transcriptional analyses in a trematode-amphipod system.从扁形动物-端足类系统中转录分析推断寄生虫对宿主表型的操纵。

Mol Ecol. 2023 Sep;32(18):5028-5041. doi: 10.1111/mec.17093. Epub 2023 Aug 4.

Metagenomic Discovery of " Parvarchaeales"-Related Lineages Sheds Light on Adaptation and Diversification from Neutral-Thermal to Acidic-Mesothermal Environments.宏基因组学发现“小包古菌目”相关谱系，揭示了从中性-热环境到酸性-中温环境的适应和多样化。

mSystems. 2023 Apr 27;8(2):e0125222. doi: 10.1128/msystems.01252-22. Epub 2023 Mar 21.

Sweepstakes reproductive success via pervasive and recurrent selective sweeps.通过普遍且反复的选择扫荡来赢得繁殖成功。

Elife. 2023 Feb 20;12:e80781. doi: 10.7554/eLife.80781.

Genome sequencing and resequencing identified three horizontal gene transfers and uncovered the genetic mechanism on the intraspecies adaptive evolution of Blume.基因组测序和重测序鉴定出了三次水平基因转移，并揭示了梅花草种内适应性进化的遗传机制。

Front Plant Sci. 2023 Jan 4;13:1035157. doi: 10.3389/fpls.2022.1035157. eCollection 2022.

Testing hypotheses of a coevolutionary key innovation reveals a complex suite of traits involved in defusing the mustard oil bomb.检测协同进化关键创新假说揭示了一系列复杂的特征，这些特征涉及消除芥子油炸弹。

Proc Natl Acad Sci U S A. 2022 Dec 20;119(51):e2208447119. doi: 10.1073/pnas.2208447119. Epub 2022 Dec 12.

Detecting selection in low-coverage high-throughput sequencing data using principal component analysis.使用主成分分析检测低覆盖高通量测序数据中的选择。

BMC Bioinformatics. 2021 Sep 29;22(1):470. doi: 10.1186/s12859-021-04375-2.

本文引用的文献

ANALYZING TABLES OF STATISTICAL TESTS.分析统计检验表

Evolution. 1989 Jan;43(1):223-225. doi: 10.1111/j.1558-5646.1989.tb04220.x.

The polymorphism frequency spectrum of finitely many sites under selection.选择作用下有限多个位点的多态性频谱。

Genetics. 2008 Dec;180(4):2175-91. doi: 10.1534/genetics.108.087361. Epub 2008 Oct 14.

Toward a selection theory of molecular evolution.迈向分子进化的选择理论。

Evolution. 2008 Feb;62(2):255-65. doi: 10.1111/j.1558-5646.2007.00308.x.

Proportionally more deleterious genetic variation in European than in African populations.欧洲人群中有害基因变异的比例高于非洲人群。

Nature. 2008 Feb 21;451(7181):994-7. doi: 10.1038/nature06611.

The McDonald-Kreitman test and slightly deleterious mutations.麦克唐纳-克赖特曼检验与轻度有害突变

Mol Biol Evol. 2008 Jun;25(6):1007-15. doi: 10.1093/molbev/msn005. Epub 2008 Jan 14.

Localization of candidate regions maintaining a common polymorphic inversion (2La) in Anopheles gambiae.冈比亚按蚊中维持常见多态性倒位（2La）的候选区域定位。

PLoS Genet. 2007 Dec;3(12):e217. doi: 10.1371/journal.pgen.0030217.

Population genomics: whole-genome analysis of polymorphism and divergence in Drosophila simulans.群体基因组学：拟暗果蝇多态性和分化的全基因组分析。

PLoS Biol. 2007 Nov 6;5(11):e310. doi: 10.1371/journal.pbio.0050310.

Which evolutionary processes influence natural genetic variation for phenotypic traits?哪些进化过程会影响表型性状的自然遗传变异？

Nat Rev Genet. 2007 Nov;8(11):845-56. doi: 10.1038/nrg2207.

Localizing recent adaptive evolution in the human genome.定位人类基因组中近期的适应性进化

PLoS Genet. 2007 Jun;3(6):e90. doi: 10.1371/journal.pgen.0030090. Epub 2007 Apr 20.

PAML 4: phylogenetic analysis by maximum likelihood.PAML 4：基于最大似然法的系统发育分析。

Mol Biol Evol. 2007 Aug;24(8):1586-91. doi: 10.1093/molbev/msm088. Epub 2007 May 4.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验