Suppr超能文献

针对DNA序列区域内多态性与分化率之间异质性的改良检测方法。

Improved tests for heterogeneity across a region of DNA sequence in the ratio of polymorphism to divergence.

作者信息

McDonald J H

机构信息

Department of Biology, University of Delaware, Newark, USA.

出版信息

Mol Biol Evol. 1998 Apr;15(4):377-84. doi: 10.1093/oxfordjournals.molbev.a025934.

Abstract

The neutral theory of molecular evolution predicts that the ratio of polymorphisms to fixed differences should be fairly uniform across a region of DNA sequence. Significant heterogeneity in this ratio can indicate the effects of balancing selection, selective sweeps, mildly deleterious mutations, or background selection. Comparing an observed heterogeneity statistic with simulations of the heterogeneity resulting from random phylogenetic and sampling variation provides a test of the statistical significance of the observed pattern. When simulated data sets containing heterogeneity in the polymorphism-to-divergence ratio are examined, different statistics are most powerful for detecting different patterns of heterogeneity. The number of runs is most powerful for detecting patterns containing several peaks of polymorphism; the Kolmogorov-Smirnov statistic is most powerful for detecting patterns in which one end of the gene has high polymorphism and the other end has low polymorphism; and a newly developed statistic, the mean sliding G statistic, is most powerful for detecting patterns containing one or two peaks of polymorphism with reduced polymorphism on either side. Nine out of 27 genes from the Drosophila melanogaster subgroup exhibit heterogeneity that is significant under at least one of these three tests, with five of the nine remaining significant after a correction for multiple comparisons, suggesting that detectable evidence for the effects of some kind of selection is fairly common.

摘要

分子进化的中性理论预测,多态性与固定差异的比率在一段DNA序列区域内应相当均匀。该比率的显著异质性可能表明平衡选择、选择性清除、轻度有害突变或背景选择的影响。将观察到的异质性统计量与由随机系统发育和抽样变异产生的异质性模拟进行比较,可对观察到的模式的统计显著性进行检验。当检查包含多态性与分化率异质性的模拟数据集时,不同的统计量对检测不同的异质性模式最为有效。游程数对检测包含多个多态性峰值的模式最为有效;柯尔莫哥洛夫-斯米尔诺夫统计量对检测基因一端具有高多态性而另一端具有低多态性的模式最为有效;一种新开发的统计量,即平均滑动G统计量,对检测包含一两个多态性峰值且两侧多态性降低的模式最为有效。黑腹果蝇亚组的27个基因中有9个表现出异质性,在这三种测试中的至少一种下具有显著性,其中9个中的5个在进行多重比较校正后仍具有显著性,这表明某种选择效应的可检测证据相当普遍。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验