Local conservation scores without a priori assumptions on neutral substitution rates.

Author information

Dingel Janis, Hanus Pavol, Leonardi Niccolò, Hagenauer Joachim, Zech Jürgen, Mueller Jakob C

Affiliation

Institute for Communications Engineering, Technische Universität München, Munich, Germany.

Publication information

BMC Bioinformatics. 2008 Apr 11;9:190. doi: 10.1186/1471-2105-9-190.

Abstract

BACKGROUND

Comparative genomics aims to detect signals of evolutionary conservation as an indicator of functional constraint. Surprisingly, results of the ENCODE project revealed that about half of the experimentally verified functional elements found in non-coding DNA were classified as unconstrained by computational predictions. It has since been hypothesized that this may be partly explained by biased estimates of the neutral evolutionary rates used by existing sequence conservation metrics. All methods we are aware of rely on a comparison with the neutral rate: conservation is estimated by measuring the deviation of a particular genomic region from this rate. It is therefore reasonable to assume that inaccurate neutral rate estimates may lead to biased conservation and constraint estimates.

RESULTS

We propose a conservation signal produced by local maximum likelihood estimation of evolutionary parameters using an optimized sliding window, and present a Kullback-Leibler projection that allows multiple estimated parameters to be transformed into a conservation measure. This conservation measure does not rely on assumptions about neutral evolutionary substitution rates, and few a priori assumptions about the properties of the conserved regions are imposed. We show the accuracy of our approach (KuLCons) on synthetic data and compare it to the scores generated by state-of-the-art methods (phastCons, GERP, SCONE) in an ENCODE region. We find that KuLCons agrees most often with the conservation/constraint signatures detected by GERP and SCONE, whereas the patterns observed with phastCons are qualitatively very different. Unlike standard methods, KuLCons can be extended to more complex evolutionary models, e.g. models that take insertion and deletion events into account. The corresponding results show that scores obtained under such a model can diverge significantly from scores obtained under the simpler model.
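To make the idea of a sliding-window estimate followed by a Kullback-Leibler step concrete, here is a deliberately simplified sketch. It is not the KuLCons algorithm: the abstract does not specify the evolutionary model, the window optimization, or the exact form of the projection. The sketch assumes two ungapped, pre-aligned DNA sequences, a three-class categorical model (identical, transition, transversion) whose maximum likelihood estimate is the observed frequency in the window, and a fully conserved reference distribution chosen here purely for illustration. All function names are hypothetical.

```python
import math

# Purine<->purine and pyrimidine<->pyrimidine changes are transitions.
TRANSITIONS = {("A", "G"), ("G", "A"), ("C", "T"), ("T", "C")}


def classify(a, b):
    """Return 0 for identical bases, 1 for a transition, 2 for a transversion."""
    if a == b:
        return 0
    return 1 if (a, b) in TRANSITIONS else 2


def window_mle(seq1, seq2, start, width):
    """ML estimate of the three class probabilities within one window.

    For a categorical model, the ML estimate is the observed frequency.
    """
    counts = [0, 0, 0]
    for a, b in zip(seq1[start:start + width], seq2[start:start + width]):
        counts[classify(a, b)] += 1
    return [c / width for c in counts]


def kl_divergence(p, q):
    """D(p || q) in nats, using 0 * log(0) = 0; infinite if q lacks support."""
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0:
            continue
        if qi == 0:
            return math.inf
        total += pi * math.log(pi / qi)
    return total


def sliding_scores(seq1, seq2, width):
    """One divergence value per window start; 0.0 means fully conserved.

    The reference is an assumption of this sketch (all columns identical),
    so no neutral substitution rate enters the calculation.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    reference = [1.0, 0.0, 0.0]
    return [
        kl_divergence(reference, window_mle(seq1, seq2, start, width))
        for start in range(len(seq1) - width + 1)
    ]


scores = sliding_scores("ACGTACGT", "ACGTACGA", width=4)
```

In this toy setup, larger values indicate windows that are further from full conservation. A real application would work on a multiple alignment with a phylogenetic substitution model, and the choice of window width would itself need to be optimized, as the abstract indicates.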

CONCLUSION

Our results suggest that discriminating among different degrees of conservation is possible without making assumptions about neutral rates. We find, however, that the approach cannot be expected to discover constraint regions considerably different from those found by GERP and SCONE. Consequently, we conclude that the reported discrepancies between experimentally verified functional elements and computationally identified constraint elements are unlikely to be explained by biased neutral rate estimates.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf5a/2375903/c7cb66283793/1471-2105-9-190-1.jpg
