Suppr超能文献

多重序列比对中极度保守元件的定位算法。

Algorithms for locating extremely conserved elements in multiple sequence alignments.

机构信息

Department of Computer Science and Engineering, University of Washington, Box 352350, Seattle, WA 98195-2350, USA.

出版信息

BMC Bioinformatics. 2009 Dec 18;10:432. doi: 10.1186/1471-2105-10-432.

Abstract

BACKGROUND

In 2004, Bejerano et al. announced the startling discovery of hundreds of "ultraconserved elements", long genomic sequences perfectly conserved across human, mouse, and rat. Their announcement stimulated a flurry of subsequent research.

RESULTS

We generalize the notion of ultraconserved element in a natural way from extraordinary human-rodent conservation to extraordinary conservation over an arbitrary set of species. We call these "Extremely Conserved Elements". There is a linear time algorithm to find all such Extremely Conserved Elements in any multiple sequence alignment, provided that the conservation is required to be across all the aligned species. For the general case of conservation across an arbitrary subset of the aligned species, we show that the question of whether there exists an Extremely Conserved Element is NP-complete. We illustrate the linear time algorithm by cataloguing all 177 Extremely Conserved Elements in the currently available 44-vertebrate whole-genome alignment, and point out some of the characteristics of these elements.

CONCLUSIONS

The NP-completeness in the case of conservation across an arbitrary subset of the aligned species implies that it is unlikely an efficient algorithm exists for this general case. Despite this fact, for the interesting case of conservation across all or most of the aligned species, our algorithm is efficient enough to be practical. The 177 Extremely Conserved Elements that we catalog demonstrate many of the characteristics of the original ultraconserved elements of Bejerano et al.

摘要

背景

2004 年,Bejerano 等人宣布了一项惊人的发现,即在人类、老鼠和大鼠之间存在数百个“超保守元件”,这些元件是长基因组序列,完全保守。他们的宣布激发了随后大量的研究。

结果

我们以一种自然的方式将超保守元件的概念从人类与啮齿动物之间的非凡保守扩展到任意物种之间的非凡保守。我们称这些为“极度保守元件”。有一种线性时间算法可以在任何多重序列比对中找到所有这些极度保守元件,前提是要求保守性跨越所有比对的物种。对于在比对物种的任意子集之间的一般保守情况,我们表明是否存在极度保守元件的问题是 NP 完全的。我们通过在当前可用的 44 个脊椎动物全基因组比对中编目所有 177 个极度保守元件来说明线性时间算法,并指出这些元件的一些特征。

结论

在比对物种的任意子集之间的保守情况下的 NP 完全意味着不太可能存在针对这种一般情况的有效算法。尽管如此,对于所有或大多数比对物种之间的有趣保守情况,我们的算法效率足够高,足以实用。我们编目的 177 个极度保守元件展示了 Bejerano 等人最初的超保守元件的许多特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/08d9/2808710/6e94e453ab44/1471-2105-10-432-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验