Suppr超能文献

编辑距离上的串联重复序列。

Tandem repeats over the edit distance.

作者信息

Sokol Dina, Benson Gary, Tojeira Justin

机构信息

Department of Computer and Information Science, Brooklyn College of the City University of New York, Brooklyn, NY, USA.

出版信息

Bioinformatics. 2007 Jan 15;23(2):e30-5. doi: 10.1093/bioinformatics/btl309.

Abstract

MOTIVATION

A tandem repeat in DNA is a sequence of two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats occur in the genomes of both eukaryotic and prokaryotic organisms. They are important in numerous fields including disease diagnosis, mapping studies, human identity testing (DNA fingerprinting), sequence homology and population studies. Although tandem repeats have been used by biologists for many years, there are few tools available for performing an exhaustive search for all tandem repeats in a given sequence.

RESULTS

In this paper we describe an efficient algorithm for finding all tandem repeats within a sequence, under the edit distance measure. The contributions of this paper are two-fold: theoretical and practical. We present a precise definition for tandem repeats over the edit distance and an efficient, deterministic algorithm for finding these repeats.

AVAILABILITY

The algorithm has been implemented in C++, and the software is available upon request and can be used at http://www.sci.brooklyn.cuny.edu/~sokol/trepeats. The use of this tool will assist biologists in discovering new ways that tandem repeats affect both the structure and function of DNA and protein molecules.

摘要

动机

DNA中的串联重复是指两个或更多相邻的、近似的核苷酸模式拷贝序列。串联重复存在于真核生物和原核生物的基因组中。它们在众多领域都很重要,包括疾病诊断、图谱研究、人类身份测试(DNA指纹识别)、序列同源性和群体研究。尽管生物学家已经使用串联重复多年,但用于在给定序列中彻底搜索所有串联重复的工具却很少。

结果

在本文中,我们描述了一种在编辑距离度量下查找序列中所有串联重复的高效算法。本文的贡献有两个方面:理论和实践。我们给出了基于编辑距离的串联重复的精确定义,以及用于查找这些重复的高效确定性算法。

可用性

该算法已用C++实现,软件可根据要求提供,可在http://www.sci.brooklyn.cuny.edu/~sokol/trepeats使用。使用此工具将有助于生物学家发现串联重复影响DNA和蛋白质分子结构与功能的新方式。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验