An Asymmetric Alignment Algorithm for Estimating Ancestor-Descendant Edit Distance for Tandem Repeats.

Publication information

IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):2080-2091. doi: 10.1109/TCBB.2021.3059239. Epub 2022 Aug 8.

DOI:10.1109/TCBB.2021.3059239
PMID:33587704
Abstract

Tandem repeats are repetitive structures present in some DNA sequences, consisting of many repeated copies of a single motif. They can serve as important markers for phylogenetic and population genetic studies, due to the high polymorphism in the number of motif copies as well as variations in the motif. The first step in using tandem repeats for phylogenetic studies is to estimate the evolutionary distance between a pair D1 and D2 of tandem repeat sequences with homologous motifs. This problem can be broken into two sub-problems: 1) Construct the most recent common ancestor of the sequences. 2) Calculate the evolutionary distance between each sequence and the hypothesised common ancestor. We present an algorithm that estimates the solution to the second problem. This takes the form of an asymmetric alignment algorithm to estimate the evolutionary distance between two tandem repeat sequences A and D, where D is assumed to have descended from A, under a model that allows block duplication, deletion, and variant substitution. The algorithm is asymmetric in the sense that the two input sequences A and D play different roles in the calculations, reflecting the assumption that D descends from A. Our model assumes static motif boundaries, meaning that motif duplication and deletion events must respect the motif boundaries. The algorithm may also be applied without modification to more complex repetitive structures with two or more motifs, such as nested tandem repeats.
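
To make the idea of an asymmetric ancestor-descendant distance concrete, here is a minimal sketch in Python. It is not the published algorithm: it is a simplified dynamic program that assumes each motif copy is encoded as one variant label (one character), that duplication copies only the single immediately preceding motif copy rather than a block, and that all costs are unit costs. The function name `ancestor_descendant_distance` and the parameters `sub_cost`, `del_cost`, and `dup_cost` are hypothetical and chosen for illustration only. Working on whole motif copies mirrors the static-boundary assumption described in the abstract.

```python
import math


def ancestor_descendant_distance(ancestor, descendant,
                                 sub_cost=1.0, del_cost=1.0, dup_cost=1.0):
    """Toy asymmetric distance between two motif-variant sequences.

    Each element of `ancestor` and `descendant` stands for one motif copy
    (hypothetical encoding: one character per variant). Simplifying
    assumptions, not taken from the paper: duplication copies only the
    previous descendant motif, and all costs are constants.
    """
    def mismatch(x, y):
        # Substitution cost between two motif variants.
        return 0.0 if x == y else sub_cost

    n, m = len(ancestor), len(descendant)
    # dp[i][j]: cheapest way to derive descendant[:j] from ancestor[:i].
    dp = [[math.inf] * (m + 1) for _ in range(n + 1)]
    dp[0][0] = 0.0
    for i in range(1, n + 1):
        # An empty descendant prefix means every ancestor copy was deleted.
        dp[i][0] = dp[i - 1][0] + del_cost
    # dp[0][j] stays infinite for j > 0: descendant copies cannot appear
    # from nothing, which is one source of the asymmetry.
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Direct descent, possibly with a variant substitution.
            best = dp[i - 1][j - 1] + mismatch(ancestor[i - 1], descendant[j - 1])
            # The ancestor copy was deleted and left no descendant.
            best = min(best, dp[i - 1][j] + del_cost)
            if j >= 2:
                # The descendant copy is a duplicate of its left neighbour,
                # possibly mutated after duplication.
                best = min(best, dp[i][j - 1] + dup_cost
                           + mismatch(descendant[j - 2], descendant[j - 1]))
            dp[i][j] = best
    return dp[n][m]


forward = ancestor_descendant_distance("a", "ab")   # duplicate, then substitute
backward = ancestor_descendant_distance("ab", "a")  # a single deletion
print(forward, backward)
```

Swapping the two arguments changes the result in this toy model, because gaining a copy requires a duplication of an existing descendant copy while losing a copy is a plain deletion. This illustrates, under the stated assumptions, why the roles of A and D are not interchangeable.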

Similar articles

1
An Asymmetric Alignment Algorithm for Estimating Ancestor-Descendant Edit Distance for Tandem Repeats.
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):2080-2091. doi: 10.1109/TCBB.2021.3059239. Epub 2022 Aug 8.
2
An algorithm to solve the motif alignment problem for approximate nested tandem repeats in biological sequences.
J Comput Biol. 2011 Sep;18(9):1211-8. doi: 10.1089/cmb.2011.0101.
3
Tandem repeats over the edit distance.
Bioinformatics. 2007 Jan 15;23(2):e30-5. doi: 10.1093/bioinformatics/btl309.
4
Estimation of duplication history under a stochastic model for tandem repeats.
BMC Bioinformatics. 2019 Feb 6;20(1):64. doi: 10.1186/s12859-019-2603-1.
5
Identification and characterization of tandem repeats in exon III of dopamine receptor D4 (DRD4) genes from different mammalian species.
DNA Cell Biol. 2005 Dec;24(12):795-804. doi: 10.1089/dna.2005.24.795.
6
An algorithm for approximate tandem repeats.
J Comput Biol. 2001;8(1):1-18. doi: 10.1089/106652701300099038.
7
Evolution of mtDNA D-loop sequences and their use in phylogenetic studies of shrews in the subgenus Otisorex (Sorex: Soricidae: Insectivora).
Mol Phylogenet Evol. 1994 Mar;3(1):38-46. doi: 10.1006/mpev.1994.1005.
8
De novo assembly of bacterial genomes with repetitive DNA regions by dnaasm application.
BMC Bioinformatics. 2018 Jul 18;19(1):273. doi: 10.1186/s12859-018-2281-4.
9
STAR: an algorithm to Search for Tandem Approximate Repeats.
Bioinformatics. 2004 Nov 1;20(16):2812-20. doi: 10.1093/bioinformatics/bth335. Epub 2004 Jun 4.
10
TRedD--a database for tandem repeats over the edit distance.
Database (Oxford). 2010 Jul 6;2010:baq003. doi: 10.1093/database/baq003.