• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多重基因组重排与断点系统发育

Multiple genome rearrangement and breakpoint phylogeny.

作者信息

Sankoff D, Blanchette M

机构信息

Centre de recherches mathématiques, Université de Montréal, Québec, Canada.

出版信息

J Comput Biol. 1998 Fall;5(3):555-70. doi: 10.1089/cmb.1998.5.555.

DOI:10.1089/cmb.1998.5.555
PMID:9773350
Abstract

Multiple alignment of macromolecular sequences generalizes from N = 2 to N > or = 3 the comparison of N sequences which have diverged through the local processes of insertion, deletion and substitution. Gene-order sequences diverge through non-local genome rearrangement processes such as inversion (or reversal) and transposition. In this paper we show which formulations of multiple alignment have counterparts in multiple rearrangement. Based on difficulties inherent in rearrangement edit-distance calculation and interpretation, we argue for the simpler "breakpoint analysis." Consensus-based multiple rearrangement of N > or = 3 orders can be solved exactly through reduction to instances of the Travelling Salesman Problem (TSP). We propose a branch-and-bound solution to TSP particularly suited to these instances. Simulations show how non-uniqueness of the solution is attenuated with increasing numbers of data genomes. Tree-based multiple alignment can be achieved to a great degree of accuracy by decomposing the tree into a number of overlapping 3-stars centered on the non-terminal nodes, and solving the consensus-based problem iteratively for these nodes until convergence. Accuracy improves with very careful initializations at the non-terminal nodes. The degree of non-uniqueness of solutions depends on the position of the node in the tree in terms of path length to the terminal vertices.

摘要

大分子序列的多重比对将N个通过插入、缺失和替换等局部过程而发生分化的序列的比较从N = 2推广到了N≥3。基因顺序序列则通过诸如倒位(或反转)和转座等非局部基因组重排过程发生分化。在本文中,我们展示了多重比对的哪些公式在多重重排中有对应物。基于重排编辑距离计算和解释中固有的困难,我们主张采用更简单的“断点分析”。N≥3个顺序的基于共识的多重重排可以通过简化为旅行商问题(TSP)的实例来精确求解。我们提出了一种特别适用于这些实例的TSP分支定界解决方案。模拟显示了随着数据基因组数量的增加,解决方案的非唯一性是如何减弱的。通过将树分解为以非终端节点为中心的多个重叠三星结构,并对这些节点迭代求解基于共识的问题,直到收敛,可以在很大程度上实现基于树的多重比对。在非终端节点进行非常仔细的初始化时,准确性会提高。解决方案的非唯一性程度取决于树中节点相对于终端顶点的路径长度的位置。

相似文献

1
Multiple genome rearrangement and breakpoint phylogeny.多重基因组重排与断点系统发育
J Comput Biol. 1998 Fall;5(3):555-70. doi: 10.1089/cmb.1998.5.555.
2
Chromosomal breakpoint reuse in genome sequence rearrangement.基因组序列重排中的染色体断点重复利用
J Comput Biol. 2005 Jul-Aug;12(6):812-21. doi: 10.1089/cmb.2005.12.812.
3
A statistically fair comparison of ancestral genome reconstructions, based on breakpoint and rearrangement distances.基于断点和重排距离对祖先基因组重建进行的统计学上公平的比较。
J Comput Biol. 2010 Sep;17(9):1299-314. doi: 10.1089/cmb.2010.0121.
4
Pairwise alignment with rearrangements.带重排的成对比对。
Genome Inform. 2006;17(2):141-51.
5
Algorithms for multiple genome rearrangement by signed reversals.通过有符号反转进行多基因组重排的算法。
Pac Symp Biocomput. 2003:363-74.
6
Using median sets for inferring phylogenetic trees.使用中位数集推断系统发育树。
Bioinformatics. 2007 Jan 15;23(2):e129-35. doi: 10.1093/bioinformatics/btl300.
7
Stability of rearrangement measures in the comparison of genome sequences.基因组序列比较中重排测量的稳定性。
J Comput Biol. 2006 Mar;13(2):554-66. doi: 10.1089/cmb.2006.13.554.
8
Divide-and-conquer approach for the exemplar breakpoint distance.用于示例断点距离的分治法
Bioinformatics. 2005 May 15;21(10):2171-6. doi: 10.1093/bioinformatics/bti327. Epub 2005 Feb 15.
9
Locating rearrangement events in a phylogeny based on highly fragmented assemblies.基于高度碎片化的组装结果在系统发育中定位重排事件。
BMC Genomics. 2016 Jan 11;17 Suppl 1(Suppl 1):1. doi: 10.1186/s12864-015-2294-6.
10
Genome halving and double distance with losses.基因组减半与带损失的双倍距离。
J Comput Biol. 2011 Sep;18(9):1185-99. doi: 10.1089/cmb.2011.0136.

引用本文的文献

1
Pangenome comparison via ED strings.通过编辑距离字符串进行泛基因组比较。
Front Bioinform. 2024 Sep 26;4:1397036. doi: 10.3389/fbinf.2024.1397036. eCollection 2024.
2
Genome Rearrangement Analysis : Cut and Join Genome Rearrangements and Gene Cluster Preserving Approaches.基因组重排分析:切割和连接基因组重排及基因簇保护方法。
Methods Mol Biol. 2024;2802:215-245. doi: 10.1007/978-1-0716-3838-5_9.
3
Detecting gene breakpoints in noisy genome sequences using position-annotated colored de-Bruijn graphs.利用位置注释的有色 de-Bruijn 图检测嘈杂基因组序列中的基因断点。
BMC Bioinformatics. 2023 Jun 5;24(1):235. doi: 10.1186/s12859-023-05371-4.
4
Evaluating impacts of syntenic block detection strategies on rearrangement phylogeny using Mycobacterium tuberculosis isolates.评估基于连锁块检测策略对结核分枝杆菌分离株重排系统发育的影响。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad024.
5
Convergent evolution of polyploid genomes from across the eukaryotic tree of life.多倍体基因组在整个真核生物进化树上的趋同进化。
G3 (Bethesda). 2022 May 30;12(6). doi: 10.1093/g3journal/jkac094.
6
Phylogenetic Reconstruction Based on Synteny Block and Gene Adjacencies.基于同线性块和基因邻接的系统发育重建。
Mol Biol Evol. 2020 Sep 1;37(9):2747-2762. doi: 10.1093/molbev/msaa114.
7
The lasting after-effects of an ancient polyploidy on the genomes of teleosts.古代多倍体事件对硬骨鱼类基因组的持久影响。
PLoS One. 2020 Apr 16;15(4):e0231356. doi: 10.1371/journal.pone.0231356. eCollection 2020.
8
Reconstructing the Phylogeny of Corynebacteriales while Accounting for Horizontal Gene Transfer.重建棒状杆菌目的系统发育时要考虑水平基因转移。
Genome Biol Evol. 2020 Apr 1;12(4):381-395. doi: 10.1093/gbe/evaa058.
9
Rearrangement analysis of multiple bacterial genomes.多株细菌基因组重排分析。
BMC Bioinformatics. 2019 Dec 27;20(Suppl 23):631. doi: 10.1186/s12859-019-3293-4.
10
A New Algorithm for Identifying Genome Rearrangements in the Mammalian Evolution.一种用于识别哺乳动物进化中基因组重排的新算法。
Front Genet. 2019 Oct 29;10:1020. doi: 10.3389/fgene.2019.01020. eCollection 2019.