• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

线性多染色体基因组上的中位数问题:图形表示与快速精确解

The median problems on linear multichromosomal genomes: graph representation and fast exact solutions.

作者信息

Xu Andrew Wei

机构信息

School of Computer and Communication Sciences, Swiss Federal Institute of Technology, Lausanne, Switzerland.

出版信息

J Comput Biol. 2010 Sep;17(9):1195-211. doi: 10.1089/cmb.2010.0106.

DOI:10.1089/cmb.2010.0106
PMID:20874404
Abstract

In genome rearrangement, given a set of genomes G and a distance measure d, the median problem asks for another genome q that minimizes the total distance [Formula: see text]. This is a key problem in genome rearrangement based phylogenetic analysis. Although this problem is known to be NP-hard, we have shown in a previous article, on circular genomes and under the DCJ distance measure, that a family of patterns in the given genomes--represented by adequate subgraphs--allow us to rapidly find exact solutions to the median problem in a decomposition approach. In this article, we extend this result to the case of linear multichromosomal genomes, in order to solve more interesting problems on eukaryotic nuclear genomes. A multi-way capping problem in the linear multichromosomal case imposes an extra computational challenge on top of the difficulty in the circular case, and this difficulty has been underestimated in our previous study and is addressed in this article. We represent the median problem by the capped multiple breakpoint graph, extend the adequate subgraphs into the capped adequate subgraphs, and prove optimality-preserving decomposition theorems, which give us the tools to solve the median problem and the multi-way capping optimization problem together. We also develop an exact algorithm ASMedian-linear, which iteratively detects instances of (capped) adequate subgraphs and decomposes problems into subproblems. Tested on simulated data, ASMedian-linear can rapidly solve most problems with up to several thousand genes, and it also can provide optimal or near-optimal solutions to the median problem under the reversal/HP distance measures. ASMedian-linear is available at http://sites.google.com/site/andrewweixu .

摘要

在基因组重排中,给定一组基因组G和一个距离度量d,中位数问题是寻找另一个基因组q,使总距离[公式:见原文]最小化。这是基于基因组重排的系统发育分析中的一个关键问题。尽管已知该问题是NP难问题,但我们在之前的一篇文章中表明,对于环状基因组且在DCJ距离度量下,给定基因组中的一类模式(由适当子图表示)使我们能够通过分解方法快速找到中位数问题的精确解。在本文中,我们将这一结果扩展到线性多染色体基因组的情况,以便解决关于真核细胞核基因组的更有趣的问题。线性多染色体情况下的多路封顶问题在环状情况的难度之上又带来了额外的计算挑战,而这一难度在我们之前的研究中被低估了,本文将对此进行探讨。我们用封顶的多重中位数问题来表示中位数问题,将适当子图扩展为封顶的适当子图,并证明保持最优性的分解定理,这些定理为我们提供了同时解决中位数问题和多路封顶优化问题的工具。我们还开发了一种精确算法ASMedian-linear,它迭代地检测(封顶的)适当子图的实例,并将问题分解为子问题。在模拟数据上进行测试时,ASMedian-linear能够快速解决大多数包含数千个基因的问题,并且在反转/HP距离度量下,它也能为中位数问题提供最优或接近最优的解。ASMedian-linear可在http://sites.google.com/site/andrewweixu获取。

相似文献

1
The median problems on linear multichromosomal genomes: graph representation and fast exact solutions.线性多染色体基因组上的中位数问题:图形表示与快速精确解
J Comput Biol. 2010 Sep;17(9):1195-211. doi: 10.1089/cmb.2010.0106.
2
A fast and exact algorithm for the median of three problem: a graph decomposition approach.一种用于解决三数中位数问题的快速精确算法:一种图分解方法。
J Comput Biol. 2009 Oct;16(10):1369-81. doi: 10.1089/cmb.2009.0087.
3
SCJ: a breakpoint-like distance that simplifies several rearrangement problems.SCJ:一种类似于断点的距离,可简化多个重排问题。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1318-29. doi: 10.1109/TCBB.2011.34.
4
Multichromosomal median and halving problems under different genomic distances.不同基因组距离下的多染色体中位数和减半问题
BMC Bioinformatics. 2009 Apr 22;10:120. doi: 10.1186/1471-2105-10-120.
5
Restricted DCJ model: rearrangement problems with chromosome reincorporation.受限DCJ模型:带染色体重新纳入的重排问题
J Comput Biol. 2011 Sep;18(9):1231-41. doi: 10.1089/cmb.2011.0116.
6
On the distribution of cycles and paths in multichromosomal breakpoint graphs and the expected value of rearrangement distance.关于多染色体断点图中圈和路径的分布以及重排距离的期望值
BMC Bioinformatics. 2015;16 Suppl 19(Suppl 19):S1. doi: 10.1186/1471-2105-16-S19-S1. Epub 2015 Dec 16.
7
Computation of perfect DCJ rearrangement scenarios with linear and circular chromosomes.具有线性和环状染色体的完美DCJ重排方案的计算。
J Comput Biol. 2009 Oct;16(10):1287-309. doi: 10.1089/cmb.2009.0088.
8
Multi-break rearrangements and breakpoint re-uses: from circular to linear genomes.多断点重排与断点再利用:从环状基因组到线性基因组
J Comput Biol. 2008 Oct;15(8):1117-31. doi: 10.1089/cmb.2008.0080.
9
Algorithms for sorting unsigned linear genomes by the DCJ operations.基于 DCJ 操作的无符号线性基因组排序算法。
Bioinformatics. 2011 Feb 1;27(3):311-6. doi: 10.1093/bioinformatics/btq674. Epub 2010 Dec 6.
10
Algebraic double cut and join : A group-theoretic approach to the operator on multichromosomal genomes.代数双切割与连接:一种关于多染色体基因组上算子的群论方法。
J Math Biol. 2015 Nov;71(5):1149-78. doi: 10.1007/s00285-014-0852-1. Epub 2014 Dec 11.

引用本文的文献

1
Evaluating impacts of syntenic block detection strategies on rearrangement phylogeny using Mycobacterium tuberculosis isolates.评估基于连锁块检测策略对结核分枝杆菌分离株重排系统发育的影响。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad024.
2
On the rank-distance median of 3 permutations.关于 3 个排列的秩距中值。
BMC Bioinformatics. 2018 May 8;19(Suppl 6):142. doi: 10.1186/s12859-018-2131-4.
3
Fast ancestral gene order reconstruction of genomes with unequal gene content.具有不等基因含量的基因组的快速祖先基因顺序重建
BMC Bioinformatics. 2016 Nov 11;17(Suppl 14):413. doi: 10.1186/s12859-016-1261-9.
4
Reconstruction of ancestral gene orders using intermediate genomes.利用中间基因组重建祖先基因顺序
BMC Bioinformatics. 2015;16 Suppl 14(Suppl 14):S3. doi: 10.1186/1471-2105-16-S14-S3. Epub 2015 Oct 2.