Suppr超能文献

一种基于双切割连接排序的中位数求解器与系统发育推断。

A Median Solver and Phylogenetic Inference Based on Double-Cut-and-Join Sorting.

作者信息

Xia Ruofan, Lin Yu, Zhou Jun, Feng Bing, Tang Jijun

机构信息

1 School of Computer Science and Technology, Tianjin University , Tianjin, China .

2 Department of Computer Science and Engineering, University of South Carolina , Columbia, South Carolina.

出版信息

J Comput Biol. 2018 Mar;25(3):302-312. doi: 10.1089/cmb.2017.0157. Epub 2017 Oct 16.

Abstract

Genome rearrangement is known as one of the main evolutionary mechanisms on the genomic level. Phylogenetic analysis based on rearrangement played a crucial role in biological research in the past decades, especially with the increasing availability of fully sequenced genomes. In general, phylogenetic analysis aims to solve two problems: small parsimony problem (SPP) and big parsimony problem (BPP). Maximum parsimony is a popular approach for SPP and BPP, which relies on iteratively solving an NP-hard problem, the median problem. As a result, current median solvers and phylogenetic inference methods based on the median problem all face serious problems on scalability and cannot be applied to data sets with large and distant genomes. In this article, we propose a new median solver for gene order data that combines double-cut-and-join sorting with the simulated annealing algorithm. Based on this median solver, we built a new phylogenetic inference method to solve both SPP and BPP problems. Our experimental results show that the new median solver achieves an excellent performance on simulated data sets, and the phylogenetic inference tool built based on the new median solver has a better performance than other existing methods.

摘要

基因组重排是基因组水平上主要的进化机制之一。在过去几十年中,基于重排的系统发育分析在生物学研究中发挥了关键作用,尤其是随着全基因组测序数据的日益丰富。一般来说,系统发育分析旨在解决两个问题:小简约问题(SPP)和大约简问题(BPP)。最大简约法是解决SPP和BPP的常用方法,它依赖于迭代求解一个NP难问题——中位数问题。因此,当前基于中位数问题的中位数求解器和系统发育推断方法在可扩展性方面都面临严重问题,无法应用于具有大量且差异较大基因组的数据集。在本文中,我们提出了一种新的基因顺序数据中位数求解器,它将双切割连接排序与模拟退火算法相结合。基于这个中位数求解器,我们构建了一种新的系统发育推断方法来解决SPP和BPP问题。我们的实验结果表明,新的中位数求解器在模拟数据集上表现优异,并且基于新中位数求解器构建的系统发育推断工具比其他现有方法具有更好的性能。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验