• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

支架填充在断点下和相关距离。

Scaffold filling under the breakpoint and related distances.

机构信息

School of Computer Science and Technology, Shandong University, Jinan, Shandong 210500, China.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1220-9. doi: 10.1109/TCBB.2012.57.

DOI:10.1109/TCBB.2012.57
PMID:22529329
Abstract

Motivated by the trend of genome sequencing without completing the sequence of the whole genomes, a problem on filling an incomplete multichromosomal genome (or scaffold) I with respect to a complete target genome G was studied. The objective is to minimize the resulting genomic distance between I' and G, where I' is the corresponding filled scaffold. We call this problem the onesided scaffold filling problem. In this paper, we conduct a systematic study for the scaffold filling problem under the breakpoint distance and its variants, for both unichromosomal and multichromosomal genomes (with and without gene repetitions). When the input genome contains no gene repetition (i.e., is a fragment of a permutation), we show that the two-sided scaffold filling problem (i.e., G is also incomplete) is polynomially solvable for unichromosomal genomes under the breakpoint distance and for multichromosomal genomes under the genomic (or DCJ--Double-Cut-and-Join) distance. However, when the input genome contains some repeated genes, even the one-sided scaffold filling problem becomes NP-complete when the similarity measure is the maximum number of adjacencies between two sequences. For this problem, we also present efficient constant-factor approximation algorithms: factor-2 for the general case and factor 1.33 for the one-sided case.

摘要

受基因组测序趋势的启发,在不完成整个基因组序列的情况下,研究了一个关于用完整目标基因组 G 填充不完整的多染色体基因组(或支架)I 的问题。目标是使 I'和 G 之间的基因组距离最小化,其中 I'是相应的填充支架。我们将这个问题称为单边支架填充问题。在本文中,我们对支架填充问题进行了系统的研究,包括断点距离及其变体,适用于单染色体和多染色体基因组(有和没有基因重复)。当输入基因组不含基因重复(即,是排列的一个片段)时,我们证明了在断点距离下,双边支架填充问题(即 G 也是不完整的)对于单染色体基因组是多项式可解的,对于基因组(或 DCJ--Double-Cut-and-Join)距离下的多染色体基因组也是多项式可解的。然而,当输入基因组包含一些重复基因时,即使相似性度量是两个序列之间邻接的最大数量,单边支架填充问题也变得 NP 完全。对于这个问题,我们还提出了有效的常数因子近似算法:对于一般情况为 2 倍,对于单边情况为 1.33 倍。

相似文献

1
Scaffold filling under the breakpoint and related distances.支架填充在断点下和相关距离。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1220-9. doi: 10.1109/TCBB.2012.57.
2
An improved approximation algorithm for scaffold filling to maximize the common adjacencies.一种改进的支架填充近似算法,以最大化公共邻接。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Jul-Aug;10(4):905-13. doi: 10.1109/TCBB.2013.100.
3
A 2-approximation algorithm for the contig-based genomic scaffold filling problem.
J Bioinform Comput Biol. 2018 Dec;16(6):1850022. doi: 10.1142/S0219720018500221.
4
Algorithms for sorting unsigned linear genomes by the DCJ operations.基于 DCJ 操作的无符号线性基因组排序算法。
Bioinformatics. 2011 Feb 1;27(3):311-6. doi: 10.1093/bioinformatics/btq674. Epub 2010 Dec 6.
5
Multichromosomal median and halving problems under different genomic distances.不同基因组距离下的多染色体中位数和减半问题
BMC Bioinformatics. 2009 Apr 22;10:120. doi: 10.1186/1471-2105-10-120.
6
SCJ: a breakpoint-like distance that simplifies several rearrangement problems.SCJ:一种类似于断点的距离,可简化多个重排问题。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1318-29. doi: 10.1109/TCBB.2011.34.
7
The median problems on linear multichromosomal genomes: graph representation and fast exact solutions.线性多染色体基因组上的中位数问题:图形表示与快速精确解
J Comput Biol. 2010 Sep;17(9):1195-211. doi: 10.1089/cmb.2010.0106.
8
On Computing Breakpoint Distances for Genomes with Duplicate Genes.关于计算具有重复基因的基因组的断点距离
J Comput Biol. 2017 Jun;24(6):571-580. doi: 10.1089/cmb.2016.0149. Epub 2016 Oct 27.
9
Approximating the DCJ distance of balanced genomes in linear time.在线性时间内近似平衡基因组的DCJ距离。
Algorithms Mol Biol. 2017 Mar 9;12:3. doi: 10.1186/s13015-017-0095-y. eCollection 2017.
10
An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes.一种用于计算具有重复基因的基因组的双切割连接距离的精确算法。
J Comput Biol. 2015 May;22(5):425-35. doi: 10.1089/cmb.2014.0096. Epub 2014 Dec 17.

引用本文的文献

1
Filling a Protein Scaffold With a Reference.用参考物填充蛋白质支架。
IEEE Trans Nanobioscience. 2017 Mar;16(2):123-130. doi: 10.1109/TNB.2017.2666780. Epub 2017 Feb 9.