• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在线性时间内近似平衡基因组的DCJ距离。

Approximating the DCJ distance of balanced genomes in linear time.

作者信息

Rubert Diego P, Feijão Pedro, Braga Marília Dias Vieira, Stoye Jens, Martinez Fábio Henrique Viduani

机构信息

Faculdade de Computação, Universidade Federal de Mato Grosso do Sul, Campo Grande, MS Brazil.

Faculty of Technology and Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, Germany.

出版信息

Algorithms Mol Biol. 2017 Mar 9;12:3. doi: 10.1186/s13015-017-0095-y. eCollection 2017.

DOI:10.1186/s13015-017-0095-y
PMID:28293275
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5345319/
Abstract

BACKGROUND

Rearrangements are large-scale mutations in genomes, responsible for complex changes and structural variations. Most rearrangements that modify the organization of a genome can be represented by the double cut and join (DCJ) operation. Given two balanced genomes, i.e., two genomes that have exactly the same number of occurrences of each gene in each genome, we are interested in the problem of computing the rearrangement distance between them, i.e., finding the minimum number of DCJ operations that transform one genome into the other. This problem is known to be NP-hard.

RESULTS

We propose a linear time approximation algorithm with approximation factor () for the DCJ distance problem, where is the maximum number of occurrences of any gene in the input genomes. Our algorithm works for linear and circular unichromosomal balanced genomes and uses as an intermediate step an ()-approximation for the minimum common string partition problem, which is closely related to the DCJ distance problem.

CONCLUSIONS

Experiments on simulated data sets show that our approximation algorithm is very competitive both in efficiency and in quality of the solutions.

摘要

背景

重排是基因组中的大规模突变,会导致复杂的变化和结构变异。大多数改变基因组组织的重排都可以用双切割与连接(DCJ)操作来表示。给定两个平衡基因组,即每个基因组中每个基因出现次数完全相同的两个基因组,我们关注计算它们之间重排距离的问题,也就是找到将一个基因组转化为另一个基因组所需的最少DCJ操作次数。已知这个问题是NP难的。

结果

我们针对DCJ距离问题提出了一种近似因子为()的线性时间近似算法,其中是输入基因组中任何基因的最大出现次数。我们的算法适用于线性和环状单染色体平衡基因组,并将最小公共字符串划分问题的()近似作为中间步骤,该问题与DCJ距离问题密切相关。

结论

在模拟数据集上进行的实验表明,我们的近似算法在效率和解决方案质量方面都非常有竞争力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/fc9acb68d617/13015_2017_95_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/495af67aa938/13015_2017_95_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/69c0568ebe00/13015_2017_95_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/602f009561e0/13015_2017_95_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/3482027f993b/13015_2017_95_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/bbb796c2dc6d/13015_2017_95_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/f3ede1b8a73d/13015_2017_95_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/84f1571fdea2/13015_2017_95_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/a6c25c285d9e/13015_2017_95_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/a1e5b3c77d5e/13015_2017_95_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/b43ea2fecfb3/13015_2017_95_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/fc9acb68d617/13015_2017_95_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/495af67aa938/13015_2017_95_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/69c0568ebe00/13015_2017_95_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/602f009561e0/13015_2017_95_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/3482027f993b/13015_2017_95_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/bbb796c2dc6d/13015_2017_95_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/f3ede1b8a73d/13015_2017_95_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/84f1571fdea2/13015_2017_95_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/a6c25c285d9e/13015_2017_95_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/a1e5b3c77d5e/13015_2017_95_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/b43ea2fecfb3/13015_2017_95_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3bc7/5345319/fc9acb68d617/13015_2017_95_Fig11_HTML.jpg

相似文献

1
Approximating the DCJ distance of balanced genomes in linear time.在线性时间内近似平衡基因组的DCJ距离。
Algorithms Mol Biol. 2017 Mar 9;12:3. doi: 10.1186/s13015-017-0095-y. eCollection 2017.
2
Algorithms for sorting unsigned linear genomes by the DCJ operations.基于 DCJ 操作的无符号线性基因组排序算法。
Bioinformatics. 2011 Feb 1;27(3):311-6. doi: 10.1093/bioinformatics/btq674. Epub 2010 Dec 6.
3
Sorting Linear Genomes with Rearrangements and Indels.通过重排和插入缺失对线性基因组进行排序
IEEE/ACM Trans Comput Biol Bioinform. 2015 May-Jun;12(3):500-6. doi: 10.1109/TCBB.2014.2329297.
4
On the family-free DCJ distance and similarity.关于无家族的DCJ距离和相似度。
Algorithms Mol Biol. 2015 Apr 1;10:13. doi: 10.1186/s13015-015-0041-9. eCollection 2015.
5
Computation of perfect DCJ rearrangement scenarios with linear and circular chromosomes.具有线性和环状染色体的完美DCJ重排方案的计算。
J Comput Biol. 2009 Oct;16(10):1287-309. doi: 10.1089/cmb.2009.0088.
6
Recombinations, chains and caps: resolving problems with the DCJ-indel model.重组、链与端粒帽:用DCJ-插入缺失模型解决问题
Algorithms Mol Biol. 2024 Feb 27;19(1):8. doi: 10.1186/s13015-024-00253-7.
7
An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes.一种用于计算具有重复基因的基因组的双切割连接距离的精确算法。
J Comput Biol. 2015 May;22(5):425-35. doi: 10.1089/cmb.2014.0096. Epub 2014 Dec 17.
8
Restricted DCJ-indel model: sorting linear genomes with DCJ and indels.受限 DCJ 插入缺失模型:使用 DCJ 和插入缺失对线性基因组进行排序。
BMC Bioinformatics. 2012;13 Suppl 19(Suppl 19):S14. doi: 10.1186/1471-2105-13-S19-S14. Epub 2012 Dec 19.
9
On the rank-distance median of 3 permutations.关于 3 个排列的秩距中值。
BMC Bioinformatics. 2018 May 8;19(Suppl 6):142. doi: 10.1186/s12859-018-2131-4.
10
Computing the Inversion-Indel Distance.计算倒位-插入缺失距离。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2314-2326. doi: 10.1109/TCBB.2020.2988950. Epub 2021 Dec 8.

引用本文的文献

1
Natural family-free genomic distance.自然的无家族基因组距离。
Algorithms Mol Biol. 2021 May 10;16(1):4. doi: 10.1186/s13015-021-00183-8.
2
The distance and median problems in the single-cut-or-join model with single-gene duplications.具有单基因复制的单切割或连接模型中的距离和中位数问题。
Algorithms Mol Biol. 2020 May 4;15:8. doi: 10.1186/s13015-020-00169-y. eCollection 2020.
3
Computing the family-free DCJ similarity.计算无亲缘关系的 DCJ 相似度。

本文引用的文献

1
An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes.一种用于计算具有重复基因的基因组的双切割连接距离的精确算法。
J Comput Biol. 2015 May;22(5):425-35. doi: 10.1089/cmb.2014.0096. Epub 2014 Dec 17.
2
Inapproximability of (1,2)-exemplar distance.(1,2)-典范距离的不可近似性。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Nov-Dec;10(6):1384-90. doi: 10.1109/TCBB.2012.144.
3
The solution space of sorting by DCJ.基于DCJ排序的解空间
BMC Bioinformatics. 2018 May 8;19(Suppl 6):152. doi: 10.1186/s12859-018-2130-5.
J Comput Biol. 2010 Sep;17(9):1145-65. doi: 10.1089/cmb.2010.0109.
4
Efficient tools for computing the number of breakpoints and the number of adjacencies between two genomes with duplicate genes.用于计算具有重复基因的两个基因组之间断点数量和邻接数量的高效工具。
J Comput Biol. 2008 Oct;15(8):1093-115. doi: 10.1089/cmb.2008.0061.
5
Efficient sorting of genomic permutations by translocation, inversion and block interchange.通过易位、倒位和块交换对基因组排列进行高效排序。
Bioinformatics. 2005 Aug 15;21(16):3340-6. doi: 10.1093/bioinformatics/bti535. Epub 2005 Jun 9.