• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种考虑基因顺序和基因间大小的反转与转座距离的改进近似算法。

An improved approximation algorithm for the reversal and transposition distance considering gene order and intergenic sizes.

作者信息

Brito Klairton L, Oliveira Andre R, Alexandrino Alexsandro O, Dias Ulisses, Dias Zanoni

机构信息

Institute of Computing, University of Campinas, 1251 Albert Einstein Ave., 13083-852, Campinas, Brazil.

School of Technology, University of Campinas, 1888 Paschoal Marmo St., 13484-332, Limeira, Brazil.

出版信息

Algorithms Mol Biol. 2021 Dec 29;16(1):24. doi: 10.1186/s13015-021-00203-7.

DOI:10.1186/s13015-021-00203-7
PMID:34965857
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8717661/
Abstract

BACKGROUND

In the comparative genomics field, one of the goals is to estimate a sequence of genetic changes capable of transforming a genome into another. Genome rearrangement events are mutations that can alter the genetic content or the arrangement of elements from the genome. Reversal and transposition are two of the most studied genome rearrangement events. A reversal inverts a segment of a genome while a transposition swaps two consecutive segments. Initial studies in the area considered only the order of the genes. Recent works have incorporated other genetic information in the model. In particular, the information regarding the size of intergenic regions, which are structures between each pair of genes and in the extremities of a linear genome.

RESULTS AND CONCLUSIONS

In this work, we investigate the SORTING BY INTERGENIC REVERSALS AND TRANSPOSITIONS problem on genomes sharing the same set of genes, considering the cases where the orientation of genes is known and unknown. Besides, we explored a variant of the problem, which generalizes the transposition event. As a result, we present an approximation algorithm that guarantees an approximation factor of 4 for both cases considering the reversal and transposition (classic definition) events, an improvement from the 4.5-approximation previously known for the scenario where the orientation of the genes is unknown. We also present a 3-approximation algorithm by incorporating the generalized transposition event, and we propose a greedy strategy to improve the performance of the algorithms. We performed practical tests adopting simulated data which indicated that the algorithms, in both cases, tend to perform better when compared with the best-known algorithms for the problem. Lastly, we conducted experiments using real genomes to demonstrate the applicability of the algorithms.

摘要

背景

在比较基因组学领域,目标之一是估计能够将一个基因组转化为另一个基因组的遗传变化序列。基因组重排事件是能够改变基因组中遗传内容或元件排列的突变。反转和转座是研究最多的两种基因组重排事件。反转会使基因组的一段序列发生倒置,而转座会交换两个连续的片段。该领域的初步研究仅考虑基因的顺序。最近的研究在模型中纳入了其他遗传信息。特别是关于基因间区域大小的信息,基因间区域是线性基因组中每对基因之间以及两端的结构。

结果与结论

在这项工作中,我们研究了具有相同基因集的基因组上的基因间反转和转座排序问题,考虑了基因方向已知和未知的情况。此外,我们探索了该问题的一个变体,它推广了转座事件。结果,我们提出了一种近似算法,对于考虑反转和转座(经典定义)事件的两种情况,都保证了4倍的近似因子,相比之前已知的基因方向未知情况下的4.5倍近似有了改进。我们还通过纳入广义转座事件提出了一种3倍近似算法,并提出了一种贪心策略来提高算法的性能。我们采用模拟数据进行了实际测试,结果表明,与该问题的最知名算法相比,这两种情况下的算法往往表现更好。最后,我们使用真实基因组进行了实验,以证明算法的适用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/93f196e548f9/13015_2021_203_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/1f9c2cf110e3/13015_2021_203_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/9cfcefb6ae03/13015_2021_203_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/25ca842eec1b/13015_2021_203_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/2fa296a1d4ea/13015_2021_203_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/47f3671a5e03/13015_2021_203_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/93f196e548f9/13015_2021_203_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/1f9c2cf110e3/13015_2021_203_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/9cfcefb6ae03/13015_2021_203_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/25ca842eec1b/13015_2021_203_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/2fa296a1d4ea/13015_2021_203_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/47f3671a5e03/13015_2021_203_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c47/8717661/93f196e548f9/13015_2021_203_Fig6_HTML.jpg

相似文献

1
An improved approximation algorithm for the reversal and transposition distance considering gene order and intergenic sizes.一种考虑基因顺序和基因间大小的反转与转座距离的改进近似算法。
Algorithms Mol Biol. 2021 Dec 29;16(1):24. doi: 10.1186/s13015-021-00203-7.
2
Sorting by Genome Rearrangements on Both Gene Order and Intergenic Sizes.基于基因顺序和基因间大小的基因组重排排序
J Comput Biol. 2020 Feb;27(2):156-174. doi: 10.1089/cmb.2019.0293. Epub 2019 Dec 18.
3
Sorting Permutations by Intergenic Operations.按基因间操作对排列进行分类。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2080-2093. doi: 10.1109/TCBB.2021.3077418. Epub 2021 Dec 8.
4
Reversal and Transposition Distance on Unbalanced Genomes Using Intergenic Information.利用基因间信息计算不平衡基因组的反转和转位距离
J Comput Biol. 2023 Aug;30(8):861-876. doi: 10.1089/cmb.2023.0087. Epub 2023 May 24.
5
Reversals and transpositions distance with proportion restriction.反转和转位与比例限制的距离。
J Bioinform Comput Biol. 2021 Aug;19(4):2150013. doi: 10.1142/S021972002150013X. Epub 2021 Jun 23.
6
Genome Rearrangement Distance With a Flexible Intergenic Regions Aspect.
IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):1641-1653. doi: 10.1109/TCBB.2022.3165443. Epub 2023 Jun 5.
7
Heuristics for Genome Rearrangement Distance With Replicated Genes.含重复基因的基因组重排距离的启发式算法。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2094-2108. doi: 10.1109/TCBB.2021.3095021. Epub 2021 Dec 8.
8
Approximation algorithm for rearrangement distances considering repeated genes and intergenic regions.考虑重复基因和基因间区域的重排距离近似算法。
Algorithms Mol Biol. 2021 Oct 13;16(1):21. doi: 10.1186/s13015-021-00200-w.
9
Incorporating intergenic regions into reversal and transposition distances with indels.将基因间区纳入插入缺失的倒位和转座距离中。
J Bioinform Comput Biol. 2021 Dec;19(6):2140011. doi: 10.1142/S0219720021400114. Epub 2021 Nov 13.
10
Rearrangement distance with reversals, indels, and moves in intergenic regions on signed and unsigned permutations.带有倒位、缺失和基因间区域移动的重排距离在有符号和无符号排列上。
J Bioinform Comput Biol. 2023 Apr;21(2):2350009. doi: 10.1142/S0219720023500099. Epub 2023 Apr 27.

本文引用的文献

1
Sorting Permutations by Intergenic Operations.按基因间操作对排列进行分类。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2080-2093. doi: 10.1109/TCBB.2021.3077418. Epub 2021 Dec 8.
2
Cyanorak v2.1: a scalable information system dedicated to the visualization and expert curation of marine and brackish picocyanobacteria genomes.Cyanorak v2.1:一个可扩展的信息系统,专门用于海洋和半咸水蓝藻基因组的可视化和专家编辑。
Nucleic Acids Res. 2021 Jan 8;49(D1):D667-D676. doi: 10.1093/nar/gkaa958.
3
Sorting Signed Permutations by Intergenic Reversals.
按基因间倒位对有符号排列进行分类。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2870-2876. doi: 10.1109/TCBB.2020.2993002. Epub 2021 Dec 8.
4
Sorting by Genome Rearrangements on Both Gene Order and Intergenic Sizes.基于基因顺序和基因间大小的基因组重排排序
J Comput Biol. 2020 Feb;27(2):156-174. doi: 10.1089/cmb.2019.0293. Epub 2019 Dec 18.
5
Super short operations on both gene order and intergenic sizes.基因顺序和基因间大小的超短操作。
Algorithms Mol Biol. 2019 Nov 5;14:21. doi: 10.1186/s13015-019-0156-5. eCollection 2019.
6
Treeio: An R Package for Phylogenetic Tree Input and Output with Richly Annotated and Associated Data.Treeio:一个用于系统发育树输入和输出的 R 包,具有丰富的注释和相关数据。
Mol Biol Evol. 2020 Feb 1;37(2):599-603. doi: 10.1093/molbev/msz240.
7
On the Complexity of Sorting by Reversals and Transpositions Problems.关于通过反转和转置问题进行排序的复杂性
J Comput Biol. 2019 Nov;26(11):1223-1229. doi: 10.1089/cmb.2019.0078. Epub 2019 May 23.
8
Algorithms for computing the double cut and join distance on both gene order and intergenic sizes.用于计算基因顺序和基因间大小的双切割与连接距离的算法。
Algorithms Mol Biol. 2017 Jun 5;12:16. doi: 10.1186/s13015-017-0107-y. eCollection 2017.
9
Genome rearrangements with indels in intergenes restrict the scenario space.基因间存在插入缺失的基因组重排限制了情景空间。
BMC Bioinformatics. 2016 Nov 11;17(Suppl 14):426. doi: 10.1186/s12859-016-1264-6.
10
Breaking Good: Accounting for Fragility of Genomic Regions in Rearrangement Distance Estimation.打破常规:在重排距离估计中考虑基因组区域的脆弱性
Genome Biol Evol. 2016 May 22;8(5):1427-39. doi: 10.1093/gbe/evw083.