• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于转座事件的系统发育重建。

Phylogenetic reconstruction from transpositions.

作者信息

Yue Feng, Zhang Meng, Tang Jijun

机构信息

Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USA.

出版信息

BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S15. doi: 10.1186/1471-2164-9-S2-S15.

DOI:10.1186/1471-2164-9-S2-S15
PMID:18831780
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2559879/
Abstract

BACKGROUND

Because of the advent of high-throughput sequencing and the consequent reduction in the cost of sequencing, many organisms have been completely sequenced and most of their genes identified. It thus has become possible to represent whole genomes as ordered lists of gene identifiers and to study the rearrangement of these entities through computational means. As a result, genome rearrangement data has attracted increasing attentions from both biologists and computer scientists as a new type of data for phylogenetic analysis. The main events of genome rearrangements include inversions, transpositions and transversions. To date, GRAPPA and MGR are the most accurate methods for rearrangement phylogeny, both assuming inversion as the only event. However, due to the complexity of computing transposition distance, it is very difficult to analyze datasets when transpositions are dominant.

RESULTS

We extend GRAPPA to handle transpositions. The new method is named GRAPPA-TP, with two major extensions: a heuristic method to estimate transposition distance, and a new transposition median solver for three genomes. Although GRAPPA-TP uses a greedy approach to compute the transposition distance, it is very accurate when genomes are relatively close. The new GRAPPA-TP is available from http://phylo.cse.sc.edu/.

CONCLUSION

Our extensive testing using simulated datasets shows that GRAPPA-TP is very accurate in terms of ancestor genome inference and phylogenetic reconstruction. Simulation results also suggest that model match is critical in genome rearrangement analysis: it is not accurate to simulate transpositions with other events including inversions.

摘要

背景

由于高通量测序技术的出现以及随之而来的测序成本降低,许多生物体已被完全测序,并且其大部分基因已被识别。因此,将整个基因组表示为基因标识符的有序列表并通过计算手段研究这些实体的重排成为可能。结果,基因组重排数据作为一种用于系统发育分析的新型数据,已引起生物学家和计算机科学家越来越多的关注。基因组重排的主要事件包括倒位、转座和颠换。迄今为止,GRAPPA和MGR是重排系统发育分析中最准确的方法,两者都假设倒位是唯一的事件。然而,由于计算转座距离的复杂性,当转座占主导时,很难分析数据集。

结果

我们对GRAPPA进行了扩展以处理转座。新方法名为GRAPPA-TP,有两个主要扩展:一种估计转座距离的启发式方法,以及一种用于三个基因组的新的转座中位数求解器。虽然GRAPPA-TP使用贪心方法来计算转座距离,但当基因组相对接近时,它非常准确。新的GRAPPA-TP可从http://phylo.cse.sc.edu/获取。

结论

我们使用模拟数据集进行的广泛测试表明,GRAPPA-TP在祖先基因组推断和系统发育重建方面非常准确。模拟结果还表明,模型匹配在基因组重排分析中至关重要:用包括倒位在内的其他事件来模拟转座是不准确的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/49a5abf6bc18/1471-2164-9-S2-S15-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/0f29a7648ce4/1471-2164-9-S2-S15-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/584bbcf971d6/1471-2164-9-S2-S15-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/7c7dbb7e5b57/1471-2164-9-S2-S15-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/5591f14a21d8/1471-2164-9-S2-S15-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/2f74f32e4f83/1471-2164-9-S2-S15-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/f8c339b6e9be/1471-2164-9-S2-S15-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/49a5abf6bc18/1471-2164-9-S2-S15-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/0f29a7648ce4/1471-2164-9-S2-S15-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/584bbcf971d6/1471-2164-9-S2-S15-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/7c7dbb7e5b57/1471-2164-9-S2-S15-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/5591f14a21d8/1471-2164-9-S2-S15-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/2f74f32e4f83/1471-2164-9-S2-S15-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/f8c339b6e9be/1471-2164-9-S2-S15-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/2559879/49a5abf6bc18/1471-2164-9-S2-S15-7.jpg

相似文献

1
Phylogenetic reconstruction from transpositions.基于转座事件的系统发育重建。
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S15. doi: 10.1186/1471-2164-9-S2-S15.
2
Gene rearrangement analysis and ancestral order inference from chloroplast genomes with inverted repeat.基于具有反向重复序列的叶绿体基因组进行基因重排分析和祖先顺序推断。
BMC Genomics. 2008;9 Suppl 1(Suppl 1):S25. doi: 10.1186/1471-2164-9-S1-S25.
3
A fast algorithm for the multiple genome rearrangement problem with weighted reversals and transpositions.一种用于解决带权重反转和转置的多基因组重排问题的快速算法。
BMC Bioinformatics. 2008 Dec 4;9:516. doi: 10.1186/1471-2105-9-516.
4
Improving reversal median computation using commuting reversals and cycle information.利用可交换反转和循环信息改进反转中位数计算。
J Comput Biol. 2008 Oct;15(8):1079-92. doi: 10.1089/cmb.2008.0116.
5
Reconstruction of ancestral gene orders using probabilistic and gene encoding approaches.使用概率和基因编码方法重建祖先基因顺序。
PLoS One. 2014 Oct 10;9(10):e108796. doi: 10.1371/journal.pone.0108796. eCollection 2014.
6
Ancestral genome inference using a genetic algorithm approach.基于遗传算法的祖先基因组推断。
PLoS One. 2013 May 2;8(5):e62156. doi: 10.1371/journal.pone.0062156. Print 2013.
7
Distance-based genome rearrangement phylogeny.基于距离的基因组重排系统发育学。
J Mol Evol. 2006 Oct;63(4):473-83. doi: 10.1007/s00239-005-0216-y. Epub 2006 Oct 4.
8
Heuristics for Genome Rearrangement Distance With Replicated Genes.含重复基因的基因组重排距离的启发式算法。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2094-2108. doi: 10.1109/TCBB.2021.3095021. Epub 2021 Dec 8.
9
Sorting by weighted reversals, transpositions, and inverted transpositions.通过加权反转、转位和反向转位进行排序。
J Comput Biol. 2007 Jun;14(5):615-36. doi: 10.1089/cmb.2007.R006.
10
On the Complexity of Sorting by Reversals and Transpositions Problems.关于通过反转和转置问题进行排序的复杂性
J Comput Biol. 2019 Nov;26(11):1223-1229. doi: 10.1089/cmb.2019.0078. Epub 2019 May 23.

引用本文的文献

1
Complete chloroplast genome sequences of important oilseed crop Sesamum indicum L.重要油料作物芝麻的完整叶绿体基因组序列
PLoS One. 2012;7(5):e35872. doi: 10.1371/journal.pone.0035872. Epub 2012 May 14.
2
Genomics, molecular imaging, bioinformatics, and bio-nano-info integration are synergistic components of translational medicine and personalized healthcare research.基因组学、分子成像、生物信息学以及生物纳米信息整合是转化医学和个性化医疗研究的协同组成部分。
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):I1. doi: 10.1186/1471-2164-9-S2-I1.

本文引用的文献

1
Genome rearrangement based on reversals that preserve conserved intervals.基于保留保守区间的反转操作的基因组重排。
IEEE/ACM Trans Comput Biol Bioinform. 2006 Jul-Sep;3(3):275-88. doi: 10.1109/TCBB.2006.38.
2
Efficient sorting of genomic permutations by translocation, inversion and block interchange.通过易位、倒位和块交换对基因组排列进行高效排序。
Bioinformatics. 2005 Aug 15;21(16):3340-6. doi: 10.1093/bioinformatics/bti535. Epub 2005 Jun 9.
3
Scaling up accurate phylogenetic reconstruction from gene-order data.从基因顺序数据扩大准确的系统发育重建。
Bioinformatics. 2003;19 Suppl 1:i305-12. doi: 10.1093/bioinformatics/btg1042.
4
Fast phylogenetic methods for the analysis of genome rearrangement data: an empirical study.用于分析基因组重排数据的快速系统发育方法:一项实证研究。
Pac Symp Biocomput. 2002:524-35.
5
Genome-scale evolution: reconstructing gene orders in the ancestral species.基因组规模的进化:重建祖先物种中的基因顺序。
Genome Res. 2002 Jan;12(1):26-36.
6
A linear-time algorithm for computing inversion distance between signed permutations with an experimental study.一种用于计算带符号排列之间反转距离的线性时间算法及实验研究
J Comput Biol. 2001;8(5):483-91. doi: 10.1089/106652701753216503.
7
A new implementation and detailed study of breakpoint analysis.断点分析的一种新实现与详细研究。
Pac Symp Biocomput. 2001:583-94. doi: 10.1142/9789814447362_0056.
8
Big trees from little genomes: mitochondrial gene order as a phylogenetic tool.小基因组孕育大树:线粒体基因顺序作为一种系统发育工具
Curr Opin Genet Dev. 1998 Dec;8(6):668-74. doi: 10.1016/s0959-437x(98)80035-x.
9
The highly rearranged chloroplast genome of Trachelium caeruleum (Campanulaceae): multiple inversions, inverted repeat expansion and contraction, transposition, insertions/deletions, and several repeat families.天蓝风铃草(桔梗科)高度重排的叶绿体基因组:多次倒位、反向重复序列的扩增与收缩、转座、插入/缺失以及多个重复序列家族
Curr Genet. 1997 May;31(5):419-29. doi: 10.1007/s002940050225.
10
The neighbor-joining method: a new method for reconstructing phylogenetic trees.邻接法:一种重建系统发育树的新方法。
Mol Biol Evol. 1987 Jul;4(4):406-25. doi: 10.1093/oxfordjournals.molbev.a040454.