• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ComPhy:从全基因组基因集推断的原核生物复合距离系统发育树。

ComPhy: prokaryotic composite distance phylogenies inferred from whole-genome gene sets.

作者信息

Lin Guan Ning, Cai Zhipeng, Lin Guohui, Chakraborty Sounak, Xu Dong

机构信息

Digital Biology Laboratory, Informatics Institute, Computer Science Department and Christopher S, Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA.

出版信息

BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S5. doi: 10.1186/1471-2105-10-S1-S5.

DOI:10.1186/1471-2105-10-S1-S5
PMID:19208152
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2648732/
Abstract

BACKGROUND

With the increasing availability of whole genome sequences, it is becoming more and more important to use complete genome sequences for inferring species phylogenies. We developed a new tool ComPhy, 'Composite Distance Phylogeny', based on a composite distance matrix calculated from the comparison of complete gene sets between genome pairs to produce a prokaryotic phylogeny.

RESULTS

The composite distance between two genomes is defined by three components: Gene Dispersion Distance (GDD), Genome Breakpoint Distance (GBD) and Gene Content Distance (GCD). GDD quantifies the dispersion of orthologous genes along the genomic coordinates from one genome to another; GBD measures the shared breakpoints between two genomes; GCD measures the level of shared orthologs between two genomes. The phylogenetic tree is constructed from the composite distance matrix using a neighbor joining method. We tested our method on 9 datasets from 398 completely sequenced prokaryotic genomes. We have achieved above 90% agreement in quartet topologies between the tree created by our method and the tree from the Bergey's taxonomy. In comparison to several other phylogenetic analysis methods, our method showed consistently better performance.

CONCLUSION

ComPhy is a fast and robust tool for genome-wide inference of evolutionary relationship among genomes. It can be downloaded from http://digbio.missouri.edu/ComPhy.

摘要

背景

随着全基因组序列的可得性不断增加,使用完整基因组序列来推断物种系统发育变得越来越重要。我们开发了一种新工具ComPhy,即“复合距离系统发育”,它基于通过比较基因组对之间的完整基因集计算出的复合距离矩阵来构建原核生物系统发育。

结果

两个基因组之间的复合距离由三个部分定义:基因分散距离(GDD)、基因组断点距离(GBD)和基因含量距离(GCD)。GDD量化直系同源基因沿着基因组坐标从一个基因组到另一个基因组的分散情况;GBD测量两个基因组之间共享的断点;GCD测量两个基因组之间共享直系同源基因的水平。系统发育树使用邻接法从复合距离矩阵构建而成。我们在来自398个完全测序的原核生物基因组的9个数据集上测试了我们的方法。我们的方法构建的树与基于伯杰氏分类法构建的树在四重奏拓扑结构上的一致性达到了90%以上。与其他几种系统发育分析方法相比,我们的方法表现出持续更好的性能。

结论

ComPhy是一种用于全基因组推断基因组间进化关系的快速且强大的工具。它可从http://digbio.missouri.edu/ComPhy下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa34/2648732/ed37f0e1d52d/1471-2105-10-S1-S5-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa34/2648732/fe618e67c1a1/1471-2105-10-S1-S5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa34/2648732/7bcebc38802b/1471-2105-10-S1-S5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa34/2648732/8758938ab802/1471-2105-10-S1-S5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa34/2648732/a8dd81377b80/1471-2105-10-S1-S5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa34/2648732/ed37f0e1d52d/1471-2105-10-S1-S5-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa34/2648732/fe618e67c1a1/1471-2105-10-S1-S5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa34/2648732/7bcebc38802b/1471-2105-10-S1-S5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa34/2648732/8758938ab802/1471-2105-10-S1-S5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa34/2648732/a8dd81377b80/1471-2105-10-S1-S5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa34/2648732/ed37f0e1d52d/1471-2105-10-S1-S5-5.jpg

相似文献

1
ComPhy: prokaryotic composite distance phylogenies inferred from whole-genome gene sets.ComPhy:从全基因组基因集推断的原核生物复合距离系统发育树。
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S5. doi: 10.1186/1471-2105-10-S1-S5.
2
Prokaryotic phylogenies inferred from whole-genome sequence and annotation data.基于全基因组序列和注释数据推断的原核生物系统发育。
Biomed Res Int. 2013;2013:409062. doi: 10.1155/2013/409062. Epub 2013 Aug 29.
3
BPhyOG: an interactive server for genome-wide inference of bacterial phylogenies based on overlapping genes.BPhyOG:一个基于重叠基因进行全基因组细菌系统发育推断的交互式服务器。
BMC Bioinformatics. 2007 Jul 25;8:266. doi: 10.1186/1471-2105-8-266.
4
Genome BLAST distance phylogenies inferred from whole plastid and whole mitochondrion genome sequences.基于整个质体和整个线粒体基因组序列推断的基因组BLAST距离系统发育树。
BMC Bioinformatics. 2006 Jul 19;7:350. doi: 10.1186/1471-2105-7-350.
5
Genome trees constructed using five different approaches suggest new major bacterial clades.使用五种不同方法构建的基因组树表明了新的主要细菌进化枝。
BMC Evol Biol. 2001 Oct 20;1:8. doi: 10.1186/1471-2148-1-8.
6
Polytomy identification in microbial phylogenetic reconstruction.微生物系统发育重建中的多歧分类识别
BMC Syst Biol. 2011;5 Suppl 3(Suppl 3):S2. doi: 10.1186/1752-0509-5-S3-S2. Epub 2011 Dec 23.
7
Whole-genome prokaryotic phylogeny.全基因组原核生物系统发育学。
Bioinformatics. 2005 May 15;21(10):2329-35. doi: 10.1093/bioinformatics/bth324. Epub 2004 May 27.
8
Roundup: a multi-genome repository of orthologs and evolutionary distances.综述:一个直系同源基因和进化距离的多基因组储存库。
Bioinformatics. 2006 Aug 15;22(16):2044-6. doi: 10.1093/bioinformatics/btl286. Epub 2006 Jun 15.
9
An emerging phylogenetic core of Archaea: phylogenies of transcription and translation machineries converge following addition of new genome sequences.古菌一个新出现的系统发育核心:随着新基因组序列的增加,转录和翻译机制的系统发育趋同。
BMC Evol Biol. 2005 Jun 2;5:36. doi: 10.1186/1471-2148-5-36.
10
Algorithms for computing parsimonious evolutionary scenarios for genome evolution, the last universal common ancestor and dominance of horizontal gene transfer in the evolution of prokaryotes.用于计算基因组进化简约进化情景、最后共同祖先以及原核生物进化中水平基因转移主导地位的算法。
BMC Evol Biol. 2003 Jan 6;3:2. doi: 10.1186/1471-2148-3-2.

引用本文的文献

1
Imputing missing distances in molecular phylogenetics.在分子系统发育学中估算缺失距离
PeerJ. 2018 Jul 24;6:e5321. doi: 10.7717/peerj.5321. eCollection 2018.
2
Theoretical Foundation of the RelTime Method for Estimating Divergence Times from Variable Evolutionary Rates.RelTime 方法估计具有可变进化率的分歧时间的理论基础。
Mol Biol Evol. 2018 Jul 1;35(7):1770-1782. doi: 10.1093/molbev/msy044.
3
DIME: a novel framework for de novo metagenomic sequence assembly.DIME:一种用于从头宏基因组序列组装的新型框架。

本文引用的文献

1
Evidence of a chimeric genome in the cyanobacterial ancestor of plastids.质体蓝细菌祖先中嵌合基因组的证据。
BMC Evol Biol. 2008 Apr 23;8:117. doi: 10.1186/1471-2148-8-117.
2
Metagenomic analysis of a freshwater toxic cyanobacteria bloom.淡水有毒蓝藻水华的宏基因组分析
FEMS Microbiol Ecol. 2008 Apr;64(1):9-27. doi: 10.1111/j.1574-6941.2008.00448.x. Epub 2008 Mar 4.
3
Constraints on anaerobic respiration in the hyperthermophilic Archaea Pyrobaculum islandicum and Pyrobaculum aerophilum.嗜热古菌冰岛嗜火菌和嗜氧嗜火菌中无氧呼吸的限制因素
J Comput Biol. 2015 Feb;22(2):159-77. doi: 10.1089/cmb.2014.0251.
4
Prokaryotic phylogenies inferred from whole-genome sequence and annotation data.基于全基因组序列和注释数据推断的原核生物系统发育。
Biomed Res Int. 2013;2013:409062. doi: 10.1155/2013/409062. Epub 2013 Aug 29.
5
Polytomy identification in microbial phylogenetic reconstruction.微生物系统发育重建中的多歧分类识别
BMC Syst Biol. 2011;5 Suppl 3(Suppl 3):S2. doi: 10.1186/1752-0509-5-S3-S2. Epub 2011 Dec 23.
6
A protein domain co-occurrence network approach for predicting protein function and inferring species phylogeny.一种基于蛋白质结构域共现网络的蛋白质功能预测和物种进化关系推断方法。
PLoS One. 2011 Mar 24;6(3):e17906. doi: 10.1371/journal.pone.0017906.
7
GO4genome: a prokaryotic phylogeny based on genome organization.GO4genome:一种基于基因组组织的原核生物系统发育。
J Mol Evol. 2009 May;68(5):550-62. doi: 10.1007/s00239-009-9233-6. Epub 2009 May 13.
Appl Environ Microbiol. 2008 Jan;74(2):396-402. doi: 10.1128/AEM.02033-07. Epub 2007 Nov 26.
4
Prokaryote phylogeny meets taxonomy: an exhaustive comparison of composition vector trees with systematic bacteriology.原核生物系统发生学与分类学的结合:成分向量树与系统细菌学的详尽比较
Sci China C Life Sci. 2007 Oct;50(5):587-99. doi: 10.1007/s11427-007-0084-3.
5
BPhyOG: an interactive server for genome-wide inference of bacterial phylogenies based on overlapping genes.BPhyOG:一个基于重叠基因进行全基因组细菌系统发育推断的交互式服务器。
BMC Bioinformatics. 2007 Jul 25;8:266. doi: 10.1186/1471-2105-8-266.
6
The human phylome.人类种系发生组
Genome Biol. 2007;8(6):R109. doi: 10.1186/gb-2007-8-6-r109.
7
Nucleotide composition string selection in HIV-1 subtyping using whole genomes.使用全基因组进行HIV-1亚型分型中的核苷酸组成字符串选择
Bioinformatics. 2007 Jul 15;23(14):1744-52. doi: 10.1093/bioinformatics/btm248. Epub 2007 May 11.
8
Transposon-aided capture (TRACA) of plasmids resident in the human gut mobile metagenome.人类肠道移动宏基因组中质粒的转座子辅助捕获(TRACA)
Nat Methods. 2007 Jan;4(1):55-61. doi: 10.1038/nmeth964. Epub 2006 Nov 26.
9
Phylogenetic position of Salinibacter ruber based on concatenated protein alignments.基于串联蛋白质比对的嗜盐红菌的系统发育位置。
Syst Appl Microbiol. 2007 Apr;30(3):171-9. doi: 10.1016/j.syapm.2006.07.001. Epub 2006 Sep 12.
10
Toward automatic reconstruction of a highly resolved tree of life.迈向高分辨率生命树的自动重建。
Science. 2006 Mar 3;311(5765):1283-7. doi: 10.1126/science.1123061.