• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于整个质体和整个线粒体基因组序列推断的基因组BLAST距离系统发育树。

Genome BLAST distance phylogenies inferred from whole plastid and whole mitochondrion genome sequences.

作者信息

Auch Alexander F, Henz Stefan R, Holland Barbara R, Göker Markus

机构信息

Center for Bioinformatics ZBIT, Sand 14, Tübingen, University of Tübingen, Germany.

出版信息

BMC Bioinformatics. 2006 Jul 19;7:350. doi: 10.1186/1471-2105-7-350.

DOI:10.1186/1471-2105-7-350
PMID:16854218
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1564419/
Abstract

BACKGROUND

Phylogenetic methods which do not rely on multiple sequence alignments are important tools in inferring trees directly from completely sequenced genomes. Here, we extend the recently described Genome BLAST Distance Phylogeny (GBDP) strategy to compute phylogenetic trees from all completely sequenced plastid genomes currently available and from a selection of mitochondrial genomes representing the major eukaryotic lineages. BLASTN, TBLASTX, or combinations of both are used to locate high-scoring segment pairs (HSPs) between two sequences from which pairwise similarities and distances are computed in different ways resulting in a total of 96 GBDP variants. The suitability of these distance formulae for phylogeny reconstruction is directly estimated by computing a recently described measure of "treelikeness", the so-called delta value, from the respective distance matrices. Additionally, we compare the trees inferred from these matrices using UPGMA, NJ, BIONJ, FastME, or STC, respectively, with the NCBI taxonomy tree of the taxa under study.

RESULTS

Our results indicate that, at this taxonomic level, plastid genomes are much more valuable for inferring phylogenies than are mitochondrial genomes, and that distances based on breakpoints are of little use. Distances based on the proportion of "matched" HSP length to average genome length were best for tree estimation. Additionally we found that using TBLASTX instead of BLASTN and, particularly, combining TBLASTX and BLASTN leads to a small but significant increase in accuracy. Other factors do not significantly affect the phylogenetic outcome. The BIONJ algorithm results in phylogenies most in accordance with the current NCBI taxonomy, with NJ and FastME performing insignificantly worse, and STC performing as well if applied to high quality distance matrices. delta values are found to be a reliable predictor of phylogenetic accuracy.

CONCLUSION

Using the most treelike distance matrices, as judged by their delta values, distance methods are able to recover all major plant lineages, and are more in accordance with Apicomplexa organelles being derived from "green" plastids than from plastids of the "red" type. GBDP-like methods can be used to reliably infer phylogenies from different kinds of genomic data. A framework is established to further develop and improve such methods. delta values are a topology-independent tool of general use for the development and assessment of distance methods for phylogenetic inference.

摘要

背景

不依赖多序列比对的系统发育方法是直接从全测序基因组推断树的重要工具。在此,我们扩展了最近描述的基因组BLAST距离系统发育(GBDP)策略,以从目前所有可用的全测序质体基因组以及代表主要真核生物谱系的线粒体基因组中计算系统发育树。使用BLASTN、TBLASTX或两者的组合来定位两个序列之间的高分片段对(HSP),并以不同方式计算成对相似性和距离,从而产生总共96种GBDP变体。通过从各自的距离矩阵计算最近描述的“树状度”度量(即所谓的δ值),直接估计这些距离公式对系统发育重建的适用性。此外,我们分别使用UPGMA、NJ、BIONJ、FastME或STC将从这些矩阵推断出的树与所研究分类单元的NCBI分类树进行比较。

结果

我们的结果表明,在此分类水平上,质体基因组在推断系统发育方面比线粒体基因组更有价值,并且基于断点的距离用处不大。基于“匹配”HSP长度与平均基因组长度比例的距离最适合用于树估计。此外,我们发现使用TBLASTX代替BLASTN,特别是将TBLASTX和BLASTN结合使用,会使准确性有小幅但显著的提高。其他因素对系统发育结果没有显著影响。BIONJ算法得出的系统发育树最符合当前的NCBI分类法,NJ和FastME的表现稍差但不显著,而STC应用于高质量距离矩阵时表现也不错。发现δ值是系统发育准确性的可靠预测指标。

结论

使用根据其δ值判断为最具树状性的距离矩阵,距离方法能够恢复所有主要植物谱系,并且更符合顶复门细胞器源自“绿色”质体而非“红色”质体的观点。类似GBDP的方法可用于可靠地从不同类型的基因组数据推断系统发育。建立了一个框架以进一步开发和改进此类方法。δ值是一种与拓扑无关的通用工具,用于系统发育推断距离方法的开发和评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19e0/1564419/cf2855a3c499/1471-2105-7-350-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19e0/1564419/b322075cb998/1471-2105-7-350-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19e0/1564419/5cc25b377d07/1471-2105-7-350-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19e0/1564419/bbd9757838d5/1471-2105-7-350-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19e0/1564419/c9e340b49179/1471-2105-7-350-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19e0/1564419/cf2855a3c499/1471-2105-7-350-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19e0/1564419/b322075cb998/1471-2105-7-350-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19e0/1564419/5cc25b377d07/1471-2105-7-350-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19e0/1564419/bbd9757838d5/1471-2105-7-350-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19e0/1564419/c9e340b49179/1471-2105-7-350-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/19e0/1564419/cf2855a3c499/1471-2105-7-350-5.jpg

相似文献

1
Genome BLAST distance phylogenies inferred from whole plastid and whole mitochondrion genome sequences.基于整个质体和整个线粒体基因组序列推断的基因组BLAST距离系统发育树。
BMC Bioinformatics. 2006 Jul 19;7:350. doi: 10.1186/1471-2105-7-350.
2
Fast NJ-like algorithms to deal with incomplete distance matrices.用于处理不完整距离矩阵的类似快速NJ的算法。
BMC Bioinformatics. 2008 Mar 26;9:166. doi: 10.1186/1471-2105-9-166.
3
Whole-genome prokaryotic phylogeny.全基因组原核生物系统发育学。
Bioinformatics. 2005 May 15;21(10):2329-35. doi: 10.1093/bioinformatics/bth324. Epub 2004 May 27.
4
Incongruence between gene trees and species trees and phylogenetic signal variation in plastid genes.质体基因中基因树与种系树之间的不一致性和系统发育信号变异。
Mol Phylogenet Evol. 2019 Sep;138:219-232. doi: 10.1016/j.ympev.2019.05.022. Epub 2019 May 28.
5
PTreeRec: Phylogenetic Tree Reconstruction based on genome BLAST distance.PTreeRec:基于基因组BLAST距离的系统发育树重建
Comput Biol Chem. 2006 Aug;30(4):300-2. doi: 10.1016/j.compbiolchem.2006.04.003. Epub 2006 Jun 21.
6
ComPhy: prokaryotic composite distance phylogenies inferred from whole-genome gene sets.ComPhy:从全基因组基因集推断的原核生物复合距离系统发育树。
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S5. doi: 10.1186/1471-2105-10-S1-S5.
7
Phylogeny of plastids based on cladistic analysis of gene loss inferred from complete plastid genome sequences.基于从完整质体基因组序列推断出的基因丢失的分支分析的质体系统发育。
J Mol Evol. 2003 Oct;57(4):377-82. doi: 10.1007/s00239-003-2486-6.
8
Plastid phylogenomics of the Gynoxoid group (Senecioneae, Asteraceae) highlights the importance of motif-based sequence alignment amid low genetic distances.千里光族(菊科)的质体系统发育基因组学凸显了在低遗传距离下基于基序的序列比对的重要性。
Am J Bot. 2021 Nov;108(11):2235-2256. doi: 10.1002/ajb2.1775. Epub 2021 Nov 17.
9
Entire plastid phylogeny of the carrot genus (, Apiaceae): Concordance with nuclear data and mitochondrial and nuclear DNA insertions to the plastid.胡萝卜属(伞形科)的完整质体系统发育:与核数据以及质体中的线粒体和核DNA插入的一致性。
Am J Bot. 2017 Feb;104(2):296-312. doi: 10.3732/ajb.1600415. Epub 2017 Feb 15.
10
On the quality of tree-based protein classification.论基于树的蛋白质分类的质量。
Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.

引用本文的文献

1
Identification and Characterization of New Strains from Common Carp (), Potentially Possessing Probiotic Properties and Plastic Biodegradation Capabilities.从鲤鱼中鉴定和表征新菌株,这些菌株可能具有益生菌特性和塑料生物降解能力。
Int J Mol Sci. 2025 Jan 28;26(3):1119. doi: 10.3390/ijms26031119.
2
Investigating the resistome of haemolytic bacteria in Arctic soils.研究北极土壤中溶血性细菌的耐药组。
Environ Microbiol Rep. 2024 Oct;16(5):e70028. doi: 10.1111/1758-2229.70028.
3
De novo assembly and comparative genome analysis for polyhydroxyalkanoates-producing Bacillus sp. BNPI-92 strain.

本文引用的文献

1
Fixed Character States and the Optimization of Molecular Sequence Data.固定性状状态与分子序列数据的优化
Cladistics. 1999 Dec;15(4):379-385. doi: 10.1111/j.1096-0031.1999.tb00274.x.
2
CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP.系统发育树的置信区间:一种使用自展法的方法。
Evolution. 1985 Jul;39(4):783-791. doi: 10.1111/j.1558-5646.1985.tb00420.x.
3
A revision of Bremia graminicola.禾生盘梗霉的修订
产聚羟基脂肪酸酯芽孢杆菌BNPI-92菌株的从头组装与比较基因组分析
J Genet Eng Biotechnol. 2023 Nov 22;21(1):132. doi: 10.1186/s43141-023-00578-7.
4
Advanced prokaryotic systematics: the modern face of an ancient science.高级原核生物系统学:一门古老科学的现代面貌。
New Microbes New Infect. 2022 Nov 11;49-50:101036. doi: 10.1016/j.nmni.2022.101036. eCollection 2022 Nov-Dec.
5
Draft Genomic Sequences of Four Pseudomonas spp. and a sp. from Cranberry Stem Galls.四种假单胞菌属细菌及一种来自蔓越莓茎瘿瘤的细菌的基因组序列草图
Microbiol Resour Announc. 2022 Mar 17;11(3):e0099921. doi: 10.1128/mra.00999-21. Epub 2022 Mar 1.
6
Whole Genome Sequencing Based Taxonomic Classification, and Comparative Genomic Analysis of Potentially Human Pathogenic spp. Isolated from Chlorinated Wastewater in the North West Province, South Africa.基于全基因组测序的分类学分类以及从南非西北省氯化废水中分离出的潜在人类致病物种的比较基因组分析。
Microorganisms. 2021 Sep 10;9(9):1928. doi: 10.3390/microorganisms9091928.
7
Draft Genome Sequences of Pseudomonas spp. Isolated from Berry Surfaces in Commercial Cranberry Bogs in Massachusetts, USA.从美国马萨诸塞州商业蔓越莓沼泽地的浆果表面分离出的假单胞菌属细菌的基因组序列草图
Microbiol Resour Announc. 2021 Jul 8;10(27):e0020421. doi: 10.1128/MRA.00204-21.
8
Machine learning based imputation techniques for estimating phylogenetic trees from incomplete distance matrices.基于机器学习的填补技术,用于从不完全距离矩阵估计系统发育树。
BMC Genomics. 2020 Jul 20;21(1):497. doi: 10.1186/s12864-020-06892-5.
9
Analysis of 1,000+ Type-Strain Genomes Substantially Improves Taxonomic Classification of .对1000多个模式菌株基因组的分析极大地改进了……的分类学分类。 (注:原文结尾不完整,翻译只能到这里)
Front Microbiol. 2020 Apr 7;11:468. doi: 10.3389/fmicb.2020.00468. eCollection 2020.
10
Analysis of 1,000 Type-Strain Genomes Improves Taxonomic Classification of .对1000个模式菌株基因组的分析改进了……的分类学分类。 (原文此处不完整)
Front Microbiol. 2019 Sep 23;10:2083. doi: 10.3389/fmicb.2019.02083. eCollection 2019.
Mycol Res. 2006 Jun;110(Pt 6):646-56. doi: 10.1016/j.mycres.2006.04.001. Epub 2006 Jun 12.
4
The new higher level classification of eukaryotes with emphasis on the taxonomy of protists.强调原生生物分类学的真核生物新的高级别分类。
J Eukaryot Microbiol. 2005 Sep-Oct;52(5):399-451. doi: 10.1111/j.1550-7408.2005.00053.x.
5
Application of phylogenetic networks in evolutionary studies.系统发育网络在进化研究中的应用。
Mol Biol Evol. 2006 Feb;23(2):254-67. doi: 10.1093/molbev/msj030. Epub 2005 Oct 12.
6
The net of life: reconstructing the microbial phylogenetic network.生命之网:重建微生物系统发育网络
Genome Res. 2005 Jul;15(7):954-9. doi: 10.1101/gr.3666505. Epub 2005 Jun 17.
7
Identifying the basal angiosperm node in chloroplast genome phylogenies: sampling one's way out of the Felsenstein zone.在叶绿体基因组系统发育中识别基部被子植物节点:通过抽样走出费尔斯滕森区域。
Mol Biol Evol. 2005 Oct;22(10):1948-63. doi: 10.1093/molbev/msi191. Epub 2005 Jun 8.
8
Analysis of Acorus calamus chloroplast genome and its phylogenetic implications.菖蒲叶绿体基因组分析及其系统发育意义。
Mol Biol Evol. 2005 Sep;22(9):1813-22. doi: 10.1093/molbev/msi173. Epub 2005 Jun 1.
9
The chloroplast genome sequence of the green alga Pseudendoclonium akinetum (Ulvophyceae) reveals unusual structural features and new insights into the branching order of chlorophyte lineages.绿藻伪枝藻(绿藻纲)的叶绿体基因组序列揭示了不寻常的结构特征以及对绿藻谱系分支顺序的新见解。
Mol Biol Evol. 2005 Sep;22(9):1903-18. doi: 10.1093/molbev/msi182. Epub 2005 Jun 1.
10
Chlorophyll c-containing plastid relationships based on analyses of a multigene data set with all four chromalveolate lineages.基于对包含所有四个色藻谱系的多基因数据集的分析,含叶绿素c的质体关系。
Mol Biol Evol. 2005 Sep;22(9):1772-82. doi: 10.1093/molbev/msi172. Epub 2005 May 25.