Bioinformatics analysis of large-scale viral sequences: from construction of data sets to annotation of a phylogenetic tree.

Affiliation

Department of Biomedical Sciences and Veterinary Public Health, Section of Virology, Swedish University of Agricultural Sciences, Uppsala, Sweden.

Publication information

Virulence. 2013 Jan 1;4(1):97-106. doi: 10.4161/viru.23161.

DOI: 10.4161/viru.23161
PMID: 23314574
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3544756/

Abstract

Due to a significant decrease in the cost of DNA sequencing, the number of sequences submitted to the public databases has dramatically increased in recent years. Efficient analysis of these data sets may lead to a significant understanding of the nature of pathogens such as bacteria, viruses, parasites, etc. However, this has raised questions about the efficacy of currently available algorithms for the study of pathogen evolution and construction of phylogenetic trees. While the advanced algorithms and corresponding programs are being developed, it is crucial to optimize the available ones in order to cope with the current need. The protocol presented in this study is optimized using a number of strategies currently being proposed for handling large-scale DNA sequence data sets, and offers a highly efficacious and accurate method for computing phylogenetic trees with limited computer resources. The protocol may take up to 36 h for construction and annotation of a final tree of about 20,000 sequences.
