A practical guide to phylogenetics for nonexperts.

Author information

O'Halloran Damien

Affiliation

Department of Biological Sciences and Institute for Neuroscience, The George Washington University.

Publication information

J Vis Exp. 2014 Feb 5;(84):e50975. doi: 10.3791/50975.

Abstract

Researchers across highly diverse fields are applying phylogenetics to their research questions, yet many are new to the topic, which brings inherent difficulties. Here we compile a practical introduction to phylogenetics for nonexperts. We outline, step by step, a pipeline for generating reliable phylogenies from gene sequence datasets. We begin with a user guide to similarity search tools, accessed through online interfaces as well as local executables. Next, we explore programs for generating multiple sequence alignments, followed by protocols for using software to determine best-fit models of evolution. We then outline protocols for reconstructing phylogenetic relationships under maximum likelihood and Bayesian criteria, and finally describe tools for visualizing phylogenetic trees. Although this is by no means an exhaustive description of phylogenetic approaches, it gives the reader practical starting information on key software applications commonly used by phylogeneticists. We intend this article to serve as a practical training tool for researchers embarking on phylogenetic studies and as an educational resource that can be incorporated into a classroom or teaching lab.
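The model-selection step mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the article: it assumes that best-fit models are ranked with the standard information criteria, AIC and BIC, and every log-likelihood below is a made-up value for illustration only. The names `ModelFit`, `aic`, `bic`, and `best_fit` are hypothetical helpers. Parameter counts cover only the substitution model. Branch lengths would add the same constant to every model scored on the same tree, so they do not change the ranking.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelFit:
    """One candidate substitution model and its fit to an alignment."""
    name: str
    log_likelihood: float  # maximized ln L (hypothetical values below)
    n_params: int          # free substitution-model parameters only


def aic(fit: ModelFit) -> float:
    """Akaike information criterion: 2k - 2 ln L (lower is better)."""
    return 2 * fit.n_params - 2 * fit.log_likelihood


def bic(fit: ModelFit, n_sites: int) -> float:
    """Bayesian information criterion: k ln(n) - 2 ln L (lower is better)."""
    return fit.n_params * math.log(n_sites) - 2 * fit.log_likelihood


def best_fit(fits, score):
    """Return the candidate with the lowest score under the given criterion."""
    return min(fits, key=score)


# Illustrative numbers only; they do not come from any real dataset.
fits = [
    ModelFit("JC69", -5230.4, 0),   # equal rates, equal base frequencies
    ModelFit("HKY85", -5105.9, 4),  # kappa + 3 free base frequencies
    ModelFit("GTR", -5101.2, 8),    # 5 free rates + 3 free base frequencies
]
N_SITES = 1000  # assumed alignment length

if __name__ == "__main__":
    for fit in fits:
        print(f"{fit.name:6s} AIC={aic(fit):.1f} BIC={bic(fit, N_SITES):.1f}")
    print("Best by AIC:", best_fit(fits, aic).name)
    print("Best by BIC:", best_fit(fits, lambda f: bic(f, N_SITES)).name)
```

With these made-up numbers AIC favors GTR while BIC, which penalizes extra parameters more heavily at this alignment length, favors HKY85. The two criteria can disagree, so it helps to report which one was used.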
