Institute of Natural and Applied Sciences, Akdeniz University, Antalya, Turkey.
Department of Agricultural Biotechnology, Faculty of Agriculture, Akdeniz University, Antalya, Turkey.
PLoS One. 2020 Dec 15;15(12):e0243927. doi: 10.1371/journal.pone.0243927. eCollection 2020.
Phylogenetic analyses can provide a wealth of information about the past demography of a population and the level of genetic diversity within and between species. By using special computer programs developed in recent years, large amounts of data have been produced in the molecular genetics area. To analyze these data, powerful new methods based on large computations have been applied in various software packages and programs. But these programs have their own specific input and output formats, and users need to create different input formats for almost every program. R is an open source software environment, and it supports open contribution and modification to its libraries. Furthermore, it is also possible to perform several analyses using a single input file format. In this article, by using the multiple sequences FASTA format file (.fas extension) we demonstrate and share a workflow of how to extract haplotypes and perform phylogenetic analyses and visualizations in R. As an example dataset, we used 120 Bombus terrestris dalmatinus mitochondrial cytochrome b gene (cyt b) sequences (373 bp) collected from eight different beehives in Antalya. This article presents a short guide on how to perform phylogenetic analyses using R and RStudio.
系统发育分析可以提供大量有关过去种群人口统计学和物种内及物种间遗传多样性水平的信息。近年来,通过使用专门开发的计算机程序,在分子遗传学领域产生了大量的数据。为了分析这些数据,已经在各种软件包和程序中应用了基于大量计算的强大新方法。但是,这些程序具有其自己的特定输入和输出格式,用户几乎需要为每个程序创建不同的输入格式。R 是一个开源软件环境,它支持对其库进行开放贡献和修改。此外,还可以使用单个输入文件格式执行多个分析。在本文中,我们通过使用多序列 FASTA 格式文件(.fas 扩展名),展示并共享了如何在 R 中提取单倍型并进行系统发育分析和可视化的工作流程。作为示例数据集,我们使用了从安塔利亚的八个不同蜂箱中收集的 120 个 Bombus terrestris dalmatinus 线粒体细胞色素 b 基因(cyt b)序列(373bp)。本文介绍了如何使用 R 和 RStudio 进行系统发育分析的简短指南。