AMAS: a fast tool for alignment manipulation and computing of summary statistics.

Author information

Borowiec Marek L

Affiliation

Department of Entomology and Nematology, UC Davis, Davis, United States.

Publication information

PeerJ. 2016 Jan 28;4:e1660. doi: 10.7717/peerj.1660. eCollection 2016.

DOI:10.7717/peerj.1660

PMID:26835189

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC4734057/

Abstract

The amount of data used in phylogenetics has grown explosively in recent years, and many phylogenies are now inferred from hundreds or even thousands of loci and many taxa. These modern phylogenomic studies often entail separate analyses of each locus in addition to multiple analyses of subsets of genes or concatenated sequences. Computationally efficient tools for handling thousands of single-locus or large concatenated alignments, and for computing their properties, are needed. Here I present AMAS (Alignment Manipulation And Summary), a tool that can be used either as a stand-alone command-line utility or as a Python package. AMAS works on amino acid and nucleotide alignments and combines sequence manipulation capabilities with a function that calculates basic statistics. The manipulation functions include conversion among popular formats, concatenation, extraction of sites and splitting according to a pre-defined partitioning scheme, creation of replicate data sets, and removal of taxa. The statistics calculated include the number of taxa, alignment length, total count of matrix cells, overall number of undetermined characters, percent of missing data, AT and GC contents (for DNA alignments), count and proportion of variable sites, count and proportion of parsimony informative sites, and counts of all characters relevant for a nucleotide or amino acid alphabet. AMAS is particularly suitable for very large alignments with hundreds of taxa and thousands of loci. It is computationally efficient, utilizes parallel processing, and performs better at concatenation than other popular tools. AMAS is a Python 3 program that relies solely on Python's core modules and needs no additional dependencies. AMAS source code and manual can be downloaded from http://github.com/marekborowiec/AMAS/ under the GNU General Public License.
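To make the statistics and the concatenation step named in the abstract concrete, the following is a minimal sketch in plain Python (standard library only). It is not AMAS code and does not use the AMAS API. The function names `summarize` and `concatenate` are hypothetical, and so are three working assumptions: `-`, `?`, and `N` are treated as undetermined characters; AT and GC contents are taken as proportions of the determined A, C, G, and T characters; and taxa absent from a locus are padded with `?` during concatenation. A site is counted as variable when it shows at least two determined states, and as parsimony informative when at least two states each occur in at least two taxa.

```python
from collections import Counter

# Assumption: characters counted as undetermined in a DNA alignment.
MISSING = set("-?N")


def summarize(alignment):
    """Basic summary statistics for a DNA alignment.

    `alignment` maps taxon name -> aligned sequence of equal length.
    """
    seqs = [s.upper() for s in alignment.values()]
    lengths = {len(s) for s in seqs}
    if len(lengths) != 1:
        raise ValueError("sequences must all have the same length")
    n_taxa, aln_len = len(seqs), lengths.pop()
    cells = n_taxa * aln_len

    counts = Counter("".join(seqs))
    missing = sum(counts[c] for c in MISSING)
    at = counts["A"] + counts["T"]
    gc = counts["G"] + counts["C"]

    variable = informative = 0
    for column in zip(*seqs):  # iterate over alignment sites
        states = Counter(c for c in column if c not in MISSING)
        if len(states) > 1:
            variable += 1
        # Informative: at least two states, each seen in two or more taxa.
        if sum(1 for n in states.values() if n >= 2) >= 2:
            informative += 1

    return {
        "taxa": n_taxa,
        "length": aln_len,
        "cells": cells,
        "missing": missing,
        "missing_percent": 100 * missing / cells if cells else 0.0,
        "at_content": at / (at + gc) if at + gc else 0.0,
        "gc_content": gc / (at + gc) if at + gc else 0.0,
        "variable_sites": variable,
        "parsimony_informative_sites": informative,
    }


def concatenate(alignments):
    """Concatenate non-empty alignments; return (matrix, partitions).

    Partitions are 1-based inclusive (start, end) site ranges per locus.
    """
    taxa = sorted({t for aln in alignments for t in aln})
    pieces = {t: [] for t in taxa}
    partitions, start = [], 1
    for aln in alignments:
        length = len(next(iter(aln.values())))
        for t in taxa:
            # Assumption: a taxon missing from this locus is padded with '?'.
            pieces[t].append(aln.get(t, "?" * length))
        partitions.append((start, start + length - 1))
        start += length
    return {t: "".join(p) for t, p in pieces.items()}, partitions


if __name__ == "__main__":
    locus1 = {"t1": "ACGTAC-A", "t2": "ACGTTCNA", "t3": "ATGTTC?A", "t4": "ATGAAC-A"}
    locus2 = {"t1": "GGG", "t5": "TTT"}
    print(summarize(locus1))
    print(concatenate([locus1, locus2]))
```

Scanning the alignment column by column keeps the variable-site and parsimony-informative counts in a single pass, which is the kind of simple linear work that scales to alignments with many loci. Real tools differ in how they treat ambiguity codes and gaps, so the counts from this sketch may not match AMAS output on the same data.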

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd99/4734057/05eaa71a3550/peerj-04-1660-g001.jpg

Similar articles

AMAS: a fast tool for alignment manipulation and computing of summary statistics.

PeerJ. 2016 Jan 28;4:e1660. doi: 10.7717/peerj.1660. eCollection 2016.

FASconCAT-G: extensive functions for multiple sequence alignment preparations concerning phylogenetic studies.

Front Zool. 2014 Nov 18;11(1):81. doi: 10.1186/s12983-014-0081-x. eCollection 2014.

GET_PHYLOMARKERS, a Software Package to Select Optimal Orthologous Clusters for Phylogenomics and Inferring Pan-Genome Phylogenies, Used for a Critical Geno-Taxonomic Revision of the Genus .

Front Microbiol. 2018 May 1;9:771. doi: 10.3389/fmicb.2018.00771. eCollection 2018.

Do Alignment and Trimming Methods Matter for Phylogenomic (UCE) Analyses?

Syst Biol. 2021 Apr 15;70(3):440-462. doi: 10.1093/sysbio/syaa064.

2matrix: A utility for indel coding and phylogenetic matrix concatenation.

Appl Plant Sci. 2014 Jan 7;2(1). doi: 10.3732/apps.1300083. eCollection 2014 Jan.

PHYLUCE is a software package for the analysis of conserved genomic loci.

Bioinformatics. 2016 Mar 1;32(5):786-8. doi: 10.1093/bioinformatics/btv646. Epub 2015 Nov 2.

SCaFoS: a tool for selection, concatenation and fusion of sequences for phylogenomics.

BMC Evol Biol. 2007 Feb 8;7 Suppl 1(Suppl 1):S2. doi: 10.1186/1471-2148-7-S1-S2.

An Automated Model Annotation System (AMAS) for SBML Models.

bioRxiv. 2023 Jul 21:2023.07.19.549722. doi: 10.1101/2023.07.19.549722.

Transcriptome Ortholog Alignment Sequence Tools (TOAST) for phylogenomic dataset assembly.

BMC Evol Biol. 2020 Mar 30;20(1):41. doi: 10.1186/s12862-020-01603-w.

BuddySuite: Command-Line Toolkits for Manipulating Sequences, Alignments, and Phylogenetic Trees.

Mol Biol Evol. 2017 Jun 1;34(6):1543-1546. doi: 10.1093/molbev/msx089.

Cited by

, a new species of (Rosaceae) from southwest China.

PhytoKeys. 2025 Aug 15;261:175-187. doi: 10.3897/phytokeys.261.152449. eCollection 2025.

Convergent Evolution in Amblyopsid Cavefishes and the Age of Eastern North American Subterranean Ecosystems.

Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf185.

The complete mitochondrion genome of the Hoge's Side-necked turtle Ranacephala hogei (Chelidae), a critically endangered species from South America.

Genet Mol Biol. 2025 Aug 15;48(3):e20240203. doi: 10.1590/1678-4685-GMB-2024-0203. eCollection 2025.

Phylogenetic Insights from a Novel Species Challenge Generic Boundaries in Orthotrichaceae.

Plants (Basel). 2025 Aug 1;14(15):2373. doi: 10.3390/plants14152373.

Phylogenomic insights into and its allies (Campanulaceae): Revisiting generic delimitation and hybridization dynamics.

Plant Divers. 2025 May 27;47(4):576-592. doi: 10.1016/j.pld.2025.05.010. eCollection 2025 Jul.

Evaluating the utility of deep genome skimming for phylogenomic analyses: A case study in the species-rich genus .

Plant Divers. 2025 May 2;47(4):593-603. doi: 10.1016/j.pld.2025.04.006. eCollection 2025 Jul.

Elucidation of the phylogeny of Cucurbitaceae, particularly Trichosanthes, based on plastome data and nuclear single-copy genes.

BMC Plant Biol. 2025 Jul 19;25(1):929. doi: 10.1186/s12870-025-06970-4.

Genomically-selected antifungal Bacillaceae strains improve wheat yield and baking quality.

Appl Microbiol Biotechnol. 2025 Jul 10;109(1):164. doi: 10.1007/s00253-025-13544-9.

Ancient polyploidization events influence the evolution of the ginseng family (Araliaceae).

Front Plant Sci. 2025 Jun 13;16:1595321. doi: 10.3389/fpls.2025.1595321. eCollection 2025.

Avian Lifespan Network Reveals Shared Mechanisms and New Key Players in Animal Longevity.

Aging Cell. 2025 Sep;24(9):e70156. doi: 10.1111/acel.70156. Epub 2025 Jun 28.
