Department of Microbiology, Perelman School of Medicine, University of Pennsylvania.
Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA.
Bioinformatics. 2020 Jun 1;36(11):3607-3609. doi: 10.1093/bioinformatics/btaa167.
High-throughput sequencing is a powerful technique for addressing biological questions. Grabseqs streamlines access to publicly available metagenomic data by providing a single, easy-to-use interface to download data and metadata from multiple repositories, including the Sequence Read Archive, the Metagenomics Rapid Annotation through Subsystems Technology server and iMicrobe. Users can download data and metadata in a standardized format from any number of samples or projects from a given repository with a single grabseqs command.
Grabseqs is an open-source tool implemented in Python and licensed under the MIT license. The source code is freely available at https://github.com/louiejtaylor/grabseqs, the Python Package Index and Anaconda Cloud repository.
bushman@pennmedicine.upenn.edu.
Supplementary data are available at Bioinformatics online.
高通量测序是解决生物学问题的强大技术。Grabseqs 通过提供一个单一的、易于使用的界面,从多个存储库(包括 Sequence Read Archive、Metagenomics Rapid Annotation through Subsystems Technology 服务器和 iMicrobe)下载数据和元数据,简化了对公共宏基因组数据的访问。用户可以使用单个 grabseqs 命令从给定存储库中的任意数量的样本或项目以标准化格式下载数据和元数据。
Grabseqs 是一个用 Python 实现的开源工具,根据 MIT 许可证获得许可。源代码可在以下网址免费获得:https://github.com/louiejtaylor/grabseqs、Python 包索引和 Anaconda Cloud 存储库。
bushman@pennmedicine.upenn.edu。
补充数据可在 Bioinformatics 在线获得。