Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria.
Vienna Graduate School of Population Genetics, Wien, Austria.
Mol Ecol Resour. 2018 May;18(3):676-680. doi: 10.1111/1755-0998.12741. Epub 2017 Dec 8.
Whole-genome sequencing has become a standard research tool in many disciplines, including molecular ecology, but rapid technological advances combined with several competing platforms have resulted in a confusing diversity of formats. This lack of standard formats causes several problems, such as undocumented preprocessing steps or the loss of information in downstream software tools that do not account for the specifics of the different available formats. ReadTools is an open-source Java toolkit designed to standardize and preprocess read data from different platforms. It handles FASTQ- and SAM-formatted inputs while dealing with platform-specific peculiarities, and provides standard SAM-compliant output. The code and executable are available at https://github.com/magicDGS/ReadTools.