Suppr超能文献

XS:一款FASTQ读取模拟器。

XS: a FASTQ read simulator.

作者信息

Pratas Diogo, Pinho Armando J, Rodrigues João M O S

机构信息

Signal Processing Lab, IEETA/DETI University of Aveiro, Aveiro 3810-193, Portugal.

出版信息

BMC Res Notes. 2014 Jan 16;7:40. doi: 10.1186/1756-0500-7-40.

Abstract

BACKGROUND

The emerging next-generation sequencing (NGS) is bringing, besides the natural huge amounts of data, an avalanche of new specialized tools (for analysis, compression, alignment, among others) and large public and private network infrastructures. Therefore, a direct necessity of specific simulation tools for testing and benchmarking is rising, such as a flexible and portable FASTQ read simulator, without the need of a reference sequence, yet correctly prepared for producing approximately the same characteristics as real data.

FINDINGS

We present XS, a skilled FASTQ read simulation tool, flexible, portable (does not need a reference sequence) and tunable in terms of sequence complexity. It has several running modes, depending on the time and memory available, and is aimed at testing computing infrastructures, namely cloud computing of large-scale projects, and testing FASTQ compression algorithms. Moreover, XS offers the possibility of simulating the three main FASTQ components individually (headers, DNA sequences and quality-scores).

CONCLUSIONS

XS provides an efficient and convenient method for fast simulation of FASTQ files, such as those from Ion Torrent (currently uncovered by other simulators), Roche-454, Illumina and ABI-SOLiD sequencing machines. This tool is publicly available at http://bioinformatics.ua.pt/software/xs/.

摘要

背景

新兴的下一代测序(NGS)技术除了带来海量的自然数据外,还催生了大量新的专业工具(用于分析、压缩、比对等)以及大型公共和私人网络基础设施。因此,对用于测试和基准测试的特定模拟工具的直接需求日益增加,例如一种灵活且便携的FASTQ读取模拟器,它无需参考序列,但能正确模拟出与真实数据大致相同的特征。

研究结果

我们展示了XS,这是一款技术娴熟的FASTQ读取模拟工具,具有灵活性、便携性(无需参考序列)且在序列复杂度方面可调。它有多种运行模式,取决于可用的时间和内存,旨在测试计算基础设施,即大规模项目的云计算,并测试FASTQ压缩算法。此外,XS还提供了单独模拟FASTQ三个主要组件(头部、DNA序列和质量得分)的可能性。

结论

XS为快速模拟FASTQ文件提供了一种高效便捷的方法,例如来自Ion Torrent(目前其他模拟器未涉及)、Roche-454、Illumina和ABI-SOLiD测序仪的文件。该工具可在http://bioinformatics.ua.pt/software/xs/上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1415/3927261/b9f0ec5fa3c0/1756-0500-7-40-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验