Batley Jacqueline, Edwards David
Australian Centre for Plant Functional Genomics, School of Land, Crop and Food Sciences and ARC Centre of Excellence for Integrative Legume Research, University of Queensland, Brisbane, QLD 4072, Australia.
Biotechniques. 2009 Apr;46(5):333-4, 336. doi: 10.2144/000113134.
Over the last few years there has been a revolution in DNA sequencing technology that has brought down the cost of DNA sequencing and made the sequencing of an increasing number of genomes both feasible and cost effective. There has also been a dramatic shift in the type of sequence data being generated, with vast numbers of short reads or pairs of short reads replacing the traditional relatively long reads produced by Sanger sequencing. These changes in data quantity and format have led to a rethinking of sequence data management, storage, and visualization, and provide a challenge for bioinformatics. The vast amount of sequence data that will be generated over the next few years will require a change in what data are stored and how users query the information.
在过去几年中,DNA测序技术发生了一场变革,降低了DNA测序成本,使得越来越多的基因组测序变得可行且具有成本效益。所生成的序列数据类型也发生了巨大转变,大量短读段或短读段对取代了传统的由桑格测序产生的相对较长的读段。数据量和格式的这些变化引发了对序列数据管理、存储和可视化的重新思考,并给生物信息学带来了挑战。未来几年将生成的海量序列数据将需要改变存储的数据内容以及用户查询信息的方式。