Suppr超能文献

基因组序列数据:管理、存储与可视化

Genome sequence data: management, storage, and visualization.

作者信息

Batley Jacqueline, Edwards David

机构信息

Australian Centre for Plant Functional Genomics, School of Land, Crop and Food Sciences and ARC Centre of Excellence for Integrative Legume Research, University of Queensland, Brisbane, QLD 4072, Australia.

出版信息

Biotechniques. 2009 Apr;46(5):333-4, 336. doi: 10.2144/000113134.

Abstract

Over the last few years there has been a revolution in DNA sequencing technology that has brought down the cost of DNA sequencing and made the sequencing of an increasing number of genomes both feasible and cost effective. There has also been a dramatic shift in the type of sequence data being generated, with vast numbers of short reads or pairs of short reads replacing the traditional relatively long reads produced by Sanger sequencing. These changes in data quantity and format have led to a rethinking of sequence data management, storage, and visualization, and provide a challenge for bioinformatics. The vast amount of sequence data that will be generated over the next few years will require a change in what data are stored and how users query the information.

摘要

在过去几年中,DNA测序技术发生了一场变革,降低了DNA测序成本,使得越来越多的基因组测序变得可行且具有成本效益。所生成的序列数据类型也发生了巨大转变,大量短读段或短读段对取代了传统的由桑格测序产生的相对较长的读段。数据量和格式的这些变化引发了对序列数据管理、存储和可视化的重新思考,并给生物信息学带来了挑战。未来几年将生成的海量序列数据将需要改变存储的数据内容以及用户查询信息的方式。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验