Suppr超能文献

大规模测序环境中的自动化序列预处理

Automated sequence preprocessing in a large-scale sequencing environment.

作者信息

Wendl M C, Dear S, Hodgson D, Hillier L

机构信息

Genome Sequencing Center, Washington University, St. Louis, Missouri 63108 USA.

出版信息

Genome Res. 1998 Sep;8(9):975-84. doi: 10.1101/gr.8.9.975.

Abstract

A software system for transforming fragments from four-color fluorescence-based gel electrophoresis experiments into assembled sequence is described. It has been developed for large-scale processing of all trace data, including shotgun and finishing reads, regardless of clone origin. Design considerations are discussed in detail, as are programming implementation and graphic tools. The importance of input validation, record tracking, and use of base quality values is emphasized. Several quality analysis metrics are proposed and applied to sample results from recently sequenced clones. Such quantities prove to be a valuable aid in evaluating modifications of sequencing protocol. The system is in full production use at both the Genome Sequencing Center and the Sanger Centre, for which combined weekly production is approximately 100, 000 sequencing reads per week.

摘要

本文描述了一种软件系统,该系统可将基于四色荧光的凝胶电泳实验中的片段转化为组装序列。它是为大规模处理所有微量数据而开发的,包括鸟枪法测序和完成测序读段,无论克隆来源如何。详细讨论了设计考量、编程实现和图形工具。强调了输入验证、记录跟踪以及碱基质量值使用的重要性。提出了几种质量分析指标,并将其应用于最近测序克隆的样本结果。这些指标被证明是评估测序方案修改的宝贵辅助手段。该系统已在基因组测序中心和桑格中心全面投入生产使用,两者每周的联合产量约为每周100,000个测序读段。

相似文献

3
Automated finishing with autofinish.使用自动完成功能进行自动整理。
Genome Res. 2001 Apr;11(4):614-25. doi: 10.1101/gr.171401.
7
Techview: DNA sequencing. Sequencing the genome, fast.
Science. 1999 Mar 19;283(5409):1867-9. doi: 10.1126/science.283.5409.1867.
9
Sequence assembly and finishing methods.序列组装与完成方法。
Methods Biochem Anal. 2001;43:303-22. doi: 10.1002/0471223921.ch13.

引用本文的文献

9
The Ensembl core software libraries.Ensembl核心软件库。
Genome Res. 2004 May;14(5):929-33. doi: 10.1101/gr.1857204.

本文引用的文献

6
Experiment files and their application during large-scale sequencing projects.
DNA Seq. 1996;6(2):109-17. doi: 10.3109/10425179609010197.
7
The Staden sequence analysis package.Staden序列分析软件包。
Mol Biotechnol. 1996 Jun;5(3):233-41. doi: 10.1007/BF02900361.
9
A new DNA sequence assembly program.一个新的DNA序列组装程序。
Nucleic Acids Res. 1995 Dec 25;23(24):4992-9. doi: 10.1093/nar/23.24.4992.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验