Suppr超能文献

一种用于自动DNA测序数据分析的软件系统。

A software system for data analysis in automated DNA sequencing.

作者信息

Giddings M C, Severin J, Westphall M, Wu J, Smith L M

机构信息

Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA.

出版信息

Genome Res. 1998 Jun;8(6):644-65. doi: 10.1101/gr.8.6.644.

Abstract

Software for gel image analysis and base-calling in fluorescence-based sequencing consisting of two primary programs, BaseFinder and GelImager, is described. BaseFinder is a framework for trace processing, analysis, and base-calling. BaseFinder is highly extensible, allowing the addition of trace analysis and processing modules without recompilation. Powerful scripting capabilities combined with modularity and multilane handling allow the user to customize BaseFinder to virtually any type of trace processing. We have developed an extensive set of data processing and analysis modules for use with the program in fluorescence-based sequencing. GelImager is a framework for gel image manipulation. It can be used for gel visualization, lane retracking, and as a front end to the Washington University Getlanes program. The programs were designed using a cross-platform development environment, currently allowing them to run in Windows NT, Windows 95, Openstep/Mach, and Rhapsody. Work is ongoing to deploy the software on additional platforms, including Solaris, Linux, and MacOS. This software has been thoroughly tested and debugged in the analysis of >2 million bp of raw sequence data from human chromosome 19 region q13. Overall sequencing accuracy was measured using a significant subset of these data, consisting of approximately 600 sequences, by comparing the individual shotgun sequences against the final assembled contigs. Also, results are reported from experiments that analyzed the accuracy of the software and two other well-known base-calling programs for sequencing the M13mp18 vector sequence. [The sequence data described in this paper have been submitted to the GenBank data library under accession no. AF025422]

摘要

介绍了用于基于荧光测序的凝胶图像分析和碱基识别的软件,该软件由两个主要程序BaseFinder和GelImager组成。BaseFinder是一个用于轨迹处理、分析和碱基识别的框架。BaseFinder具有高度可扩展性,无需重新编译即可添加轨迹分析和处理模块。强大的脚本功能与模块化和多泳道处理相结合,允许用户将BaseFinder定制为几乎任何类型的轨迹处理。我们已经开发了一套广泛的数据处理和分析模块,用于基于荧光测序的程序。GelImager是一个用于凝胶图像处理的框架。它可用于凝胶可视化、泳道重新追踪,并作为华盛顿大学Getlanes程序的前端。这些程序是使用跨平台开发环境设计的,目前允许它们在Windows NT、Windows 95、Openstep/Mach和Rhapsody中运行。正在进行将该软件部署到其他平台的工作,包括Solaris、Linux和MacOS。该软件在分析来自人类19号染色体q13区域的超过200万碱基对的原始序列数据时经过了全面测试和调试。通过将单个鸟枪法序列与最终组装的重叠群进行比较,使用这些数据的一个重要子集(约600个序列)测量了总体测序准确性。此外,还报告了分析该软件和其他两个著名的碱基识别程序对M13mp18载体序列进行测序准确性的实验结果。[本文所述的序列数据已提交到GenBank数据库,登录号为AF025422]

相似文献

1
A software system for data analysis in automated DNA sequencing.
Genome Res. 1998 Jun;8(6):644-65. doi: 10.1101/gr.8.6.644.
2
Automated sequence preprocessing in a large-scale sequencing environment.
Genome Res. 1998 Sep;8(9):975-84. doi: 10.1101/gr.8.9.975.
3
preAssemble: a tool for automatic sequencer trace data processing.
BMC Bioinformatics. 2006 Jan 17;7:22. doi: 10.1186/1471-2105-7-22.
4
Automatic DNA Diagnosis for 1D Gel Electrophoresis Images using Bio-image Processing Technique.
BMC Genomics. 2015;16 Suppl 12(Suppl 12):S15. doi: 10.1186/1471-2164-16-S12-S15. Epub 2015 Dec 9.
5
Improvement of base-calling in multilane automated DNA sequencing by use of electrophoretic calibration standards, data linearization, and trace alignment.
Electrophoresis. 2001 Jun;22(10):1906-14. doi: 10.1002/1522-2683(200106)22:10<1906::AID-ELPS1906>3.0.CO;2-5.
6
ABI sequencing analysis. Manipulation of sequence data from the ABI DNA sequencer.
Mol Biotechnol. 1999 Dec 1;13(2):137-52. doi: 10.1385/MB:13:2:137.
7
Automated band mapping in electrophoretic gel images using background information.
Nucleic Acids Res. 2005 May 13;33(9):2806-12. doi: 10.1093/nar/gki580. Print 2005.
8
Using a neural network for lane-tracking of DNA sequencing slab gels.
J Biochem Biophys Methods. 2000 Aug 10;45(1):65-74. doi: 10.1016/s0165-022x(00)00099-3.
9
An estimate of the crosstalk matrix in four-dye fluorescence-based DNA sequencing.
Electrophoresis. 1999 Jun;20(7):1433-42. doi: 10.1002/(SICI)1522-2683(19990601)20:7<1433::AID-ELPS1433>3.0.CO;2-0.
10
GelClust: a software tool for gel electrophoresis images analysis and dendrogram generation.
Comput Methods Programs Biomed. 2013 Aug;111(2):512-8. doi: 10.1016/j.cmpb.2013.04.013. Epub 2013 May 30.

引用本文的文献

1
Trypanosomatid, fluorescence-based U-insertion/U-deletion RNA-editing (FIDE).
Bio Protoc. 2021 Mar 5;11(5):e3935. doi: 10.21769/BioProtoc.3935.
2
RNA secondary structure prediction using high-throughput SHAPE.
J Vis Exp. 2013 May 31(75):e50243. doi: 10.3791/50243.
3
A template for mutational data analysis of the CFTR gene.
Clin Chem Lab Med. 2011 Sep;49(9):1447-51. doi: 10.1515/CCLM.2011.604. Epub 2011 May 31.
7
High throughput DNA sequencing with a microfabricated 96-lane capillary array electrophoresis bioprocessor.
Proc Natl Acad Sci U S A. 2002 Jan 22;99(2):574-9. doi: 10.1073/pnas.012608699. Epub 2002 Jan 15.
8
Basecalling with LifeTrace.
Genome Res. 2001 May;11(5):875-88. doi: 10.1101/gr.177901.

本文引用的文献

1
Consed: a graphical tool for sequence finishing.
Genome Res. 1998 Mar;8(3):195-202. doi: 10.1101/gr.8.3.195.
2
Base-calling of automated sequencer traces using phred. I. Accuracy assessment.
Genome Res. 1998 Mar;8(3):175-85. doi: 10.1101/gr.8.3.175.
3
Fully automated DNA reaction and analysis in a fluidic capillary instrument.
Anal Chem. 1997 Mar 1;69(5):848-55. doi: 10.1021/ac961104o.
4
A method to determine the filter matrix in four-dye fluorescence-based DNA sequencing.
Electrophoresis. 1997 Jan;18(1):23-5. doi: 10.1002/elps.1150180106.
5
Lane tracking software for four-color fluorescence-based electrophoretic gel images.
Genome Res. 1996 Nov;6(11):1110-7. doi: 10.1101/gr.6.11.1110.
6
A graph theoretic approach to the analysis of DNA sequencing data.
Genome Res. 1996 Feb;6(2):80-91. doi: 10.1101/gr.6.2.80.
8
Automatic matrix determination in four dye fluorescence-based DNA sequencing.
Electrophoresis. 1996 Jun;17(6):1143-50. doi: 10.1002/elps.1150170626.
9
Deconvolution of gel filtration chromatographs of human plasma lipoproteins.
Anal Biochem. 1995 Nov 1;231(2):301-8. doi: 10.1006/abio.1995.0055.
10
Automated DNA sequencing: a look into the future.
Cancer Detect Prev. 1993;17(2):283-8.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验