• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预组装:一种用于自动测序仪追踪数据处理的工具。

preAssemble: a tool for automatic sequencer trace data processing.

作者信息

Adzhubei Alexei A, Laerdahl Jon K, Vlasova Anna V

机构信息

Norwegian School of Veterinary Science, BasAM-Genetics, PO Box 8146 Dep, NO-0033 Oslo, Norway.

出版信息

BMC Bioinformatics. 2006 Jan 17;7:22. doi: 10.1186/1471-2105-7-22.

DOI:10.1186/1471-2105-7-22
PMID:16417643
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1352386/
Abstract

BACKGROUND

Trace or chromatogram files (raw data) are produced by automatic nucleic acid sequencing equipment or sequencers. Each file contains information which can be interpreted by specialised software to reveal the sequence (base calling). This is done by the sequencer proprietary software or publicly available programs. Depending on the size of a sequencing project the number of trace files can vary from just a few to thousands of files. Sequencing quality assessment on various criteria is important at the stage preceding clustering and contig assembly. Two major publicly available packages--Phred and Staden are used by preAssemble to perform sequence quality processing.

RESULTS

The preAssemble pre-assembly sequence processing pipeline has been developed for small to large scale automatic processing of DNA sequencer chromatogram (trace) data. The Staden Package Pregap4 module and base-calling program Phred are utilized in the pipeline, which produces detailed and self-explanatory output that can be displayed with a web browser. preAssemble can be used successfully with very little previous experience, however options for parameter tuning are provided for advanced users. preAssemble runs under UNIX and LINUX operating systems. It is available for downloading and will run as stand-alone software. It can also be accessed on the Norwegian Salmon Genome Project web site where preAssemble jobs can be run on the project server.

CONCLUSION

preAssemble is a tool allowing to perform quality assessment of sequences generated by automatic sequencing equipment. preAssemble is flexible since both interactive jobs on the preAssemble server and the stand alone downloadable version are available. Virtually no previous experience is necessary to run a default preAssemble job, on the other hand options for parameter tuning are provided. Consequently preAssemble can be used as efficiently for just several trace files as for large scale sequence processing.

摘要

背景

痕量或色谱图文件(原始数据)由自动核酸测序设备或测序仪生成。每个文件都包含可由专门软件解释以揭示序列(碱基识别)的信息。这由测序仪的专有软件或公开可用的程序完成。根据测序项目的规模,痕量文件的数量可能从几个到数千个不等。在聚类和重叠群组装之前的阶段,根据各种标准进行测序质量评估很重要。预组装程序使用两个主要的公开可用软件包——Phred和Staden来进行序列质量处理。

结果

已开发出预组装预装配序列处理流程,用于对DNA测序仪色谱图(痕量)数据进行从小规模到大规模的自动处理。该流程利用了Staden软件包的Pregap4模块和碱基识别程序Phred,生成的详细且自解释的输出可通过网络浏览器显示。几乎无需任何经验即可成功使用预组装程序,不过也为高级用户提供了参数调整选项。预组装程序在UNIX和LINUX操作系统下运行。它可供下载并将作为独立软件运行。也可以在挪威鲑鱼基因组计划网站上访问,在该网站上可以在项目服务器上运行预组装任务。

结论

预组装程序是一种可对自动测序设备生成的序列进行质量评估的工具。预组装程序很灵活,因为既可以使用预组装服务器上的交互式任务,也可以使用独立的可下载版本。运行默认的预组装任务几乎无需任何经验,另一方面也提供了参数调整选项。因此,预组装程序对于几个痕量文件和大规模序列处理都能同样高效地使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c74/1352386/f1c2a5bbd86d/1471-2105-7-22-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c74/1352386/f1c2a5bbd86d/1471-2105-7-22-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c74/1352386/f1c2a5bbd86d/1471-2105-7-22-1.jpg

相似文献

1
preAssemble: a tool for automatic sequencer trace data processing.预组装:一种用于自动测序仪追踪数据处理的工具。
BMC Bioinformatics. 2006 Jan 17;7:22. doi: 10.1186/1471-2105-7-22.
2
DraGnET: software for storing, managing and analyzing annotated draft genome sequence data.DraGnET:用于存储、管理和分析带注释的草图基因组序列数据的软件。
BMC Bioinformatics. 2010 Feb 22;11:100. doi: 10.1186/1471-2105-11-100.
3
SeqDoC: rapid SNP and mutation detection by direct comparison of DNA sequence chromatograms.SeqDoC:通过直接比较DNA序列色谱图快速检测单核苷酸多态性和突变。
BMC Bioinformatics. 2005 May 31;6:133. doi: 10.1186/1471-2105-6-133.
4
Tracy: basecalling, alignment, assembly and deconvolution of sanger chromatogram trace files.泰西:桑格色谱迹线文件的碱基调用、比对、组装和解卷积。
BMC Genomics. 2020 Mar 14;21(1):230. doi: 10.1186/s12864-020-6635-8.
5
Base-calling of automated sequencer traces using phred. I. Accuracy assessment.使用Phred对自动测序仪轨迹进行碱基识别。I. 准确性评估。
Genome Res. 1998 Mar;8(3):175-85. doi: 10.1101/gr.8.3.175.
6
GATA: a graphic alignment tool for comparative sequence analysis.GATA:一种用于比较序列分析的图形比对工具。
BMC Bioinformatics. 2005 Jan 17;6:9. doi: 10.1186/1471-2105-6-9.
7
Ridom TraceEdit: a DNA trace editor and viewer.Ridom TraceEdit:一款DNA序列编辑器与查看器。
Bioinformatics. 2006 Feb 15;22(4):493-4. doi: 10.1093/bioinformatics/btk002. Epub 2005 Dec 15.
8
ESAP plus: a web-based server for EST-SSR marker development.ESAP plus:一个用于EST-SSR标记开发的基于网络的服务器。
BMC Genomics. 2016 Dec 22;17(Suppl 13):1035. doi: 10.1186/s12864-016-3328-4.
9
MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools.MAGIC-SPP:一个由数据库驱动的DNA序列处理软件包及相关管理工具。
BMC Bioinformatics. 2006 Mar 7;7:115. doi: 10.1186/1471-2105-7-115.
10
SNP mining porcine ESTs with MAVIANT, a novel tool for SNP evaluation and annotation.利用MAVIANT挖掘猪EST中的单核苷酸多态性,MAVIANT是一种用于单核苷酸多态性评估和注释的新型工具。
Bioinformatics. 2007 Jul 1;23(13):i387-91. doi: 10.1093/bioinformatics/btm192.

引用本文的文献

1
PIMS sequencing extension: a laboratory information management system for DNA sequencing facilities.PIMS测序扩展:一种用于DNA测序设施的实验室信息管理系统。
BMC Res Notes. 2011 Mar 7;4:48. doi: 10.1186/1756-0500-4-48.
2
ESTPiper--a web-based analysis pipeline for expressed sequence tags.ESTPiper——一个用于表达序列标签的基于网络的分析流程。
BMC Genomics. 2009 Apr 21;10:174. doi: 10.1186/1471-2164-10-174.
3
VarDetect: a nucleotide sequence variation exploratory tool.VarDetect:一种核苷酸序列变异探索工具。

本文引用的文献

1
ESTWeb: bioinformatics services for EST sequencing projects.ESTWeb:为EST测序项目提供的生物信息学服务。
Bioinformatics. 2003 Aug 12;19(12):1587-8. doi: 10.1093/bioinformatics/btg196.
2
Automated finishing with autofinish.使用自动完成功能进行自动整理。
Genome Res. 2001 Apr;11(4):614-25. doi: 10.1101/gr.171401.
3
The Staden package, 1998.斯塔登软件包,1998年。
BMC Bioinformatics. 2008 Dec 12;9 Suppl 12(Suppl 12):S9. doi: 10.1186/1471-2105-9-S12-S9.
4
Expressed sequence tags with cDNA termini: previously overlooked resources for gene annotation and transcriptome exploration in Chlamydomonas reinhardtii.具有cDNA末端的表达序列标签:莱茵衣藻中用于基因注释和转录组探索的先前被忽视的资源。
Genetics. 2008 May;179(1):83-93. doi: 10.1534/genetics.107.085605.
5
Annotated expressed sequence tags (ESTs) from pre-smolt Atlantic salmon (Salmo salar) in a searchable data resource.来自可搜索数据资源中前银化大西洋鲑(Salmo salar)的注释表达序列标签(ESTs)。
BMC Genomics. 2007 Jul 2;8:209. doi: 10.1186/1471-2164-8-209.
6
Gender-specific selection on codon usage in plant genomes.植物基因组中密码子使用的性别特异性选择。
BMC Genomics. 2007 Jun 13;8:169. doi: 10.1186/1471-2164-8-169.
7
WebTraceMiner: a web service for processing and mining EST sequence trace files.WebTraceMiner:一个用于处理和挖掘EST序列追踪文件的网络服务。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W137-42. doi: 10.1093/nar/gkm299. Epub 2007 May 8.
Methods Mol Biol. 2000;132:115-30. doi: 10.1385/1-59259-192-2:115.
4
Base-calling of automated sequencer traces using phred. II. Error probabilities.使用Phred对自动测序仪追踪结果进行碱基识别。II. 错误概率。
Genome Res. 1998 Mar;8(3):186-94.
5
Base-calling of automated sequencer traces using phred. I. Accuracy assessment.使用Phred对自动测序仪轨迹进行碱基识别。I. 准确性评估。
Genome Res. 1998 Mar;8(3):175-85. doi: 10.1101/gr.8.3.175.