• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ESTclean:新一代转录组鸟枪法测序的清洗工具。

ESTclean: a cleaning tool for next-gen transcriptome shotgun sequencing.

机构信息

The Center for Genomics and Bioinformatics, Indiana University, Bloomington, IN 47401, USA.

出版信息

BMC Bioinformatics. 2012 Sep 26;13:247. doi: 10.1186/1471-2105-13-247.

DOI:10.1186/1471-2105-13-247
PMID:23009593
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3630001/
Abstract

BACKGROUND

With the advent of next-generation sequencing (NGS) technologies, full cDNA shotgun sequencing has become a major approach in the study of transcriptomes, and several different protocols in 454 sequencing have been invented. As each protocol uses its own short DNA tags or adapters attached to the ends of cDNA fragments for labeling or sequencing, different contaminants may lead to mis-assembly and inaccurate sequence products.

RESULTS

We have designed and implemented a new program for raw sequence cleaning in a graphical user interface and a batch script. The cleaning process consists of several modules including barcode trimming, sequencing adapter trimming, amplification primer trimming, poly-A tail trimming, vector screening and low quality region trimming. These modules can be combined based on various sequencing applications.

CONCLUSIONS

ESTclean is a software package not only for cleaning cDNA sequences, but also for helping to develop sequencing protocols by providing summary tables and figures for sequencing quality control in a graphical user interface. It outperforms in cleaning read sequences from complicated sequencing protocols which use barcodes and multiple amplification primers.

摘要

背景

随着下一代测序(NGS)技术的出现,全长 cDNA 鸟枪法测序已成为转录组研究的主要方法,并且已经发明了几种不同的 454 测序方案。由于每个方案都使用自己的短 DNA 标签或接头连接到 cDNA 片段的末端进行标记或测序,因此不同的污染物可能导致错误组装和不准确的序列产物。

结果

我们设计并实现了一个新的图形用户界面和批处理脚本程序,用于原始序列清理。清理过程包括几个模块,包括条形码修剪、测序接头修剪、扩增引物修剪、多 A 尾修剪、载体筛选和低质量区域修剪。这些模块可以根据各种测序应用进行组合。

结论

ESTclean 是一个软件包,不仅可以用于清洁 cDNA 序列,还可以通过在图形用户界面中提供测序质量控制的汇总表和图形来帮助开发测序方案。它在清理使用条形码和多个扩增引物的复杂测序方案的读取序列方面表现出色。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/360e5737e036/1471-2105-13-247-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/86143ab14d2b/1471-2105-13-247-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/15ab66bb69de/1471-2105-13-247-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/937786504b1b/1471-2105-13-247-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/14d09e2cc712/1471-2105-13-247-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/09cc5b45d732/1471-2105-13-247-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/a9e42207f89a/1471-2105-13-247-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/c02a705472cc/1471-2105-13-247-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/360e5737e036/1471-2105-13-247-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/86143ab14d2b/1471-2105-13-247-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/15ab66bb69de/1471-2105-13-247-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/937786504b1b/1471-2105-13-247-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/14d09e2cc712/1471-2105-13-247-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/09cc5b45d732/1471-2105-13-247-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/a9e42207f89a/1471-2105-13-247-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/c02a705472cc/1471-2105-13-247-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f78c/3630001/360e5737e036/1471-2105-13-247-8.jpg

相似文献

1
ESTclean: a cleaning tool for next-gen transcriptome shotgun sequencing.ESTclean:新一代转录组鸟枪法测序的清洗工具。
BMC Bioinformatics. 2012 Sep 26;13:247. doi: 10.1186/1471-2105-13-247.
2
Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads.Skewer:一种用于新一代测序双端读段的快速且准确的接头修剪工具。
BMC Bioinformatics. 2014 Jun 12;15:182. doi: 10.1186/1471-2105-15-182.
3
ClinQC: a tool for quality control and cleaning of Sanger and NGS data in clinical research.ClinQC:临床研究中用于Sanger测序和二代测序(NGS)数据质量控制与清理的工具
BMC Bioinformatics. 2016 Feb 2;17:56. doi: 10.1186/s12859-016-0915-y.
4
Btrim: a fast, lightweight adapter and quality trimming program for next-generation sequencing technologies.Btrim:一种快速、轻量级的适用于新一代测序技术的接头和质量修剪程序。
Genomics. 2011 Aug;98(2):152-3. doi: 10.1016/j.ygeno.2011.05.009. Epub 2011 May 30.
5
EasyQC: Tool with Interactive User Interface for Efficient Next-Generation Sequencing Data Quality Control.EasyQC:用于高效下一代测序数据质量控制的具有交互式用户界面的工具。
J Comput Biol. 2018 Dec;25(12):1301-1311. doi: 10.1089/cmb.2017.0186. Epub 2018 Sep 8.
6
AlienTrimmer: a tool to quickly and accurately trim off multiple short contaminant sequences from high-throughput sequencing reads.AlienTrimmer:一种快速准确地从高通量测序读取中修剪掉多个短污染序列的工具。
Genomics. 2013 Nov-Dec;102(5-6):500-6. doi: 10.1016/j.ygeno.2013.07.011. Epub 2013 Aug 1.
7
Levenshtein error-correcting barcodes for multiplexed DNA sequencing.莱文斯坦纠错条码在多重 DNA 测序中的应用。
BMC Bioinformatics. 2013 Sep 11;14:272. doi: 10.1186/1471-2105-14-272.
8
cutPrimers: A New Tool for Accurate Cutting of Primers from Reads of Targeted Next Generation Sequencing.cutPrimers:一种从靶向新一代测序读段中准确切割引物的新工具。
J Comput Biol. 2017 Nov;24(11):1138-1143. doi: 10.1089/cmb.2017.0096. Epub 2017 Jul 17.
9
SeqPurge: highly-sensitive adapter trimming for paired-end NGS data.SeqPurge:用于双端NGS数据的高灵敏度接头修剪
BMC Bioinformatics. 2016 May 10;17:208. doi: 10.1186/s12859-016-1069-7.
10
RNA sequencing and quantitation using the Helicos Genetic Analysis System.使用Helicos遗传分析系统进行RNA测序和定量分析。
Methods Mol Biol. 2011;733:37-49. doi: 10.1007/978-1-61779-089-8_3.

引用本文的文献

1
Evolutionary divergence of core and post-translational circadian clock genes in the pitcher-plant mosquito, Wyeomyia smithii.猪笼草蚊(Wyeomyia smithii)核心及翻译后生物钟基因的进化分歧
BMC Genomics. 2015 Oct 6;16:754. doi: 10.1186/s12864-015-1937-y.
2
SeqControl: process control for DNA sequencing.SeqControl:DNA 测序的过程控制。
Nat Methods. 2014 Oct;11(10):1071-5. doi: 10.1038/nmeth.3094. Epub 2014 Aug 31.

本文引用的文献

1
SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read.SeqTrim:一种用于预处理任何类型序列读取的高通量管道。
BMC Bioinformatics. 2010 Jan 20;11:38. doi: 10.1186/1471-2105-11-38.
2
Sequencing and de novo analysis of a coral larval transcriptome using 454 GSFlx.使用454 GSFlx对珊瑚幼虫转录组进行测序和从头分析。
BMC Genomics. 2009 May 12;10:219. doi: 10.1186/1471-2164-10-219.
3
Figaro: a novel statistical method for vector sequence removal.费加罗:一种用于去除向量序列的新型统计方法。
Bioinformatics. 2008 Feb 15;24(4):462-7. doi: 10.1093/bioinformatics/btm632. Epub 2008 Jan 17.
4
Next-generation sequencing transforms today's biology.新一代测序技术改变了当今的生物学。
Nat Methods. 2008 Jan;5(1):16-8. doi: 10.1038/nmeth1156. Epub 2007 Dec 19.
5
GMAP: a genomic mapping and alignment program for mRNA and EST sequences.GMAP:一种用于mRNA和EST序列的基因组图谱绘制与比对程序。
Bioinformatics. 2005 May 1;21(9):1859-75. doi: 10.1093/bioinformatics/bti310. Epub 2005 Feb 22.
6
DNA sequence quality trimming and vector removal.DNA序列质量修剪和载体去除。
Bioinformatics. 2001 Dec;17(12):1093-104. doi: 10.1093/bioinformatics/17.12.1093.
7
A general method applicable to the search for similarities in the amino acid sequence of two proteins.一种适用于寻找两种蛋白质氨基酸序列相似性的通用方法。
J Mol Biol. 1970 Mar;48(3):443-53. doi: 10.1016/0022-2836(70)90057-4.
8
Basic local alignment search tool.基本局部比对搜索工具
J Mol Biol. 1990 Oct 5;215(3):403-10. doi: 10.1016/S0022-2836(05)80360-2.