• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CANGS:一种用于生物多样性研究中处理和分析454 GS-FLX数据的用户友好型实用工具。

CANGS: a user-friendly utility for processing and analyzing 454 GS-FLX data in biodiversity studies.

作者信息

Pandey Ram Vinay, Nolte Viola, Schlötterer Christian

机构信息

Institut für Populationsgenetik, Veterinärmedizinische Universität Wien, Veterinärplatz 1, Vienna, Austria.

出版信息

BMC Res Notes. 2010 Jan 11;3:3. doi: 10.1186/1756-0500-3-3.

DOI:10.1186/1756-0500-3-3
PMID:20180949
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2830946/
Abstract

BACKGROUND

Next generation sequencing (NGS) technologies have substantially increased the sequence output while the costs were dramatically reduced. In addition to the use in whole genome sequencing, the 454 GS-FLX platform is becoming a widely used tool for biodiversity surveys based on amplicon sequencing. In order to use NGS for biodiversity surveys, software tools are required, which perform quality control, trimming of the sequence reads, removal of PCR primers, and generation of input files for downstream analyses. A user-friendly software utility that carries out these steps is still lacking.

FINDINGS

We developed CANGS (Cleaning and Analyzing Next Generation Sequences) a flexible and user-friendly integrated software utility: CANGS is designed for amplicon based biodiversity surveys using the 454 sequencing platform. CANGS filters low quality sequences, removes PCR primers, filters singletons, identifies barcodes, and generates input files for downstream analyses. The downstream analyses rely either on third party software (e.g.: rarefaction analyses) or CANGS-specific scripts. The latter include modules linking 454 sequences with the name of the closest taxonomic reference retrieved from the NCBI database and the sequence divergence between them. Our software can be easily adapted to handle sequencing projects with different amplicon sizes, primer sequences, and quality thresholds, which makes this software especially useful for non-bioinformaticians.

CONCLUSION

CANGS performs PCR primer clipping, filtering of low quality sequences, links sequences to NCBI taxonomy and provides input files for common rarefaction analysis software programs. CANGS is written in Perl and runs on Mac OS X/Linux and is available at http://i122server.vu-wien.ac.at/pop/software.html.

摘要

背景

新一代测序(NGS)技术在大幅增加测序产出的同时,成本也显著降低。除了用于全基因组测序外,454 GS - FLX平台正成为基于扩增子测序进行生物多样性调查的广泛使用工具。为了将NGS用于生物多样性调查,需要软件工具来进行质量控制、修剪序列读数、去除PCR引物以及生成用于下游分析的输入文件。目前仍缺乏一个执行这些步骤的用户友好型软件实用程序。

研究结果

我们开发了CANGS(下一代序列清理与分析),这是一个灵活且用户友好的集成软件实用程序:CANGS专为使用454测序平台基于扩增子的生物多样性调查而设计。CANGS可过滤低质量序列、去除PCR引物、过滤单例序列、识别条形码,并生成用于下游分析的输入文件。下游分析可依赖第三方软件(如:稀疏化分析)或CANGS特定脚本。后者包括将454序列与从NCBI数据库检索到的最接近分类学参考名称及其之间的序列差异相链接的模块。我们的软件可以轻松适应处理具有不同扩增子大小、引物序列和质量阈值的测序项目,这使得该软件对非生物信息学家特别有用。

结论

CANGS执行PCR引物剪切、低质量序列过滤、将序列与NCBI分类学相链接,并为常见的稀疏化分析软件程序提供输入文件。CANGS用Perl编写,可在Mac OS X/Linux上运行,可从http://i122server.vu - wien.ac.at/pop/software.html获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac4/2830946/367229827a4b/1756-0500-3-3-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac4/2830946/9e69983bed1f/1756-0500-3-3-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac4/2830946/21eae73ca5c8/1756-0500-3-3-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac4/2830946/cdf904d6045c/1756-0500-3-3-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac4/2830946/367229827a4b/1756-0500-3-3-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac4/2830946/9e69983bed1f/1756-0500-3-3-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac4/2830946/21eae73ca5c8/1756-0500-3-3-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac4/2830946/cdf904d6045c/1756-0500-3-3-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ac4/2830946/367229827a4b/1756-0500-3-3-4.jpg

相似文献

1
CANGS: a user-friendly utility for processing and analyzing 454 GS-FLX data in biodiversity studies.CANGS:一种用于生物多样性研究中处理和分析454 GS-FLX数据的用户友好型实用工具。
BMC Res Notes. 2010 Jan 11;3:3. doi: 10.1186/1756-0500-3-3.
2
CANGS DB: a stand-alone web-based database tool for processing, managing and analyzing 454 data in biodiversity studies.CANGS数据库:一个用于生物多样性研究中处理、管理和分析454数据的独立网络数据库工具。
BMC Res Notes. 2011 Jun 30;4:227. doi: 10.1186/1756-0500-4-227.
3
ClinQC: a tool for quality control and cleaning of Sanger and NGS data in clinical research.ClinQC:临床研究中用于Sanger测序和二代测序(NGS)数据质量控制与清理的工具
BMC Bioinformatics. 2016 Feb 2;17:56. doi: 10.1186/s12859-016-0915-y.
4
Software for pre-processing Illumina next-generation sequencing short read sequences.用于预处理Illumina下一代测序短读序列的软件。
Source Code Biol Med. 2014 May 3;9:8. doi: 10.1186/1751-0473-9-8. eCollection 2014.
5
Tourmaline: A containerized workflow for rapid and iterable amplicon sequence analysis using QIIME 2 and Snakemake.电气石:使用 QIIME 2 和 Snakemake 进行快速可迭代扩增子序列分析的集装箱工作流程。
Gigascience. 2022 Jul 28;11. doi: 10.1093/gigascience/giac066.
6
PyroTrimmer: a software with GUI for pre-processing 454 amplicon sequences.PyroTrimmer:一个带有图形用户界面的软件,用于预处理 454 扩增子序列。
J Microbiol. 2012 Oct;50(5):766-9. doi: 10.1007/s12275-012-2494-6. Epub 2012 Nov 4.
7
CLOTU: an online pipeline for processing and clustering of 454 amplicon reads into OTUs followed by taxonomic annotation.CLOTU:一个在线工作流程,用于对 454 扩增子读取进行处理和聚类,形成 OTUs,然后进行分类注释。
BMC Bioinformatics. 2011 May 20;12:182. doi: 10.1186/1471-2105-12-182.
8
OrthoSelect: a protocol for selecting orthologous groups in phylogenomics.OrthoSelect:一种在系统发育基因组学中选择直系同源组的方案。
BMC Bioinformatics. 2009 Jul 16;10:219. doi: 10.1186/1471-2105-10-219.
9
StreamingTrim 1.0: a Java software for dynamic trimming of 16S rRNA sequence data from metagenetic studies.StreamingTrim 1.0:一个用于从宏基因组研究中动态修剪 16S rRNA 序列数据的 Java 软件。
Mol Ecol Resour. 2014 Mar;14(2):426-34. doi: 10.1111/1755-0998.12187. Epub 2013 Nov 16.
10
EasyQC: Tool with Interactive User Interface for Efficient Next-Generation Sequencing Data Quality Control.EasyQC:用于高效下一代测序数据质量控制的具有交互式用户界面的工具。
J Comput Biol. 2018 Dec;25(12):1301-1311. doi: 10.1089/cmb.2017.0186. Epub 2018 Sep 8.

引用本文的文献

1
Structural venomics reveals evolution of a complex venom by duplication and diversification of an ancient peptide-encoding gene.结构 venomomics 通过古老的肽编码基因的复制和多样化揭示了复杂毒液的进化。
Proc Natl Acad Sci U S A. 2020 May 26;117(21):11399-11408. doi: 10.1073/pnas.1914536117. Epub 2020 May 12.
2
Exploring the Human Microbiome: The Potential Future Role of Next-Generation Sequencing in Disease Diagnosis and Treatment.探索人类微生物组:下一代测序在疾病诊断和治疗中的潜在未来作用。
Front Immunol. 2019 Jan 7;9:2868. doi: 10.3389/fimmu.2018.02868. eCollection 2018.
3
Gomesin inhibits melanoma growth by manipulating key signaling cascades that control cell death and proliferation.

本文引用的文献

1
Microbial population structures in the deep marine biosphere.深海生物圈中的微生物种群结构。
Science. 2007 Oct 5;318(5847):97-100. doi: 10.1126/science.1146689.
2
Sensitive mutation detection in heterogeneous cancer specimens by massively parallel picoliter reactor sequencing.通过大规模平行皮升反应器测序检测异质性癌症标本中的敏感突变
Nat Med. 2006 Jul;12(7):852-5. doi: 10.1038/nm1437. Epub 2006 Jun 25.
3
FastGroupII: a web-based bioinformatics platform for analyses of large 16S rDNA libraries.FastGroupII:一个用于分析大型16S rDNA文库的基于网络的生物信息学平台。
高美辛通过操纵控制细胞死亡和增殖的关键信号级联来抑制黑色素瘤的生长。
Sci Rep. 2018 Aug 1;8(1):11519. doi: 10.1038/s41598-018-29826-4.
4
Synchrony of Eukaryotic and Prokaryotic Planktonic Communities in Three Seasonally Sampled Austrian Lakes.奥地利三个季节性采样湖泊中真核生物和原核生物浮游生物群落的同步性
Front Microbiol. 2018 Jun 15;9:1290. doi: 10.3389/fmicb.2018.01290. eCollection 2018.
5
Protistan community analysis: key findings of a large-scale molecular sampling.原生生物群落分析:大规模分子采样的主要发现
ISME J. 2016 Sep;10(9):2269-79. doi: 10.1038/ismej.2016.10. Epub 2016 Feb 9.
6
ClinQC: a tool for quality control and cleaning of Sanger and NGS data in clinical research.ClinQC:临床研究中用于Sanger测序和二代测序(NGS)数据质量控制与清理的工具
BMC Bioinformatics. 2016 Feb 2;17:56. doi: 10.1186/s12859-016-0915-y.
7
Toward accurate molecular identification of species in complex environmental samples: testing the performance of sequence filtering and clustering methods.迈向复杂环境样本中物种的准确分子鉴定:测试序列过滤和聚类方法的性能
Ecol Evol. 2015 Jun;5(11):2252-66. doi: 10.1002/ece3.1497. Epub 2015 May 13.
8
Metagenomics: tools and insights for analyzing next-generation sequencing data derived from biodiversity studies.宏基因组学:用于分析源自生物多样性研究的下一代测序数据的工具与见解。
Bioinform Biol Insights. 2015 May 5;9:75-88. doi: 10.4137/BBI.S12462. eCollection 2015.
9
ITScan: a web-based analysis tool for Internal Transcribed Spacer (ITS) sequences.ITScan:一种用于内部转录间隔区(ITS)序列的基于网络的分析工具。
BMC Res Notes. 2014 Nov 27;7:857. doi: 10.1186/1756-0500-7-857.
10
An Illumina metabarcoding pipeline for fungi.Illumina 真菌宏条形码分析工作流程
Ecol Evol. 2014 Jul;4(13):2642-53. doi: 10.1002/ece3.1107. Epub 2014 Jun 2.
BMC Bioinformatics. 2006 Feb 7;7:57. doi: 10.1186/1471-2105-7-57.
4
Introducing DOTUR, a computer program for defining operational taxonomic units and estimating species richness.介绍DOTUR,一个用于定义操作分类单元和估计物种丰富度的计算机程序。
Appl Environ Microbiol. 2005 Mar;71(3):1501-6. doi: 10.1128/AEM.71.3.1501-1506.2005.
5
MAFFT version 5: improvement in accuracy of multiple sequence alignment.MAFFT 5 版本:多重序列比对准确性的提升。
Nucleic Acids Res. 2005 Jan 20;33(2):511-8. doi: 10.1093/nar/gki198. Print 2005.
6
The Bioperl toolkit: Perl modules for the life sciences.生物Perl工具包:用于生命科学的Perl模块。
Genome Res. 2002 Oct;12(10):1611-8. doi: 10.1101/gr.361602.
7
Basic local alignment search tool.基本局部比对搜索工具
J Mol Biol. 1990 Oct 5;215(3):403-10. doi: 10.1016/S0022-2836(05)80360-2.