一种用于处理由下一代测序产生的大数据集的实用生物信息工作流程系统。

A practical, bioinformatic workflow system for large data sets generated by next-generation sequencing.

作者信息

Cantacessi Cinzia, Jex Aaron R, Hall Ross S, Young Neil D, Campbell Bronwyn E, Joachim Anja, Nolan Matthew J, Abubucker Sahar, Sternberg Paul W, Ranganathan Shoba, Mitreva Makedonka, Gasser Robin B

机构信息

Department of Veterinary Science, The University of Melbourne, 250 Princes Highway, Werribee, Victoria 3030, Australia.

出版信息

Nucleic Acids Res. 2010 Sep;38(17):e171. doi: 10.1093/nar/gkq667. Epub 2010 Aug 3.

DOI:10.1093/nar/gkq667

PMID:20682560

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2943614/

Abstract

Transcriptomics (at the level of single cells, tissues and/or whole organisms) underpins many fields of biomedical science, from understanding the basic cellular function in model organisms, to the elucidation of the biological events that govern the development and progression of human diseases, and the exploration of the mechanisms of survival, drug-resistance and virulence of pathogens. Next-generation sequencing (NGS) technologies are contributing to a massive expansion of transcriptomics in all fields and are reducing the cost, time and performance barriers presented by conventional approaches. However, bioinformatic tools for the analysis of the sequence data sets produced by these technologies can be daunting to researchers with limited or no expertise in bioinformatics. Here, we constructed a semi-automated, bioinformatic workflow system, and critically evaluated it for the analysis and annotation of large-scale sequence data sets generated by NGS. We demonstrated its utility for the exploration of differences in the transcriptomes among various stages and both sexes of an economically important parasitic worm (Oesophagostomum dentatum) as well as the prediction and prioritization of essential molecules (including GTPases, protein kinases and phosphatases) as novel drug target candidates. This workflow system provides a practical tool for the assembly, annotation and analysis of NGS data sets, also to researchers with a limited bioinformatic expertise. The custom-written Perl, Python and Unix shell computer scripts used can be readily modified or adapted to suit many different applications. This system is now utilized routinely for the analysis of data sets from pathogens of major socio-economic importance and can, in principle, be applied to transcriptomics data sets from any organism.

摘要

转录组学（在单细胞、组织和/或整个生物体水平）是生物医学科学许多领域的基础，从了解模式生物中的基本细胞功能，到阐明控制人类疾病发生和发展的生物学事件，以及探索病原体的生存、耐药性和毒力机制。新一代测序（NGS）技术正在推动转录组学在所有领域的大规模扩展，并正在降低传统方法所带来的成本、时间和性能障碍。然而，对于生物信息学专业知识有限或没有专业知识的研究人员来说，分析这些技术产生的序列数据集的生物信息学工具可能令人生畏。在这里，我们构建了一个半自动的生物信息学工作流程系统，并对其进行了严格评估，以用于分析和注释由NGS生成的大规模序列数据集。我们展示了它在探索一种具有经济重要性的寄生蠕虫（齿状食道口线虫）不同阶段和两性之间转录组差异方面的效用，以及在预测和确定作为新型药物靶点候选物的必需分子（包括GTP酶、蛋白激酶和磷酸酶）方面的效用。这个工作流程系统为NGS数据集的组装、注释和分析提供了一个实用工具，也适用于生物信息学专业知识有限的研究人员。所使用的定制编写的Perl、Python和Unix shell计算机脚本可以很容易地修改或调整以适应许多不同的应用。该系统现在经常用于分析具有重大社会经济重要性的病原体的数据集，并且原则上可以应用于来自任何生物体的转录组学数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f7f/2943614/1c1f06ba1025/gkq667f1.jpg

相似文献

A practical, bioinformatic workflow system for large data sets generated by next-generation sequencing.

Nucleic Acids Res. 2010 Sep;38(17):e171. doi: 10.1093/nar/gkq667. Epub 2010 Aug 3.

Bioinformatics meets parasitology.

Parasite Immunol. 2012 May;34(5):265-75. doi: 10.1111/j.1365-3024.2011.01304.x.

Raw transcriptomics data to gene specific SSRs: a validated free bioinformatics workflow for biologists.

Sci Rep. 2020 Oct 26;10(1):18236. doi: 10.1038/s41598-020-75270-8.

Getting the most out of parasitic helminth transcriptomes using HelmDB: implications for biology and biotechnology.

Biotechnol Adv. 2013 Dec;31(8):1109-19. doi: 10.1016/j.biotechadv.2012.12.004. Epub 2012 Dec 21.

CoVaCS: a consensus variant calling system.

BMC Genomics. 2018 Feb 5;19(1):120. doi: 10.1186/s12864-018-4508-1.

Isolation and characterisation of sex-specific transcripts from Oesophagostomum dentatum by RNA arbitrarily-primed PCR.

Mol Biochem Parasitol. 2000 May;108(2):217-24. doi: 10.1016/s0166-6851(00)00217-6.

What can next generation sequencing do for you? Next generation sequencing as a valuable tool in plant research.

Plant Biol (Stuttg). 2010 Nov;12(6):831-41. doi: 10.1111/j.1438-8677.2010.00373.x.

SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.

BMC Bioinformatics. 2016 Feb 4;17:66. doi: 10.1186/s12859-016-0923-y.

JWES: a new pipeline for whole genome/exome sequence data processing, management, and gene-variant discovery, annotation, prediction, and genotyping.

FEBS Open Bio. 2021 Sep;11(9):2441-2452. doi: 10.1002/2211-5463.13261. Epub 2021 Aug 11.

Computational cloning of drug target genes of a parasitic nematode, Oesophagostomum dentatum.

BMC Genet. 2013 Jun 18;14:55. doi: 10.1186/1471-2156-14-55.

引用本文的文献

Comparison of seven SNP calling pipelines for the next-generation sequencing data of chickens.

PLoS One. 2022 Jan 31;17(1):e0262574. doi: 10.1371/journal.pone.0262574. eCollection 2022.

The Tumor Dynamism Is the Dark Matter of the NGS Galaxy: How to Understand It?

Cancers (Basel). 2021 Oct 30;13(21):5476. doi: 10.3390/cancers13215476.

Interactive online application for the prediction, ranking and prioritisation of drug targets in Schistosoma haematobium.

Parasit Vectors. 2018 Nov 27;11(1):605. doi: 10.1186/s13071-018-3197-6.

An integrated Java tool for generating amino acid sequence alignments with mapped secondary structure elements.

3 Biotech. 2015 Feb;5(1):87-92. doi: 10.1007/s13205-014-0222-0. Epub 2014 May 20.

Reverse Genetics and High Throughput Sequencing Methodologies for Plant Functional Genomics.

Curr Genomics. 2016 Dec;17(6):460-475. doi: 10.2174/1389202917666160520102827.

Transcriptome Analysis of the Chrysanthemum Foliar Nematode, Aphelenchoides ritzemabosi (Aphelenchida: Aphelenchoididae).

PLoS One. 2016 Nov 22;11(11):e0166877. doi: 10.1371/journal.pone.0166877. eCollection 2016.

De Novo Assembly and Transcriptome Analysis of Bulb Onion (Allium cepa L.) during Cold Acclimation Using Contrasting Genotypes.

PLoS One. 2016 Sep 14;11(9):e0161987. doi: 10.1371/journal.pone.0161987. eCollection 2016.

The Anisakis Transcriptome Provides a Resource for Fundamental and Applied Studies on Allergy-Causing Parasites.

PLoS Negl Trop Dis. 2016 Jul 29;10(7):e0004845. doi: 10.1371/journal.pntd.0004845. eCollection 2016 Jul.

The Complete Mitochondrial Genome Sequence of Bactericera cockerelli and Comparison with Three Other Psylloidea Species.

PLoS One. 2016 May 26;11(5):e0155318. doi: 10.1371/journal.pone.0155318. eCollection 2016.

Transcriptomic analyses reveal species-specific light-induced anthocyanin biosynthesis in chrysanthemum.

BMC Genomics. 2015 Mar 17;16(1):202. doi: 10.1186/s12864-015-1428-1.

本文引用的文献

Unlocking the transcriptomes of two carcinogenic parasites, Clonorchis sinensis and Opisthorchis viverrini.

PLoS Negl Trop Dis. 2010 Jun 22;4(6):e719. doi: 10.1371/journal.pntd.0000719.

Massively parallel sequencing and analysis of the Necator americanus transcriptome.

PLoS Negl Trop Dis. 2010 May 11;4(5):e684. doi: 10.1371/journal.pntd.0000684.

Differences in transcription between free-living and CO2-activated third-stage larvae of Haemonchus contortus.

BMC Genomics. 2010 Apr 27;11:266. doi: 10.1186/1471-2164-11-266.

Drug target prediction and prioritization: using orthology to predict essentiality in parasite genomes.

BMC Genomics. 2010 Apr 3;11:222. doi: 10.1186/1471-2164-11-222.

Elucidating the transcriptome of Fasciola hepatica - a key to fundamental and biotechnological discoveries for a neglected parasite.

Biotechnol Adv. 2010 Mar-Apr;28(2):222-31. doi: 10.1016/j.biotechadv.2009.12.003. Epub 2009 Dec 16.

Sequencing technologies - the next generation.

Nat Rev Genet. 2010 Jan;11(1):31-46. doi: 10.1038/nrg2626. Epub 2009 Dec 8.

Stage-specific expression profiling of Drosophila spermatogenesis suggests that meiotic sex chromosome inactivation drives genomic relocation of testis-expressed genes.

PLoS Genet. 2009 Nov;5(11):e1000731. doi: 10.1371/journal.pgen.1000731. Epub 2009 Nov 20.

Sense from sequence reads: methods for alignment and assembly.

Nat Methods. 2009 Nov;6(11 Suppl):S6-S12. doi: 10.1038/nmeth.1376.

Elucidating ANTs in worms using genomic and bioinformatic tools--biotechnological prospects?

Biotechnol Adv. 2010 Jan-Feb;28(1):49-60. doi: 10.1016/j.biotechadv.2009.09.001.

PAVE: program for assembling and viewing ESTs.

BMC Genomics. 2009 Aug 26;10:400. doi: 10.1186/1471-2164-10-400.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于处理由下一代测序产生的大数据集的实用生物信息工作流程系统。

A practical, bioinformatic workflow system for large data sets generated by next-generation sequencing.

作者信息

Cantacessi Cinzia, Jex Aaron R, Hall Ross S, Young Neil D, Campbell Bronwyn E, Joachim Anja, Nolan Matthew J, Abubucker Sahar, Sternberg Paul W, Ranganathan Shoba, Mitreva Makedonka, Gasser Robin B

机构信息

Department of Veterinary Science, The University of Melbourne, 250 Princes Highway, Werribee, Victoria 3030, Australia.

出版信息

Nucleic Acids Res. 2010 Sep;38(17):e171. doi: 10.1093/nar/gkq667. Epub 2010 Aug 3.

DOI:10.1093/nar/gkq667

PMID:20682560

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2943614/

Abstract

摘要

一种用于处理由下一代测序产生的大数据集的实用生物信息工作流程系统。

A practical, bioinformatic workflow system for large data sets generated by next-generation sequencing.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

一种用于处理由下一代测序产生的大数据集的实用生物信息工作流程系统。

A practical, bioinformatic workflow system for large data sets generated by next-generation sequencing.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献