Suppr超能文献

在微阵列分析工作流程中使用开普勒进行工具集成。

Using Kepler for Tool Integration in Microarray Analysis Workflows.

作者信息

Gan Zhuohui, Stowe Jennifer C, Altintas Ilkay, McCulloch Andrew D, Zambon Alexander C

机构信息

Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA.

San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, USA.

出版信息

Procedia Comput Sci. 2014;29:2162-2167. doi: 10.1016/j.procs.2014.05.201.

Abstract

Increasing numbers of genomic technologies are leading to massive amounts of genomic data, all of which requires complex analysis. More and more bioinformatics analysis tools are being developed by scientist to simplify these analyses. However, different pipelines have been developed using different software environments. This makes integrations of these diverse bioinformatics tools difficult. Kepler provides an open source environment to integrate these disparate packages. Using Kepler, we integrated several external tools including Bioconductor packages, AltAnalyze, a python-based open source tool, and R-based comparison tool to build an automated workflow to meta-analyze both online and local microarray data. The automated workflow connects the integrated tools seamlessly, delivers data flow between the tools smoothly, and hence improves efficiency and accuracy of complex data analyses. Our workflow exemplifies the usage of Kepler as a scientific workflow platform for bioinformatics pipelines.

摘要

越来越多的基因组技术正在产生海量的基因组数据,所有这些数据都需要进行复杂的分析。科学家们开发了越来越多的生物信息学分析工具来简化这些分析。然而,不同的流程是使用不同的软件环境开发的。这使得整合这些不同的生物信息学工具变得困难。开普勒提供了一个开源环境来整合这些不同的软件包。我们使用开普勒整合了几个外部工具,包括生物导体软件包、AltAnalyze(一个基于Python的开源工具)和基于R的比较工具,以构建一个自动化工作流程,对在线和本地微阵列数据进行元分析。这个自动化工作流程无缝连接了整合的工具,在工具之间顺畅地传递数据流,从而提高了复杂数据分析的效率和准确性。我们的工作流程例证了开普勒作为生物信息学流程的科学工作流程平台的用法。

相似文献

1
Using Kepler for Tool Integration in Microarray Analysis Workflows.
Procedia Comput Sci. 2014;29:2162-2167. doi: 10.1016/j.procs.2014.05.201.
2
Workflows for microarray data processing in the Kepler environment.
BMC Bioinformatics. 2012 May 17;13:102. doi: 10.1186/1471-2105-13-102.
4
MAAMD: a workflow to standardize meta-analyses and comparison of affymetrix microarray data.
BMC Bioinformatics. 2014 Mar 12;15:69. doi: 10.1186/1471-2105-15-69.
5
Conveyor: a workflow engine for bioinformatic analyses.
Bioinformatics. 2011 Apr 1;27(7):903-11. doi: 10.1093/bioinformatics/btr040. Epub 2011 Jan 28.
7
Bridging experiment and theory: a template for unifying NMR data and electronic structure calculations.
J Cheminform. 2016 Feb 9;8:8. doi: 10.1186/s13321-016-0120-z. eCollection 2016.
8
systemPipeR: NGS workflow and report generation environment.
BMC Bioinformatics. 2016 Sep 20;17:388. doi: 10.1186/s12859-016-1241-0.
9
JMS: An Open Source Workflow Management System and Web-Based Cluster Front-End for High Performance Computing.
PLoS One. 2015 Aug 17;10(8):e0134273. doi: 10.1371/journal.pone.0134273. eCollection 2015.

本文引用的文献

1
MAAMD: a workflow to standardize meta-analyses and comparison of affymetrix microarray data.
BMC Bioinformatics. 2014 Mar 12;15:69. doi: 10.1186/1471-2105-15-69.
2
GO-Elite: a flexible solution for pathway and ontology over-representation.
Bioinformatics. 2012 Aug 15;28(16):2209-10. doi: 10.1093/bioinformatics/bts366. Epub 2012 Jun 27.
3
Workflows for microarray data processing in the Kepler environment.
BMC Bioinformatics. 2012 May 17;13:102. doi: 10.1186/1471-2105-13-102.
4
Applications of the pipeline environment for visual informatics and genomics computations.
BMC Bioinformatics. 2011 Jul 26;12:304. doi: 10.1186/1471-2105-12-304.
5
Tools for managing and analyzing microarray data.
Brief Bioinform. 2012 Jan;13(1):46-60. doi: 10.1093/bib/bbr010. Epub 2011 Mar 21.
6
Chronic hypoxia impairs muscle function in the Drosophila model of Duchenne's muscular dystrophy (DMD).
PLoS One. 2010 Oct 20;5(10):e13450. doi: 10.1371/journal.pone.0013450.
7
Experimental selection for Drosophila survival in extremely high O2 environments.
PLoS One. 2010 Jul 23;5(7):e11701. doi: 10.1371/journal.pone.0011701.
8
AltAnalyze and DomainGraph: analyzing and visualizing exon expression data.
Nucleic Acids Res. 2010 Jul;38(Web Server issue):W755-62. doi: 10.1093/nar/gkq405. Epub 2010 May 31.
9
Distinct mechanisms underlying tolerance to intermittent and constant hypoxia in Drosophila melanogaster.
PLoS One. 2009;4(4):e5371. doi: 10.1371/journal.pone.0005371. Epub 2009 Apr 29.
10

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验