Institute for Genome Sciences (IGS), University of Maryland School of Medicine, Baltimore, Maryland, USA.
BMC Bioinformatics. 2011 Aug 30;12:356. doi: 10.1186/1471-2105-12-356.
BACKGROUND: Next-generation sequencing technologies have decentralized sequence acquisition, increasing the demand for new bioinformatics tools that are easy to use, portable across multiple platforms, and scalable for high-throughput applications. Cloud computing platforms provide on-demand access to computing infrastructure over the Internet and can be used in combination with custom built virtual machines to distribute pre-packaged with pre-configured software. RESULTS: We describe the Cloud Virtual Resource, CloVR, a new desktop application for push-button automated sequence analysis that can utilize cloud computing resources. CloVR is implemented as a single portable virtual machine (VM) that provides several automated analysis pipelines for microbial genomics, including 16S, whole genome and metagenome sequence analysis. The CloVR VM runs on a personal computer, utilizes local computer resources and requires minimal installation, addressing key challenges in deploying bioinformatics workflows. In addition CloVR supports use of remote cloud computing resources to improve performance for large-scale sequence processing. In a case study, we demonstrate the use of CloVR to automatically process next-generation sequencing data on multiple cloud computing platforms. CONCLUSION: The CloVR VM and associated architecture lowers the barrier of entry for utilizing complex analysis protocols on both local single- and multi-core computers and cloud systems for high throughput data processing.
背景:下一代测序技术使序列采集去中心化,这增加了对新生物信息学工具的需求,这些工具易于使用、可跨多个平台移植、并且可扩展用于高通量应用。云计算平台通过互联网提供对计算基础设施的按需访问,并且可以与自定义构建的虚拟机结合使用,以分发预先打包和预配置软件。
结果:我们描述了 Cloud Virtual Resource,即 CloVR,这是一种用于一键式自动化序列分析的新桌面应用程序,它可以利用云计算资源。CloVR 实现为一个单一的便携式虚拟机 (VM),为微生物基因组学提供了多个自动化分析管道,包括 16S、全基因组和宏基因组序列分析。CloVR VM 在个人计算机上运行,利用本地计算机资源,并且需要最小的安装,解决了在部署生物信息学工作流程方面的关键挑战。此外,CloVR 支持使用远程云计算资源来提高大规模序列处理的性能。在一个案例研究中,我们展示了如何使用 CloVR 在多个云计算平台上自动处理下一代测序数据。
结论:CloVR VM 及其相关架构降低了在本地单核心和多核心计算机以及云系统上利用复杂分析协议进行高通量数据处理的门槛。
BMC Bioinformatics. 2011-8-30
BMC Bioinformatics. 2012-3-19
Methods Mol Biol. 2012
Curr Protoc Bioinformatics. 2013-10-15
PLoS One. 2015-10-26
Bioinform Biol Insights. 2021-7-28
BMC Bioinformatics. 2019-11-8
PLoS Negl Trop Dis. 2019-6-6
Front Microbiol. 2019-5-15
Science. 2011-2-11
Nat Rev Genet. 2011-3
BMC Bioinformatics. 2010-12-21
Nat Biotechnol. 2010-11
Genome Biol. 2010-8-25
Nat Rev Genet. 2010-9
Bioinformatics. 2010-8-12
Genome Biol. 2010-8-11
Nat Biotechnol. 2010-7