• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个使用Globus基因组学系统对NGS数据进行基于云的高通量分析的案例研究。

A case study for cloud based high throughput analysis of NGS data using the globus genomics system.

作者信息

Bhuvaneshwar Krithika, Sulakhe Dinanath, Gauba Robinder, Rodriguez Alex, Madduri Ravi, Dave Utpal, Lacinski Lukasz, Foster Ian, Gusev Yuriy, Madhavan Subha

机构信息

Innovation Center for Biomedical Informatics (ICBI), Georgetown University, Washington, DC 20007, USA.

Computation Institute, University of Chicago, Argonne National Laboratory, 60637, USA; Globus Genomics, USA.

出版信息

Comput Struct Biotechnol J. 2014 Nov 7;13:64-74. doi: 10.1016/j.csbj.2014.11.001. eCollection 2015.

DOI:10.1016/j.csbj.2014.11.001
PMID:26925205
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4720014/
Abstract

Next generation sequencing (NGS) technologies produce massive amounts of data requiring a powerful computational infrastructure, high quality bioinformatics software, and skilled personnel to operate the tools. We present a case study of a practical solution to this data management and analysis challenge that simplifies terabyte scale data handling and provides advanced tools for NGS data analysis. These capabilities are implemented using the "Globus Genomics" system, which is an enhanced Galaxy workflow system made available as a service that offers users the capability to process and transfer data easily, reliably and quickly to address end-to-endNGS analysis requirements. The Globus Genomics system is built on Amazon 's cloud computing infrastructure. The system takes advantage of elastic scaling of compute resources to run multiple workflows in parallel and it also helps meet the scale-out analysis needs of modern translational genomics research.

摘要

下一代测序(NGS)技术会产生海量数据,这需要强大的计算基础设施、高质量的生物信息学软件以及熟练的操作人员来运行这些工具。我们展示了一个针对这一数据管理和分析挑战的实际解决方案的案例研究,该方案简化了TB级数据处理,并为NGS数据分析提供了先进工具。这些功能是通过“Globus基因组学”系统实现的,它是一个增强版的Galaxy工作流系统,作为一项服务提供,使用户能够轻松、可靠且快速地处理和传输数据,以满足端到端的NGS分析需求。Globus基因组学系统构建在亚马逊的云计算基础设施之上。该系统利用计算资源的弹性扩展来并行运行多个工作流,还有助于满足现代转化基因组学研究的横向扩展分析需求。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/beabaf01e3cc/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/c6511215df3b/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/f36ac071561e/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/a7a07b5496e2/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/5c81138591e4/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/6d23a1ab8910/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/914df3d73fcf/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/61e22be9222e/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/beabaf01e3cc/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/c6511215df3b/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/f36ac071561e/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/a7a07b5496e2/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/5c81138591e4/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/6d23a1ab8910/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/914df3d73fcf/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/61e22be9222e/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d63/4720014/beabaf01e3cc/gr7.jpg

相似文献

1
A case study for cloud based high throughput analysis of NGS data using the globus genomics system.一个使用Globus基因组学系统对NGS数据进行基于云的高通量分析的案例研究。
Comput Struct Biotechnol J. 2014 Nov 7;13:64-74. doi: 10.1016/j.csbj.2014.11.001. eCollection 2015.
2
Experiences Building Globus Genomics: A Next-Generation Sequencing Analysis Service using Galaxy, Globus, and Amazon Web Services.构建Globus基因组学的经验:一种使用Galaxy、Globus和亚马逊网络服务的下一代测序分析服务。
Concurr Comput. 2014 Sep 10;26(13):2266-2279. doi: 10.1002/cpe.3274.
3
Closha: bioinformatics workflow system for the analysis of massive sequencing data.Closha:用于大规模测序数据分析的生物信息学工作流系统。
BMC Bioinformatics. 2018 Feb 19;19(Suppl 1):43. doi: 10.1186/s12859-018-2019-3.
4
Genomics Virtual Laboratory: A Practical Bioinformatics Workbench for the Cloud.基因组学虚拟实验室:面向云端的实用生物信息学工作台。
PLoS One. 2015 Oct 26;10(10):e0140829. doi: 10.1371/journal.pone.0140829. eCollection 2015.
5
Tavaxy: integrating Taverna and Galaxy workflows with cloud computing support.Tavaxy:集成 Taverna 和 Galaxy 工作流并提供云计算支持。
BMC Bioinformatics. 2012 May 4;13:77. doi: 10.1186/1471-2105-13-77.
6
Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses.用于大规模下一代测序分析的基于云的生物信息学工作流程平台。
J Biomed Inform. 2014 Jun;49:119-33. doi: 10.1016/j.jbi.2014.01.005. Epub 2014 Jan 22.
7
FDA's Activities Supporting Regulatory Application of "Next Gen" Sequencing Technologies.美国食品药品监督管理局支持“下一代”测序技术监管应用的活动。
PDA J Pharm Sci Technol. 2014 Nov-Dec;68(6):626-30. doi: 10.5731/pdajpst.2014.01024.
8
DDBJ read annotation pipeline: a cloud computing-based pipeline for high-throughput analysis of next-generation sequencing data.DDBJ 读注释流水线:基于云计算的高通量下一代测序数据分析流水线。
DNA Res. 2013 Aug;20(4):383-90. doi: 10.1093/dnares/dst017. Epub 2013 May 8.
9
CloVR: a virtual machine for automated and portable sequence analysis from the desktop using cloud computing.CloVR:一种虚拟机,用于在桌面环境下通过云计算实现自动化和可移植的序列分析。
BMC Bioinformatics. 2011 Aug 30;12:356. doi: 10.1186/1471-2105-12-356.
10
Streaming support for data intensive cloud-based sequence analysis.面向基于云的大数据序列分析的流式支持。
Biomed Res Int. 2013;2013:791051. doi: 10.1155/2013/791051. Epub 2013 Apr 24.

引用本文的文献

1
Big Omics Data Experience.大型组学数据经验。
SC Conf Proc. 2015 Nov;2015. doi: 10.1145/2807591.2807595.
2
viGEN: An Open Source Pipeline for the Detection and Quantification of Viral RNA in Human Tumors.viGEN:用于检测和定量人类肿瘤中病毒RNA的开源流程
Front Microbiol. 2018 Jun 5;9:1172. doi: 10.3389/fmicb.2018.01172. eCollection 2018.
3
GT-WGS: an efficient and economic tool for large-scale WGS analyses based on the AWS cloud service.GT-WGS:一种基于 AWS 云服务的高效、经济的大规模 WGS 分析工具。

本文引用的文献

1
Comparing the Consumption of CPU Hours with Scientific Output for the Extreme Science and Engineering Discovery Environment (XSEDE).比较极端科学与工程发现环境(XSEDE)中CPU小时的消耗与科研产出。
PLoS One. 2016 Jun 16;11(6):e0157628. doi: 10.1371/journal.pone.0157628. eCollection 2016.
2
Experiences Building Globus Genomics: A Next-Generation Sequencing Analysis Service using Galaxy, Globus, and Amazon Web Services.构建Globus基因组学的经验:一种使用Galaxy、Globus和亚马逊网络服务的下一代测序分析服务。
Concurr Comput. 2014 Sep 10;26(13):2266-2279. doi: 10.1002/cpe.3274.
3
Consensus Genotyper for Exome Sequencing (CGES): improving the quality of exome variant genotypes.
BMC Genomics. 2018 Jan 19;19(Suppl 1):959. doi: 10.1186/s12864-017-4334-x.
4
The Lair: a resource for exploratory analysis of published RNA-Seq data.The Lair:一个用于已发表RNA测序数据探索性分析的资源。
BMC Bioinformatics. 2016 Dec 1;17(1):490. doi: 10.1186/s12859-016-1357-2.
5
Needs Assessment for Research Use of High-Throughput Sequencing at a Large Academic Medical Center.大型学术医疗中心高通量测序研究用途的需求评估
PLoS One. 2015 Jun 26;10(6):e0131166. doi: 10.1371/journal.pone.0131166. eCollection 2015.
外显子组测序一致性基因分型器(CGES):提高外显子组变异基因型的质量
Bioinformatics. 2015 Jan 15;31(2):187-93. doi: 10.1093/bioinformatics/btu591. Epub 2014 Sep 29.
4
Analysis of next-generation sequencing data using Galaxy.使用Galaxy分析下一代测序数据。
Methods Mol Biol. 2014;1150:21-43. doi: 10.1007/978-1-4939-0512-6_2.
5
Launching genomics into the cloud: deployment of Mercury, a next generation sequence analysis pipeline.将基因组学推向云端:下一代序列分析流水线 Mercury 的部署。
BMC Bioinformatics. 2014 Jan 29;15:30. doi: 10.1186/1471-2105-15-30.
6
An extensive evaluation of read trimming effects on Illumina NGS data analysis.对读段修剪对Illumina二代测序数据分析的影响进行的广泛评估。
PLoS One. 2013 Dec 23;8(12):e85024. doi: 10.1371/journal.pone.0085024. eCollection 2013.
7
Next-generation sequencing in the clinic.临床中的下一代测序技术。
Nat Biotechnol. 2013 Nov;31(11):990-2. doi: 10.1038/nbt.2743.
8
Genomics in the clouds.云端基因组学
Nat Methods. 2013 Oct;10(10):941-5. doi: 10.1038/nmeth.2654.
9
The next-generation sequencing revolution and its impact on genomics.下一代测序革命及其对基因组学的影响。
Cell. 2013 Sep 26;155(1):27-38. doi: 10.1016/j.cell.2013.09.006.
10
A survey of tools for variant analysis of next-generation genome sequencing data.下一代基因组测序数据变异分析工具综述。
Brief Bioinform. 2014 Mar;15(2):256-78. doi: 10.1093/bib/bbs086. Epub 2013 Jan 21.