QMachine: commodity supercomputing in web browsers.

Authors

Wilkinson Sean R, Almeida Jonas S

Affiliation

Division of Informatics, Department of Pathology, University of Alabama at Birmingham, Birmingham, USA.

Publication information

BMC Bioinformatics. 2014 Jun 9;15:176. doi: 10.1186/1471-2105-15-176.

DOI:10.1186/1471-2105-15-176

PMID:24913605

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4063228/

Abstract

BACKGROUND

Ongoing advancements in cloud computing provide novel opportunities in scientific computing, especially for distributed workflows. Modern web browsers can now be used as high-performance workstations for querying, processing, and visualizing genomics' "Big Data" from sources like The Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC) without local software installation or configuration. The design of QMachine (QM) was driven by the opportunity to use this pervasive computing model in the context of the Web of Linked Data in Biomedicine.

RESULTS

QM is an open-sourced, publicly available web service that acts as a messaging system for posting tasks and retrieving results over HTTP. The illustrative application described here distributes the analyses of 20 Streptococcus pneumoniae genomes for shared suffixes. Because all analytical and data retrieval tasks are executed by volunteer machines, few server resources are required. Any modern web browser can submit those tasks and/or volunteer to execute them without installing any extra plugins or programs. A client library provides high-level distribution templates including MapReduce. This stark departure from the current reliance on expensive server hardware running "download and install" software has already gathered substantial community interest, as QM received more than 2.2 million API calls from 87 countries in 12 months.
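The messaging pattern described above (submitters post tasks, volunteers retrieve and execute them, and results are posted back) can be illustrated with a minimal in-memory sketch. This is not QM's actual API or client library: `TaskBoard`, `volunteer_once`, and `shared_suffix_mapreduce` are hypothetical names, the HTTP layer is replaced by direct method calls, and the pairwise longest-common-suffix computation on toy strings is only a stand-in for the genome analysis mentioned in the abstract.

```python
from collections import deque
from itertools import combinations


class TaskBoard:
    """Hypothetical stand-in for a message-board service.

    It only stores task payloads and results; it never executes a task,
    mirroring the idea that the server needs few resources of its own.
    """

    def __init__(self):
        self._queue = deque()   # ids of tasks waiting for a volunteer
        self._tasks = {}        # task id -> payload
        self._results = {}      # task id -> result
        self._next_id = 0

    def post_task(self, payload):
        task_id = self._next_id
        self._next_id += 1
        self._tasks[task_id] = payload
        self._queue.append(task_id)
        return task_id

    def claim_task(self):
        """Return (task_id, payload) for the oldest waiting task, or None."""
        if not self._queue:
            return None
        task_id = self._queue.popleft()
        return task_id, self._tasks[task_id]

    def post_result(self, task_id, result):
        self._results[task_id] = result

    def get_result(self, task_id):
        return self._results.get(task_id)


def longest_common_suffix(a, b):
    """Map step (toy): longest suffix shared by two sequences."""
    n = 0
    while n < min(len(a), len(b)) and a[-1 - n] == b[-1 - n]:
        n += 1
    return a[len(a) - n:]


def volunteer_once(board):
    """One volunteer iteration: claim a task, run it locally, post the result."""
    claimed = board.claim_task()
    if claimed is None:
        return False
    task_id, (a, b) = claimed
    board.post_result(task_id, longest_common_suffix(a, b))
    return True


def shared_suffix_mapreduce(board, sequences):
    """Submitter side: post one task per sequence pair, then reduce the results."""
    task_ids = [board.post_task(pair) for pair in combinations(sequences, 2)]
    # In a real deployment volunteers would run on other machines; here a
    # single volunteer loop drains the queue in the same process.
    while volunteer_once(board):
        pass
    results = [board.get_result(task_id) for task_id in task_ids]
    # Reduce step (toy): keep the longest suffix shared by any pair.
    return max(results, key=len)


board = TaskBoard()
longest = shared_suffix_mapreduce(board, ["GATTACA", "CCTACA", "TTTTACA"])
print(longest)
```

Note that the board itself performs no computation in this sketch; all work happens in `volunteer_once`, which is the property the abstract credits for the low server resource requirements.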

CONCLUSIONS

QM was found adequate to deliver the sort of scalable bioinformatics solutions that computation- and data-intensive workflows require. Paradoxically, the sandboxed execution of code by web browsers was also found to enable them, as compute nodes, to address critical privacy concerns that characterize biomedical environments.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fa5/4063228/056af01aaa5c/1471-2105-15-176-1.jpg
