Suppr超能文献

使用 Galaxy 进行大规模交互式数据分析——更新。

Using Galaxy to Perform Large-Scale Interactive Data Analyses-An Update.

机构信息

Johns Hopkins University, Baltimore, Maryland.

Penn State University, University Park, Pennsylvania.

出版信息

Curr Protoc. 2021 Feb;1(2):e31. doi: 10.1002/cpz1.31.

Abstract

Modern biology continues to become increasingly computational. Datasets are becoming progressively larger, more complex, and more abundant. The computational savviness necessary to analyze these data creates an ongoing obstacle for experimental biologists. Galaxy (galaxyproject.org) provides access to computational biology tools in a web-based interface. It also provides access to major public biological data repositories, allowing private data to be combined with public datasets. Galaxy is hosted on high-capacity servers worldwide and is accessible for free, with an option to be installed locally. This article demonstrates how to employ Galaxy to perform biologically relevant analyses on publicly available datasets. These protocols use both standard and custom tools, serving as a tutorial and jumping-off point for more intensive and/or more specific analyses using Galaxy. © 2021 Wiley Periodicals LLC. Basic Protocol 1: Finding human coding exons with highest SNP density Basic Protocol 2: Calling peaks for ChIP-seq data Basic Protocol 3: Compare datasets using genomic coordinates Basic Protocol 4: Working with multiple alignments Basic Protocol 5: Single cell RNA-seq.

摘要

现代生物学继续变得越来越计算化。数据集变得越来越大、越来越复杂、越来越丰富。分析这些数据所需的计算能力对实验生物学家来说是一个持续的障碍。Galaxy(galaxyproject.org)在基于网络的界面中提供了对计算生物学工具的访问。它还提供了对主要公共生物数据存储库的访问,允许将私人数据与公共数据集相结合。Galaxy 托管在全球具有高容量服务器上,并可免费访问,也可以选择在本地安装。本文演示了如何使用 Galaxy 对公共可用数据集执行与生物学相关的分析。这些方案使用标准和自定义工具,作为使用 Galaxy 进行更密集和/或更具体分析的教程和起点。© 2021 Wiley Periodicals LLC. 基础方案 1:查找 SNP 密度最高的人类编码外显子 基础方案 2:对 ChIP-seq 数据进行峰调用 基础方案 3:使用基因组坐标比较数据集 基础方案 4:使用多序列比对 基础方案 5:单细胞 RNA-seq。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验