• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CloudATAC:一个基于云的 ATAC-Seq 数据分析框架。

CloudATAC: a cloud-based framework for ATAC-Seq data analysis.

机构信息

University of Nebraska Medical Center, Omaha, NE 68105 USA.

Deloitte Consulting LLP, Health Data and AI Arlington, VA, USA.

出版信息

Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae090.

DOI:10.1093/bib/bbae090
PMID:39041910
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11264300/
Abstract

Assay for transposase-accessible chromatin with high-throughput sequencing (ATAC-seq) generates genome-wide chromatin accessibility profiles, providing valuable insights into epigenetic gene regulation at both pooled-cell and single-cell population levels. Comprehensive analysis of ATAC-seq data involves the use of various interdependent programs. Learning the correct sequence of steps needed to process the data can represent a major hurdle. Selecting appropriate parameters at each stage, including pre-analysis, core analysis, and advanced downstream analysis, is important to ensure accurate analysis and interpretation of ATAC-seq data. Additionally, obtaining and working within a limited computational environment presents a significant challenge to non-bioinformatic researchers. Therefore, we present Cloud ATAC, an open-source, cloud-based interactive framework with a scalable, flexible, and streamlined analysis framework based on the best practices approach for pooled-cell and single-cell ATAC-seq data. These frameworks use on-demand computational power and memory, scalability, and a secure and compliant environment provided by the Google Cloud. Additionally, we leverage Jupyter Notebook's interactive computing platform that combines live code, tutorials, narrative text, flashcards, quizzes, and custom visualizations to enhance learning and analysis. Further, leveraging GPU instances has significantly improved the run-time of the single-cell framework. The source codes and data are publicly available through NIH Cloud lab https://github.com/NIGMS/ATAC-Seq-and-Single-Cell-ATAC-Seq-Analysis. This manuscript describes the development of a resource module that is part of a learning platform named ``NIGMS Sandbox for Cloud-based Learning'' https://github.com/NIGMS/NIGMS-Sandbox. The overall genesis of the Sandbox is described in the editorial NIGMS Sandbox [1] at the beginning of this Supplement. This module delivers learning materials on the analysis of bulk and single-cell ATAC-seq data in an interactive format that uses appropriate cloud resources for data access and analyses.

摘要

利用高通量测序进行转座酶可及染色质分析(ATAC-seq)可生成全基因组染色质可及性图谱,为 pooled-cell 和单细胞群体水平的表观基因调控提供有价值的见解。全面分析 ATAC-seq 数据涉及使用各种相互依赖的程序。学习处理数据所需的正确步骤序列可能是一个主要障碍。在每个阶段选择适当的参数,包括预分析、核心分析和高级下游分析,对于确保 ATAC-seq 数据的准确分析和解释非常重要。此外,对于非生物信息学研究人员来说,在有限的计算环境中获取和工作也是一个重大挑战。因此,我们提出了 Cloud ATAC,这是一个开源的、基于云的交互式框架,具有基于最佳实践方法的可扩展、灵活和简化的分析框架,适用于 pooled-cell 和单细胞 ATAC-seq 数据。这些框架利用按需计算能力和内存、可扩展性以及由 Google Cloud 提供的安全合规环境。此外,我们利用了 Jupyter Notebook 的交互式计算平台,该平台结合了实时代码、教程、叙述文本、抽认卡、测验和自定义可视化,以增强学习和分析。此外,利用 GPU 实例显著提高了单细胞框架的运行时间。源代码和数据可通过 NIH Cloud lab https://github.com/NIGMS/ATAC-Seq-and-Single-Cell-ATAC-Seq-Analysis 公开获取。本文档描述了开发一个资源模块的过程,该模块是名为“基于云的学习的 NIGMS 沙盒”的学习平台的一部分,网址为 https://github.com/NIGMS/NIGMS-Sandbox。沙盒的总体起源在本增刊开头的社论 NIGMS Sandbox [1] 中进行了描述。该模块以交互式格式提供有关 bulk 和单细胞 ATAC-seq 数据分析的学习材料,该格式使用适当的云资源进行数据访问和分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/1a718a2b44d1/bbae090f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/a517ee2b77e5/bbae090f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/10e40cecaea0/bbae090f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/e79fe1fd7b92/bbae090f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/34d21ee546a9/bbae090f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/471cb87ff6a9/bbae090f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/1a718a2b44d1/bbae090f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/a517ee2b77e5/bbae090f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/10e40cecaea0/bbae090f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/e79fe1fd7b92/bbae090f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/34d21ee546a9/bbae090f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/471cb87ff6a9/bbae090f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95ac/11264300/1a718a2b44d1/bbae090f6.jpg

相似文献

1
CloudATAC: a cloud-based framework for ATAC-Seq data analysis.CloudATAC:一个基于云的 ATAC-Seq 数据分析框架。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae090.
2
Cloud-based introduction to BASH programming for biologists.基于云的生物学 BASH 编程入门。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae244.
3
Whole-genome bisulfite sequencing data analysis learning module on Google Cloud Platform.基于谷歌云平台的全基因组亚硫酸氢盐测序数据分析学习模块。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae236.
4
CCPA: cloud-based, self-learning modules for consensus pathway analysis using GO, KEGG and Reactome.CCPA:基于云的、使用 GO、KEGG 和 Reactome 的自学习共识途径分析模块。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae222.
5
Understanding proteome quantification in an interactive learning module on Google Cloud Platform.理解蛋白质组定量在 Google Cloud Platform 上的交互式学习模块中的应用。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae235.
6
A cloud-based learning module for biomarker discovery.基于云的生物标志物发现学习模块。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae126.
7
Transcriptomics and epigenetic data integration learning module on Google Cloud.转录组学和表观遗传学数据集成学习模块在谷歌云上。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae352.
8
Identifying and training deep learning neural networks on biomedical-related datasets.在生物医学相关数据集上识别和训练深度学习神经网络。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae232.
9
Biofilm marker discovery with cloud-based dockerized metagenomics analysis of microbial communities.基于云的微生物群落宏基因组分析的生物膜标志物发现。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae429.
10
Reusable tutorials for using cloud-based computing environments for the analysis of bacterial gene expression data from bulk RNA sequencing.可重复使用的教程,用于使用基于云的计算环境分析来自批量 RNA 测序的细菌基因表达数据。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae301.

引用本文的文献

1
NIGMS Sandbox: a learning platform toward democratizing cloud computing for biomedical research.NIGMS 沙盒:一个学习平台,旨在为生物医学研究普及云计算。
Brief Bioinform. 2024 Jul 23;25(Supplement_1). doi: 10.1093/bib/bbae478.

本文引用的文献

1
Accelerating genomic workflows using NVIDIA Parabricks.利用 NVIDIA Parabricks 加速基因组工作流程。
BMC Bioinformatics. 2023 May 31;24(1):221. doi: 10.1186/s12859-023-05292-2.
2
Subtle cell states resolved in single-cell data.单细胞数据中解析出的细微细胞状态。
Nat Biotechnol. 2023 Dec;41(12):1690-1691. doi: 10.1038/s41587-023-01797-6.
3
scATACpipe: A nextflow pipeline for comprehensive and reproducible analyses of single cell ATAC-seq data.scATACpipe:用于单细胞ATAC测序数据全面且可重复分析的Nextflow工作流程。
Front Cell Dev Biol. 2022 Sep 27;10:981859. doi: 10.3389/fcell.2022.981859. eCollection 2022.
4
Joint analysis of scATAC-seq datasets using epiConv.使用 epiConv 对 scATAC-seq 数据集进行联合分析。
BMC Bioinformatics. 2022 Jul 29;23(1):309. doi: 10.1186/s12859-022-04858-w.
5
Cloud Computing Enabled Big Multi-Omics Data Analytics.基于云计算的大型多组学数据分析
Bioinform Biol Insights. 2021 Jul 28;15:11779322211035921. doi: 10.1177/11779322211035921. eCollection 2021.
6
Comprehensive analysis of single cell ATAC-seq data with SnapATAC.利用 SnapATAC 对单细胞 ATAC-seq 数据进行全面分析。
Nat Commun. 2021 Feb 26;12(1):1337. doi: 10.1038/s41467-021-21583-9.
7
ATAC-seq footprinting unravels kinetics of transcription factor binding during zygotic genome activation.转座酶可及性染色质测序足迹分析揭示了合子基因组激活过程中转录因子结合的动力学。
Nat Commun. 2020 Aug 26;11(1):4267. doi: 10.1038/s41467-020-18035-1.
8
Single-cell ATAC-seq signal extraction and enhancement with SCATE.利用 SCATE 进行单细胞 ATAC-seq 信号提取和增强。
Genome Biol. 2020 Jul 3;21(1):161. doi: 10.1186/s13059-020-02075-3.
9
Droplet-based combinatorial indexing for massive-scale single-cell chromatin accessibility.基于液滴的组合索引技术用于大规模单细胞染色质可及性分析。
Nat Biotechnol. 2019 Aug;37(8):916-924. doi: 10.1038/s41587-019-0147-6. Epub 2019 Jun 24.
10
Bioconda: sustainable and comprehensive software distribution for the life sciences.生物conda:面向生命科学的可持续且全面的软件发行平台。
Nat Methods. 2018 Jul;15(7):475-476. doi: 10.1038/s41592-018-0046-7.