• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RACS:基于连续基因组的 ChIP-Seq 数据的快速分析。

RACS: rapid analysis of ChIP-Seq data for contig based genomes.

机构信息

Department of Chemistry and Biology, Ryerson University, 350 Victoria St, Toronto, M5B 2K3, Canada.

SciNet High Performance Computing Consortium, University of Toronto, 661 University Ave, Toronto, M5G 1M1, Canada.

出版信息

BMC Bioinformatics. 2019 Oct 29;20(1):533. doi: 10.1186/s12859-019-3100-2.

DOI:10.1186/s12859-019-3100-2
PMID:31664892
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6819487/
Abstract

BACKGROUND

Chromatin immunoprecipitation coupled to next generation sequencing (ChIP-Seq) is a widely-used molecular method to investigate the function of chromatin-related proteins by identifying their associated DNA sequences on a genomic scale. ChIP-Seq generates large quantities of data that is difficult to process and analyze, particularly for organisms with a contig-based sequenced genomes that typically have minimal annotation on their associated set of genes other than their associated coordinates primarily predicted by gene finding programs. Poorly annotated genome sequence makes comprehensive analysis of ChIP-Seq data difficult and as such standardized analysis pipelines are lacking.

RESULTS

We present a one-stop computational pipeline, "Rapid Analysis of ChIP-Seq data" (RACS), that utilizes traditional High-Performance Computing (HPC) techniques in association with open source tools for processing and analyzing raw ChIP-Seq data. RACS is an open source computational pipeline available from any of the following repositories https://bitbucket.org/mjponce/RACS or https://gitrepos.scinet.utoronto.ca/public/?a=summary&p=RACS . RACS is particularly useful for ChIP-Seq in organisms with contig-based genomes that have poor gene annotation to aid protein function discovery.To test the performance and efficiency of RACS, we analyzed ChIP-Seq data previously published in a model organism Tetrahymena thermophila which has a contig-based genome. We assessed the generality of RACS by analyzing a previously published data set generated using the model organism Oxytricha trifallax, whose genome sequence is also contig-based with poor annotation.

CONCLUSIONS

The RACS computational pipeline presented in this report is an efficient and reliable tool to analyze genome-wide raw ChIP-Seq data generated in model organisms with poorly annotated contig-based genome sequence. Because RACS segregates the found read accumulations between genic and intergenic regions, it is particularly efficient for rapid downstream analyses of proteins involved in gene expression.

摘要

背景

染色质免疫沉淀结合下一代测序(ChIP-Seq)是一种广泛使用的分子方法,通过在基因组范围内鉴定染色质相关蛋白的相关 DNA 序列来研究染色质相关蛋白的功能。ChIP-Seq 会生成大量的数据,这些数据很难处理和分析,尤其是对于基于连续体的测序基因组的生物体,这些生物体除了主要由基因发现程序预测的相关坐标之外,很少对其相关基因集进行注释。较差的基因组序列注释使得对 ChIP-Seq 数据的全面分析变得困难,因此缺乏标准化的分析流程。

结果

我们提出了一个一站式计算流程“Rapid Analysis of ChIP-Seq data”(RACS),该流程利用传统的高性能计算(HPC)技术,结合开源工具来处理和分析原始 ChIP-Seq 数据。RACS 是一个开源计算流程,可从以下任何一个存储库中获得:https://bitbucket.org/mjponce/RACS 或 https://gitrepos.scinet.utoronto.ca/public/?a=summary&p=RACS。RACS 特别适用于基于连续体的基因组的生物体的 ChIP-Seq,这些生物体的基因注释较差,有助于蛋白质功能的发现。为了测试 RACS 的性能和效率,我们分析了之前在 Tetrahymena thermophila 模型生物中发表的 ChIP-Seq 数据,该生物具有基于连续体的基因组。我们通过分析之前使用基于连续体的基因组和较差注释的模型生物 Oxytricha trifallax 生成的已发表数据集来评估 RACS 的通用性。

结论

本报告中提出的 RACS 计算流程是一种高效可靠的工具,可用于分析基于连续体的基因组序列注释较差的模型生物中产生的全基因组原始 ChIP-Seq 数据。由于 RACS 将发现的读取积累在基因和非基因区域之间进行划分,因此它特别适合快速进行涉及基因表达的蛋白质的下游分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/172180141c70/12859_2019_3100_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/c1a740f5db02/12859_2019_3100_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/d7fd045058de/12859_2019_3100_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/4bb21477609a/12859_2019_3100_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/e5551c4a1c10/12859_2019_3100_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/980a60730c71/12859_2019_3100_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/172180141c70/12859_2019_3100_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/c1a740f5db02/12859_2019_3100_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/d7fd045058de/12859_2019_3100_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/4bb21477609a/12859_2019_3100_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/e5551c4a1c10/12859_2019_3100_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/980a60730c71/12859_2019_3100_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/313c/6819487/172180141c70/12859_2019_3100_Fig6_HTML.jpg

相似文献

1
RACS: rapid analysis of ChIP-Seq data for contig based genomes.RACS:基于连续基因组的 ChIP-Seq 数据的快速分析。
BMC Bioinformatics. 2019 Oct 29;20(1):533. doi: 10.1186/s12859-019-3100-2.
2
HiChIP: a high-throughput pipeline for integrative analysis of ChIP-Seq data.HiChIP:一种用于 ChIP-Seq 数据综合分析的高通量管道。
BMC Bioinformatics. 2014 Aug 15;15(1):280. doi: 10.1186/1471-2105-15-280.
3
CATCH-UP: A High-Throughput Upstream-Pipeline for Bulk ATAC-Seq and ChIP-Seq Data.CATCH-UP:一种用于批量 ATAC-Seq 和 ChIP-Seq 数据的高通量上游管道。
J Vis Exp. 2023 Sep 22(199). doi: 10.3791/65633.
4
ChIP-Atlas 3.0: a data-mining suite to explore chromosome architecture together with large-scale regulome data.ChIP-Atlas 3.0:一个数据挖掘套件,用于探索染色体结构以及大规模调控组数据。
Nucleic Acids Res. 2024 Jul 5;52(W1):W45-W53. doi: 10.1093/nar/gkae358.
5
CIPHER: a flexible and extensive workflow platform for integrative next-generation sequencing data analysis and genomic regulatory element prediction.CIPHER:一个用于整合下一代测序数据分析和基因组调控元件预测的灵活且功能广泛的工作流程平台。
BMC Bioinformatics. 2017 Aug 8;18(1):363. doi: 10.1186/s12859-017-1770-1.
6
CoBRA: Containerized Bioinformatics Workflow for Reproducible ChIP/ATAC-seq Analysis.CoBRA:用于可重复 ChIP/ATAC-seq 分析的集装箱化生物信息学工作流程。
Genomics Proteomics Bioinformatics. 2021 Aug;19(4):652-661. doi: 10.1016/j.gpb.2020.11.007. Epub 2021 Jul 18.
7
SEAseq: a portable and cloud-based chromatin occupancy analysis suite.SEAseq:一款可移植的基于云的染色质占有率分析套件。
BMC Bioinformatics. 2022 Feb 23;23(1):77. doi: 10.1186/s12859-022-04588-z.
8
BAMscale: quantification of next-generation sequencing peaks and generation of scaled coverage tracks.BAMscale:下一代测序峰的定量分析和缩放覆盖轨道的生成。
Epigenetics Chromatin. 2020 Apr 22;13(1):21. doi: 10.1186/s13072-020-00343-x.
9
ChiLin: a comprehensive ChIP-seq and DNase-seq quality control and analysis pipeline.麒麟:一个全面的染色质免疫沉淀测序(ChIP-seq)和DNA酶超敏感位点测序(DNase-seq)质量控制与分析流程。
BMC Bioinformatics. 2016 Oct 3;17(1):404. doi: 10.1186/s12859-016-1274-4.
10
Efficient yeast ChIP-Seq using multiplex short-read DNA sequencing.使用多重短读长DNA测序进行高效酵母染色质免疫沉淀测序(ChIP-Seq)
BMC Genomics. 2009 Jan 21;10:37. doi: 10.1186/1471-2164-10-37.

引用本文的文献

1
Bromodomain proteins IBD1 and IBD2 link histone acetylation to SWR1- and INO80-mediated H2A.Z regulation in Tetrahymena.溴结构域蛋白IBD1和IBD2将组蛋白乙酰化与四膜虫中SWR1和INO80介导的H2A.Z调控联系起来。
Epigenetics Chromatin. 2025 Aug 6;18(1):51. doi: 10.1186/s13072-025-00614-5.
2
Wastewater-based epidemiology in hazard forecasting and early-warning systems for global health risks.基于污水的流行病学在全球健康风险的危害预测和预警系统中的应用。
Environ Int. 2022 Mar;161:107143. doi: 10.1016/j.envint.2022.107143. Epub 2022 Feb 14.
3
Functional characterization of RebL1 highlights the evolutionary conservation of oncogenic activities of the RBBP4/7 orthologue in Tetrahymena thermophila.

本文引用的文献

1
The Med31 Conserved Component of the Divergent Mediator Complex in Tetrahymena thermophila Participates in Developmental Regulation.嗜热四膜虫分歧的中介复合物中的 Med31 保守成分参与发育调控。
Curr Biol. 2019 Jul 22;29(14):2371-2379.e6. doi: 10.1016/j.cub.2019.06.052. Epub 2019 Jul 4.
2
Nanopore Sequencing Significantly Improves Genome Assembly of the Protozoan Parasite Trypanosoma cruzi.纳米孔测序显著提高原生动物寄生虫克氏锥虫的基因组组装。
Genome Biol Evol. 2019 Jul 1;11(7):1952-1957. doi: 10.1093/gbe/evz129.
3
Proteomic Analysis of Histones H2A/H2B and Variant Hv1 in Tetrahymena thermophila Reveals an Ancient Network of Chaperones.
功能表征 RebL1 突出了 Tetrahymena thermophila 中 RBBP4/7 同源物致癌活性的进化保守性。
Nucleic Acids Res. 2021 Jun 21;49(11):6196-6212. doi: 10.1093/nar/gkab413.
4
Exploring the Histone Acetylation Cycle in the Protozoan Model .探索原生动物模型中的组蛋白乙酰化循环
Front Cell Dev Biol. 2020 Jun 30;8:509. doi: 10.3389/fcell.2020.00509. eCollection 2020.
组蛋白 H2A/H2B 和 Tetrahymena thermophila 中的变体 Hv1 的蛋白质组分析揭示了古老的伴侣蛋白网络。
Mol Biol Evol. 2019 May 1;36(5):1037-1055. doi: 10.1093/molbev/msz039.
4
An accurate and rapid continuous wavelet dynamic time warping algorithm for end-to-end mapping in ultra-long nanopore sequencing.一种用于超长纳米孔测序中端到端映射的精确快速连续小波动态时间规整算法。
Bioinformatics. 2018 Sep 1;34(17):i722-i731. doi: 10.1093/bioinformatics/bty555.
5
DeepSimulator: a deep simulator for Nanopore sequencing.深模拟器:一种用于纳米孔测序的深度模拟器。
Bioinformatics. 2018 Sep 1;34(17):2899-2908. doi: 10.1093/bioinformatics/bty223.
6
Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning.奇龙:利用深度学习将纳米孔原始信号直接转换为核苷酸序列。
Gigascience. 2018 May 1;7(5). doi: 10.1093/gigascience/giy037.
7
The bromodomain-containing protein Ibd1 links multiple chromatin-related protein complexes to highly expressed genes in Tetrahymena thermophila.溴结构域蛋白 Ibd1 将多个与染色质相关的蛋白复合物连接到嗜热四膜虫中高度表达的基因上。
Epigenetics Chromatin. 2018 Mar 9;11(1):10. doi: 10.1186/s13072-018-0180-6.
8
N6-adenine DNA methylation is associated with the linker DNA of H2A.Z-containing well-positioned nucleosomes in Pol II-transcribed genes in Tetrahymena.N6-腺嘌呤DNA甲基化与嗜热四膜虫中Pol II转录基因中含H2A.Z的定位良好的核小体的连接DNA相关。
Nucleic Acids Res. 2017 Nov 16;45(20):11594-11606. doi: 10.1093/nar/gkx883.
9
The potential impact of nanopore sequencing on human genetics.纳米孔测序对人类遗传学的潜在影响。
Hum Mol Genet. 2017 Oct 1;26(R2):R202-R207. doi: 10.1093/hmg/ddx287.
10
Nanopore long-read RNAseq reveals widespread transcriptional variation among the surface receptors of individual B cells.纳米孔长读 RNA 测序揭示了个体 B 细胞表面受体之间广泛的转录变异性。
Nat Commun. 2017 Jul 19;8:16027. doi: 10.1038/ncomms16027.