• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Watchdog - 一种用于大规模实验数据分析的分布式工作流管理系统。

Watchdog - a workflow management system for the distributed analysis of large-scale experimental data.

机构信息

Institute for Informatics, Ludwig-Maximilians-Universität München, Amalienstraße 17, München, 80333, Germany.

出版信息

BMC Bioinformatics. 2018 Mar 13;19(1):97. doi: 10.1186/s12859-018-2107-4.

DOI:10.1186/s12859-018-2107-4
PMID:29534677
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5850912/
Abstract

BACKGROUND

The development of high-throughput experimental technologies, such as next-generation sequencing, have led to new challenges for handling, analyzing and integrating the resulting large and diverse datasets. Bioinformatical analysis of these data commonly requires a number of mutually dependent steps applied to numerous samples for multiple conditions and replicates. To support these analyses, a number of workflow management systems (WMSs) have been developed to allow automated execution of corresponding analysis workflows. Major advantages of WMSs are the easy reproducibility of results as well as the reusability of workflows or their components.

RESULTS

In this article, we present Watchdog, a WMS for the automated analysis of large-scale experimental data. Main features include straightforward processing of replicate data, support for distributed computer systems, customizable error detection and manual intervention into workflow execution. Watchdog is implemented in Java and thus platform-independent and allows easy sharing of workflows and corresponding program modules. It provides a graphical user interface (GUI) for workflow construction using pre-defined modules as well as a helper script for creating new module definitions. Execution of workflows is possible using either the GUI or a command-line interface and a web-interface is provided for monitoring the execution status and intervening in case of errors. To illustrate its potentials on a real-life example, a comprehensive workflow and modules for the analysis of RNA-seq experiments were implemented and are provided with the software in addition to simple test examples.

CONCLUSIONS

Watchdog is a powerful and flexible WMS for the analysis of large-scale high-throughput experiments. We believe it will greatly benefit both users with and without programming skills who want to develop and apply bioinformatical workflows with reasonable overhead. The software, example workflows and a comprehensive documentation are freely available at www.bio.ifi.lmu.de/watchdog.

摘要

背景

高通量实验技术的发展,如下一代测序,为处理、分析和整合由此产生的大量和多样化数据集带来了新的挑战。这些数据的生物信息学分析通常需要应用于许多样本的多个条件和重复的许多相互依赖的步骤。为了支持这些分析,已经开发了许多工作流管理系统(WMS),以允许自动执行相应的分析工作流。WMS 的主要优点是结果的易于重现性以及工作流或其组件的可重用性。

结果

在本文中,我们介绍了 Watchdog,这是一种用于自动化分析大规模实验数据的 WMS。主要功能包括简单地处理重复数据、支持分布式计算机系统、可定制的错误检测以及对工作流执行的手动干预。Watchdog 是用 Java 实现的,因此与平台无关,并允许轻松共享工作流及其相应的程序模块。它提供了一个使用预定义模块构建工作流的图形用户界面(GUI),以及一个用于创建新模块定义的助手脚本。可以使用 GUI 或命令行界面执行工作流,并且提供了一个 Web 界面来监控执行状态并在发生错误时进行干预。为了在实际示例上说明其潜力,实现了一个综合的工作流和用于分析 RNA-seq 实验的模块,并随软件一起提供了简单的测试示例。

结论

Watchdog 是一种用于大规模高通量实验分析的强大而灵活的 WMS。我们相信,它将极大地受益于有和没有编程技能的用户,他们希望开发和应用具有合理开销的生物信息学工作流。软件、示例工作流和全面的文档可在 www.bio.ifi.lmu.de/watchdog 上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/66c98c228563/12859_2018_2107_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/1effc36f18cb/12859_2018_2107_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/dbd65d668c3f/12859_2018_2107_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/ea85549c6b17/12859_2018_2107_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/8f99306c41c2/12859_2018_2107_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/d36c78182a30/12859_2018_2107_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/d9d9efd9becf/12859_2018_2107_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/65ffac730ba2/12859_2018_2107_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/1fb29269d317/12859_2018_2107_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/1ba5c6e4d020/12859_2018_2107_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/0d687dc1ad1d/12859_2018_2107_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/5b1d94520840/12859_2018_2107_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/66c98c228563/12859_2018_2107_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/1effc36f18cb/12859_2018_2107_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/dbd65d668c3f/12859_2018_2107_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/ea85549c6b17/12859_2018_2107_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/8f99306c41c2/12859_2018_2107_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/d36c78182a30/12859_2018_2107_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/d9d9efd9becf/12859_2018_2107_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/65ffac730ba2/12859_2018_2107_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/1fb29269d317/12859_2018_2107_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/1ba5c6e4d020/12859_2018_2107_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/0d687dc1ad1d/12859_2018_2107_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/5b1d94520840/12859_2018_2107_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a76d/5850912/66c98c228563/12859_2018_2107_Fig12_HTML.jpg

相似文献

1
Watchdog - a workflow management system for the distributed analysis of large-scale experimental data.Watchdog - 一种用于大规模实验数据分析的分布式工作流管理系统。
BMC Bioinformatics. 2018 Mar 13;19(1):97. doi: 10.1186/s12859-018-2107-4.
2
Watchdog 2.0: New developments for reusability, reproducibility, and workflow execution.Watchdog 2.0:可重用性、可重现性和工作流程执行的新发展。
Gigascience. 2020 Jun 1;9(6). doi: 10.1093/gigascience/giaa068.
3
CANEapp: a user-friendly application for automated next generation transcriptomic data analysis.CANEapp:一款用于自动化下一代转录组数据分析的用户友好型应用程序。
BMC Genomics. 2016 Jan 13;17:49. doi: 10.1186/s12864-015-2346-y.
4
systemPipeR: NGS workflow and report generation environment.systemPipeR:二代测序工作流程与报告生成环境。
BMC Bioinformatics. 2016 Sep 20;17:388. doi: 10.1186/s12859-016-1241-0.
5
Biowep: a workflow enactment portal for bioinformatics applications.生物工作流引擎(Biowep):一个用于生物信息学应用的工作流制定门户。
BMC Bioinformatics. 2007 Mar 8;8 Suppl 1(Suppl 1):S19. doi: 10.1186/1471-2105-8-S1-S19.
6
DolphinNext: a distributed data processing platform for high throughput genomics.海豚下一代:一个用于高通量基因组学的分布式数据处理平台。
BMC Genomics. 2020 Apr 19;21(1):310. doi: 10.1186/s12864-020-6714-x.
7
SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.SPARTA:用于基于参考的细菌RNA测序转录组自动分析的简单程序。
BMC Bioinformatics. 2016 Feb 4;17:66. doi: 10.1186/s12859-016-0923-y.
8
Reusable, extensible, and modifiable R scripts and Kepler workflows for comprehensive single set ChIP-seq analysis.用于全面单组ChIP-seq分析的可重复使用、可扩展且可修改的R脚本和开普勒工作流程。
BMC Bioinformatics. 2016 Jul 5;17(1):270. doi: 10.1186/s12859-016-1125-3.
9
BioWMS: a web-based Workflow Management System for bioinformatics.BioWMS:一种用于生物信息学的基于网络的工作流管理系统。
BMC Bioinformatics. 2007 Mar 8;8 Suppl 1(Suppl 1):S2. doi: 10.1186/1471-2105-8-S1-S2.
10
qPortal: A platform for data-driven biomedical research.qPortal:一个用于数据驱动型生物医学研究的平台。
PLoS One. 2018 Jan 19;13(1):e0191603. doi: 10.1371/journal.pone.0191603. eCollection 2018.

引用本文的文献

1
A More Rapid Method for Culturing LUHMES-Derived Neurons Provides Greater Cell Numbers and Facilitates Studies of Multiple Viruses.一种用于培养源自LUHMES的神经元的更快方法可提供更多细胞数量并促进多种病毒的研究。
Viruses. 2025 Jul 16;17(7):1001. doi: 10.3390/v17071001.
2
HSV-1 infection induces a downstream shift of the +1 nucleosome.单纯疱疹病毒1型感染会导致+1核小体发生下游移位。
J Virol. 2025 Apr 15;99(4):e0208624. doi: 10.1128/jvi.02086-24. Epub 2025 Mar 25.
3
The HSV-1 ICP22 protein selectively impairs histone repositioning upon Pol II transcription downstream of genes.

本文引用的文献

1
Transcriptome analysis of dominant-negative Brd4 mutants identifies Brd4-specific target genes of small molecule inhibitor JQ1.转录组分析显性负性 Brd4 突变体鉴定小分子抑制剂 JQ1 的 Brd4 特异性靶基因。
Sci Rep. 2017 May 10;7(1):1684. doi: 10.1038/s41598-017-01943-6.
2
Rapid Genome-wide Recruitment of RNA Polymerase II Drives Transcription, Splicing, and Translation Events during T Cell Responses.RNA聚合酶II的全基因组快速募集驱动T细胞反应过程中的转录、剪接和翻译事件。
Cell Rep. 2017 Apr 18;19(3):643-654. doi: 10.1016/j.celrep.2017.03.069.
3
Nextflow enables reproducible computational workflows.
单纯疱疹病毒 1 型 ICP22 蛋白选择性地损害 Pol II 转录下游基因处的组蛋白重定位。
Nat Commun. 2023 Jul 31;14(1):4591. doi: 10.1038/s41467-023-40217-w.
4
RegCFinder: targeted discovery of genomic subregions with differential read density.RegCFinder:靶向发现具有差异读取密度的基因组亚区域。
Bioinform Adv. 2023 Jul 4;3(1):vbad085. doi: 10.1093/bioadv/vbad085. eCollection 2023.
5
HSV-1 Infection Induces a Downstream Shift of Promoter-Proximal Pausing for Host Genes.单纯疱疹病毒 1 感染诱导宿主基因启动子近端暂停的下游转移。
J Virol. 2023 May 31;97(5):e0038123. doi: 10.1128/jvi.00381-23. Epub 2023 Apr 24.
6
P-TEFb promotes cell survival upon p53 activation by suppressing intrinsic apoptosis pathway.P-TEFb 通过抑制内在凋亡途径促进 p53 激活后的细胞存活。
Nucleic Acids Res. 2023 Feb 28;51(4):1687-1706. doi: 10.1093/nar/gkad001.
7
RelB contributes to the survival, migration and lymphomagenesis of B cells with constitutively active CD40 signaling.RelB 有助于持续激活 CD40 信号的 B 细胞的存活、迁移和淋巴瘤发生。
Front Immunol. 2022 Aug 30;13:913275. doi: 10.3389/fimmu.2022.913275. eCollection 2022.
8
nf-rnaSeqCount: A Nextflow pipeline for obtaining raw read counts from RNA-seq data.nf-rnaSeqCount:一个用于从RNA测序数据中获取原始读取计数的Nextflow管道。
S Afr Comput J. 2021 Dec;33(2). doi: 10.18489/sacj.v33i2.830. Epub 2021 Dec 20.
9
Impact of concurrency on the performance of a whole exome sequencing pipeline.并发对全外显子组测序流程性能的影响。
BMC Bioinformatics. 2021 Feb 9;22(1):60. doi: 10.1186/s12859-020-03780-3.
10
Watchdog 2.0: New developments for reusability, reproducibility, and workflow execution.Watchdog 2.0:可重用性、可重现性和工作流程执行的新发展。
Gigascience. 2020 Jun 1;9(6). doi: 10.1093/gigascience/giaa068.
Nextflow支持可重复的计算工作流程。
Nat Biotechnol. 2017 Apr 11;35(4):316-319. doi: 10.1038/nbt.3820.
4
Widespread disruption of host transcription termination in HSV-1 infection.单纯疱疹病毒1型(HSV-1)感染中宿主转录终止的广泛破坏。
Nat Commun. 2015 May 20;6:7126. doi: 10.1038/ncomms8126.
5
ContextMap 2: fast and accurate context-based RNA-seq mapping.上下文映射2:基于上下文的快速准确的RNA测序映射。
BMC Bioinformatics. 2015 Apr 17;16:122. doi: 10.1186/s12859-015-0557-5.
6
limma powers differential expression analyses for RNA-sequencing and microarray studies.limma为RNA测序和微阵列研究提供差异表达分析的动力。
Nucleic Acids Res. 2015 Apr 20;43(7):e47. doi: 10.1093/nar/gkv007. Epub 2015 Jan 20.
7
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.使用DESeq2对RNA测序数据的倍数变化和离散度进行适度估计。
Genome Biol. 2014;15(12):550. doi: 10.1186/s13059-014-0550-8.
8
Ribosome profiling: new views of translation, from single codons to genome scale.核糖体图谱分析:从单个密码子到全基因组水平看翻译的新视角。
Nat Rev Genet. 2014 Mar;15(3):205-13. doi: 10.1038/nrg3645. Epub 2014 Jan 28.
9
MAKER-P: a tool kit for the rapid creation, management, and quality control of plant genome annotations.MAKER-P:一个用于快速创建、管理和质量控制植物基因组注释的工具包。
Plant Physiol. 2014 Feb;164(2):513-24. doi: 10.1104/pp.113.230144. Epub 2013 Dec 4.
10
featureCounts: an efficient general purpose program for assigning sequence reads to genomic features.featureCounts:一个用于将序列读取分配给基因组特征的高效通用程序。
Bioinformatics. 2014 Apr 1;30(7):923-30. doi: 10.1093/bioinformatics/btt656. Epub 2013 Nov 13.