STAAR workflow: a cloud-based workflow for scalable and reproducible rare variant analysis.

Authors

Gaynor Sheila M, Westerman Kenneth E, Ackovic Lea L, Li Xihao, Li Zilin, Manning Alisa K, Philippakis Anthony, Lin Xihong

Affiliations

Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, MA 02115, USA.

Department of Medicine, Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital, Boston, MA 02114, USA.

Publication information

Bioinformatics. 2022 May 26;38(11):3116-3117. doi: 10.1093/bioinformatics/btac272.

Abstract

SUMMARY

We developed the variant-Set Test for Association using Annotation infoRmation (STAAR) workflow description language (WDL) workflow to facilitate the analysis of rare variants in whole genome sequencing association studies. The open-access STAAR workflow written in the WDL allows a user to perform rare variant testing for both gene-centric and genetic region approaches, enabling genome-wide, candidate and conditional analyses. It incorporates functional annotations into the workflow as introduced in the STAAR method in order to boost the rare variant analysis power. This tool was specifically developed and optimized to be implemented on cloud-based platforms such as BioData Catalyst Powered by Terra. It provides easy-to-use functionality for rare variant analysis that can be incorporated into an exhaustive whole genome sequencing analysis pipeline.
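To illustrate how annotation information can enter a set-based test, the sketch below shows a Cauchy combination of p-values, the kind of rule the published STAAR method uses to merge tests computed under different annotation weightings into one omnibus p-value. It is a minimal sketch and not code from the STAAR workflow: the function name `cauchy_combine`, the equal default weights, and the example p-values are all assumptions made for illustration.

```python
import math


def cauchy_combine(p_values, weights=None):
    """Combine p-values with the Cauchy combination rule.

    Illustrative helper (hypothetical name, not part of the STAAR workflow).
    Each p-value is mapped to a Cauchy-distributed score, the scores are
    averaged with the given weights, and the average is mapped back to a
    single p-value.
    """
    if not p_values:
        raise ValueError("at least one p-value is required")
    if any(not (0.0 < p < 1.0) for p in p_values):
        raise ValueError("p-values must lie strictly between 0 and 1")
    if weights is None:
        # Assumption: equal weight for every annotation-specific test.
        weights = [1.0] * len(p_values)
    if len(weights) != len(p_values):
        raise ValueError("weights and p_values must have the same length")

    total = sum(weights)
    # Weighted average of tan((0.5 - p) * pi) over all tests.
    statistic = sum(
        (w / total) * math.tan((0.5 - p) * math.pi)
        for w, p in zip(weights, p_values)
    )
    # Upper tail of the standard Cauchy distribution at the statistic.
    return 0.5 - math.atan(statistic) / math.pi


# Hypothetical p-values for one gene, each from a test that weights
# variants by a different functional annotation.
annotation_p_values = [0.001, 0.5, 0.5]
combined = cauchy_combine(annotation_p_values)
print(f"combined p-value: {combined:.4f}")
```

A single strong signal among the annotation-specific tests dominates the combined result, which is why combining over annotations can raise power when only some annotations are informative for a given variant set.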

AVAILABILITY AND IMPLEMENTATION

The workflow is freely available from https://dockstore.org/workflows/github.com/sheilagaynor/STAAR_workflow.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
