• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

STAAR工作流程:一种用于可扩展和可重复的罕见变异分析的基于云的工作流程。

STAAR workflow: a cloud-based workflow for scalable and reproducible rare variant analysis.

作者信息

Gaynor Sheila M, Westerman Kenneth E, Ackovic Lea L, Li Xihao, Li Zilin, Manning Alisa K, Philippakis Anthony, Lin Xihong

机构信息

Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, MA 02115, USA.

Department of Medicine, Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital, Boston, MA 02114, USA.

出版信息

Bioinformatics. 2022 May 26;38(11):3116-3117. doi: 10.1093/bioinformatics/btac272.

DOI:10.1093/bioinformatics/btac272
PMID:35441669
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9991895/
Abstract

SUMMARY

We developed the variant-Set Test for Association using Annotation infoRmation (STAAR) workflow description language (WDL) workflow to facilitate the analysis of rare variants in whole genome sequencing association studies. The open-access STAAR workflow written in the WDL allows a user to perform rare variant testing for both gene-centric and genetic region approaches, enabling genome-wide, candidate and conditional analyses. It incorporates functional annotations into the workflow as introduced in the STAAR method in order to boost the rare variant analysis power. This tool was specifically developed and optimized to be implemented on cloud-based platforms such as BioData Catalyst Powered by Terra. It provides easy-to-use functionality for rare variant analysis that can be incorporated into an exhaustive whole genome sequencing analysis pipeline.

AVAILABILITY AND IMPLEMENTATION

The workflow is freely available from https://dockstore.org/workflows/github.com/sheilagaynor/STAAR_workflow.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

摘要

我们开发了使用注释信息进行关联分析的变异集测试(STAAR)工作流描述语言(WDL)工作流,以促进全基因组测序关联研究中罕见变异的分析。用WDL编写的开放获取的STAAR工作流允许用户对以基因为中心和遗传区域方法进行罕见变异测试,实现全基因组、候选和条件分析。它将功能注释纳入工作流,如STAAR方法中所介绍的,以提高罕见变异分析能力。该工具经过专门开发和优化,可在基于云的平台(如由Terra提供支持的BioData Catalyst)上实施。它为罕见变异分析提供了易于使用的功能,可纳入详尽的全基因组测序分析流程。

可用性和实施

该工作流可从https://dockstore.org/workflows/github.com/sheilagaynor/STAAR_workflow免费获取。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
STAAR workflow: a cloud-based workflow for scalable and reproducible rare variant analysis.STAAR工作流程:一种用于可扩展和可重复的罕见变异分析的基于云的工作流程。
Bioinformatics. 2022 May 26;38(11):3116-3117. doi: 10.1093/bioinformatics/btac272.
2
Tibanna: software for scalable execution of portable pipelines on the cloud.Tibanna:用于在云端可扩展执行可移植管道的软件。
Bioinformatics. 2019 Nov 1;35(21):4424-4426. doi: 10.1093/bioinformatics/btz379.
3
SciApps: a cloud-based platform for reproducible bioinformatics workflows.SciApps:一个基于云的可重复生物信息学工作流平台。
Bioinformatics. 2018 Nov 15;34(22):3917-3920. doi: 10.1093/bioinformatics/bty439.
4
The Dockstore: enhancing a community platform for sharing reproducible and accessible computational protocols.Dockstore:增强了一个用于共享可重复和可访问的计算协议的社区平台。
Nucleic Acids Res. 2021 Jul 2;49(W1):W624-W632. doi: 10.1093/nar/gkab346.
5
nf-core/nanostring: a pipeline for reproducible NanoString nCounter analysis.nf-core/nanostring:用于可重复的 NanoString nCounter 分析的流水线。
Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btae019.
6
Sarek: A portable workflow for whole-genome sequencing analysis of germline and somatic variants.萨雷克:用于种系和体细胞变异的全基因组测序分析的便携式工作流程。
F1000Res. 2020 Jan 29;9:63. doi: 10.12688/f1000research.16665.2. eCollection 2020.
7
CloudNeo: a cloud pipeline for identifying patient-specific tumor neoantigens.CloudNeo:一种用于鉴定患者特异性肿瘤新生抗原的云流水线。
Bioinformatics. 2017 Oct 1;33(19):3110-3112. doi: 10.1093/bioinformatics/btx375.
8
Accelerating bioinformatics implementation in public health.加速生物信息学在公共卫生中的应用。
Microb Genom. 2023 Jul;9(7). doi: 10.1099/mgen.0.001051.
9
COWID: an efficient cloud-based genomics workflow for scalable identification of SARS-COV-2.COVID-19 基因组学工作流程:一种基于云计算的高效 SARS-CoV-2 可扩展鉴定方法
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad280.
10
PGen: large-scale genomic variations analysis workflow and browser in SoyKB.PGen:大豆知识库中的大规模基因组变异分析工作流程与浏览器
BMC Bioinformatics. 2016 Oct 6;17(Suppl 13):337. doi: 10.1186/s12859-016-1227-y.

引用本文的文献

1
Assessment of the functionality and usability of open-source rare variant analysis pipelines.开源罕见变异分析流程的功能与可用性评估。
Brief Bioinform. 2025 Feb 5;26(1). doi: 10.1093/bib/bbaf044.
2
Whole genome sequencing based analysis of inflammation biomarkers in the Trans-Omics for Precision Medicine (TOPMed) consortium.基于全基因组测序的精准医学转化研究联盟(TOPMed)炎症生物标志物分析。
Hum Mol Genet. 2024 Aug 6;33(16):1429-1441. doi: 10.1093/hmg/ddae050.
3
SUMMIT-FA: a new resource for improved transcriptome imputation using functional annotations.SUMMIT-FA:利用功能注释提高转录本推断的新资源。
Hum Mol Genet. 2024 Mar 20;33(7):624-635. doi: 10.1093/hmg/ddad205.
4
TIVAN-indel: a computational framework for annotating and predicting non-coding regulatory small insertions and deletions.TIVAN-indel:一种注释和预测非编码调控小插入和缺失的计算框架。
Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad060.
5
IMMerge: merging imputation data at scale.IMMerge:大规模合并插补数据。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac750.
6
FAVOR: functional annotation of variants online resource and annotator for variation across the human genome.FAVOR:在线变体功能注释资源和人类基因组变异注释器。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1300-D1311. doi: 10.1093/nar/gkac966.

本文引用的文献

1
Dynamic incorporation of multiple in silico functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale.大规模全基因组测序研究中通过多种计算功能注释的动态整合增强罕见变异关联分析。
Nat Genet. 2020 Sep;52(9):969-983. doi: 10.1038/s41588-020-0676-4. Epub 2020 Aug 24.
2
Efficient Variant Set Mixed Model Association Tests for Continuous and Binary Traits in Large-Scale Whole-Genome Sequencing Studies.高效的变体集混合模型关联测试在全基因组测序研究中用于连续和二项性状。
Am J Hum Genet. 2019 Feb 7;104(2):260-274. doi: 10.1016/j.ajhg.2018.12.012. Epub 2019 Jan 10.
3
The Dockstore: enabling modular, community-focused sharing of Docker-based genomics tools and workflows.码头仓库:实现基于Docker的基因组学工具和工作流程的模块化、以社区为中心的共享。
F1000Res. 2017 Jan 18;6:52. doi: 10.12688/f1000research.10137.1. eCollection 2017.
4
A high-performance computing toolset for relatedness and principal component analysis of SNP data.用于 SNP 数据亲缘关系和主成分分析的高性能计算工具集。
Bioinformatics. 2012 Dec 15;28(24):3326-8. doi: 10.1093/bioinformatics/bts606. Epub 2012 Oct 11.
5
GWASTools: an R/Bioconductor package for quality control and analysis of genome-wide association studies.GWASTools:一个用于全基因组关联研究质量控制和分析的 R/Bioconductor 包。
Bioinformatics. 2012 Dec 15;28(24):3329-31. doi: 10.1093/bioinformatics/bts610. Epub 2012 Oct 10.