JWES：一个用于全基因组/外显子组序列数据处理、管理以及基因变异发现、注释、预测和基因分型的新管道。

JWES: a new pipeline for whole genome/exome sequence data processing, management, and gene-variant discovery, annotation, prediction, and genotyping.

机构信息

Institute for Health, Health Care Policy and Aging Research, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA.

Department of Medicine, Rutgers Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, New Brunswick, NJ, USA.

出版信息

FEBS Open Bio. 2021 Sep;11(9):2441-2452. doi: 10.1002/2211-5463.13261. Epub 2021 Aug 11.

DOI:10.1002/2211-5463.13261

PMID:34370400

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8409305/

Abstract

Whole genome and exome sequencing (WGS/WES) are the most popular next-generation sequencing (NGS) methodologies and are at present often used to detect rare and common genetic variants of clinical significance. We emphasize that automated sequence data processing, management, and visualization should be an indispensable component of modern WGS and WES data analysis for sequence assembly, variant detection (SNPs, SVs), imputation, and resolution of haplotypes. In this manuscript, we present a newly developed findable, accessible, interoperable, and reusable (FAIR) bioinformatics-genomics pipeline Java based Whole Genome/Exome Sequence Data Processing Pipeline (JWES) for efficient variant discovery and interpretation, and big data modeling and visualization. JWES is a cross-platform, user-friendly, product line application, that entails three modules: (a) data processing, (b) storage, and (c) visualization. The data processing module performs a series of different tasks for variant calling, the data storage module efficiently manages high-volume gene-variant data, and the data visualization module supports variant data interpretation with Circos graphs. The performance of JWES was tested and validated in-house with different experiments, using Microsoft Windows, macOS Big Sur, and UNIX operating systems. JWES is an open-source and freely available pipeline, allowing scientists to take full advantage of all the computing resources available, without requiring much computer science knowledge. We have successfully applied JWES for processing, management, and gene-variant discovery, annotation, prediction, and genotyping of WGS and WES data to analyze variable complex disorders. In summary, we report the performance of JWES with some reproducible case studies, using open access and in-house generated, high-quality datasets.

摘要

全基因组和外显子组测序（WGS/WES）是最流行的下一代测序（NGS）方法，目前常用于检测具有临床意义的罕见和常见遗传变异。我们强调，自动化的序列数据处理、管理和可视化应该是现代 WGS 和 WES 数据分析的一个不可或缺的组成部分，用于序列组装、变异检测（SNP、SV）、推断和单倍型分辨率。在本文中，我们提出了一个新开发的、可发现的、可访问的、可互操作的和可重复使用的（FAIR）生物信息学-基因组学管道 Java 全基因组/外显子组序列数据处理管道（JWES），用于高效的变异发现和解释，以及大数据建模和可视化。JWES 是一个跨平台、用户友好的产品线应用程序，包含三个模块：（a）数据处理、（b）存储和（c）可视化。数据处理模块执行一系列用于变异调用的不同任务，数据存储模块高效地管理大容量基因变异数据，数据可视化模块支持使用 Circos 图进行变异数据解释。JWES 的性能在内部使用不同的实验进行了测试和验证，使用 Microsoft Windows、macOS Big Sur 和 UNIX 操作系统。JWES 是一个开源的、免费的管道，允许科学家充分利用所有可用的计算资源，而不需要太多的计算机科学知识。我们已经成功地应用 JWES 来处理、管理和发现 WGS 和 WES 数据的基因变异，对可变复杂疾病进行注释、预测和基因分型。总之，我们使用一些可重复的案例研究报告了 JWES 的性能，使用了开放获取和内部生成的高质量数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a28/8409305/b53f9b6aed19/FEB4-11-2441-g003.jpg

相似文献

JWES: a new pipeline for whole genome/exome sequence data processing, management, and gene-variant discovery, annotation, prediction, and genotyping.

FEBS Open Bio. 2021 Sep;11(9):2441-2452. doi: 10.1002/2211-5463.13261. Epub 2021 Aug 11.

CoVaCS: a consensus variant calling system.

BMC Genomics. 2018 Feb 5;19(1):120. doi: 10.1186/s12864-018-4508-1.

A community-based resource for automatic exome variant-calling and annotation in Mendelian disorders.

BMC Genomics. 2014;15 Suppl 3(Suppl 3):S5. doi: 10.1186/1471-2164-15-S3-S5. Epub 2014 May 6.

Genomics pipelines to investigate susceptibility in whole genome and exome sequenced data for variant discovery, annotation, prediction and genotyping.

PeerJ. 2021 Jul 26;9:e11724. doi: 10.7717/peerj.11724. eCollection 2021.

Genome analysis and knowledge-driven variant interpretation with TGex.

BMC Med Genomics. 2019 Dec 30;12(1):200. doi: 10.1186/s12920-019-0647-8.

Bioinformatics Analysis of Whole Exome Sequencing Data.

Methods Mol Biol. 2019;1881:277-318. doi: 10.1007/978-1-4939-8876-1_21.

NGS_SNPAnalyzer: a desktop software supporting genome projects by identifying and visualizing sequence variations from next-generation sequencing data.

Genes Genomics. 2020 Nov;42(11):1311-1317. doi: 10.1007/s13258-020-00997-7. Epub 2020 Sep 26.

Prioritizing disease-linked variants, genes, and pathways with an interactive whole-genome analysis pipeline.

Hum Mutat. 2014 May;35(5):537-47. doi: 10.1002/humu.22520. Epub 2014 Mar 6.

A survey of tools for variant analysis of next-generation genome sequencing data.

Brief Bioinform. 2014 Mar;15(2):256-78. doi: 10.1093/bib/bbs086. Epub 2013 Jan 21.

CSN and CAVA: variant annotation tools for rapid, robust next-generation sequencing analysis in the clinical setting.

Genome Med. 2015 Jul 28;7(1):76. doi: 10.1186/s13073-015-0195-6.

引用本文的文献

VAREANT: a bioinformatics application for gene variant reduction and annotation.

Bioinform Adv. 2024 Dec 31;5(1):vbae210. doi: 10.1093/bioadv/vbae210. eCollection 2025.

Deciphering expression and variants in cardiovascular disease genes among heart failure population for precision medicine.

ESC Heart Fail. 2024 Feb;11(1):606-609. doi: 10.1002/ehf2.14653. Epub 2023 Dec 22.

Functional mutation, splice, distribution, and divergence analysis of impactful genes associated with heart failure and other cardiovascular diseases.

Sci Rep. 2023 Oct 5;13(1):16769. doi: 10.1038/s41598-023-44127-1.

MIRACUM-Pipe: An Adaptable Pipeline for Next-Generation Sequencing Analysis, Reporting, and Visualization for Clinical Decision Making.

Cancers (Basel). 2023 Jul 1;15(13):3456. doi: 10.3390/cancers15133456.

Whole Animal Genome Sequencing: user-friendly, rapid, containerized pipelines for processing, variant discovery, and annotation of short-read whole genome sequencing data.

G3 (Bethesda). 2023 Aug 9;13(8). doi: 10.1093/g3journal/jkad117.

Resources and tools for rare disease variant interpretation.

Front Mol Biosci. 2023 May 10;10:1169109. doi: 10.3389/fmolb.2023.1169109. eCollection 2023.

Integrated ACMG-approved genes and ICD codes for the translational research and precision medicine.

Database (Oxford). 2023 May 17;2023. doi: 10.1093/database/baad033.

Artificial Intelligence, Healthcare, Clinical Genomics, and Pharmacogenomics Approaches in Precision Medicine.

Front Genet. 2022 Jul 6;13:929736. doi: 10.3389/fgene.2022.929736. eCollection 2022.

Artificial intelligence and machine learning approaches using gene expression and variant data for personalized medicine.

Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac191.

本文引用的文献

Genomics pipelines to investigate susceptibility in whole genome and exome sequenced data for variant discovery, annotation, prediction and genotyping.

PeerJ. 2021 Jul 26;9:e11724. doi: 10.7717/peerj.11724. eCollection 2021.

A wealth of discovery built on the Human Genome Project - by the numbers.

Nature. 2021 Feb;590(7845):212-215. doi: 10.1038/d41586-021-00314-6.

Evaluation of NGS-based approaches for SARS-CoV-2 whole genome characterisation.

Virus Evol. 2020 Oct 5;6(2):veaa075. doi: 10.1093/ve/veaa075. eCollection 2020 Jul.

Coding-Complete Genome Sequences of Three SARS-CoV-2 Strains from Bangladesh.

Microbiol Resour Announc. 2020 Sep 24;9(39):e00764-20. doi: 10.1128/MRA.00764-20.

Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples.

Nat Commun. 2020 Sep 21;11(1):4748. doi: 10.1038/s41467-020-18151-y.

Four SARS-CoV-2 Genome Sequences from Late April in Stockholm, Sweden, Reveal a Rare Mutation in the Spike Protein.

Microbiol Resour Announc. 2020 Aug 27;9(35):e00934-20. doi: 10.1128/MRA.00934-20.

Galactic Circos: User-friendly Circos plots within the Galaxy platform.

Gigascience. 2020 Jun 1;9(6). doi: 10.1093/gigascience/giaa065.

Alterations in Gut Microbiota of Patients With COVID-19 During Time of Hospitalization.

Gastroenterology. 2020 Sep;159(3):944-955.e8. doi: 10.1053/j.gastro.2020.05.048. Epub 2020 May 20.

Accelerating next generation sequencing data analysis: an evaluation of optimized best practices for Genome Analysis Toolkit algorithms.

Genomics Inform. 2020 Mar;18(1):e10. doi: 10.5808/GI.2020.18.1.e10. Epub 2020 Mar 31.

Precision medicine integrating whole-genome sequencing, comprehensive metabolomics, and advanced imaging.

Proc Natl Acad Sci U S A. 2020 Feb 11;117(6):3053-3062. doi: 10.1073/pnas.1909378117. Epub 2020 Jan 24.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

JWES：一个用于全基因组/外显子组序列数据处理、管理以及基因变异发现、注释、预测和基因分型的新管道。

JWES: a new pipeline for whole genome/exome sequence data processing, management, and gene-variant discovery, annotation, prediction, and genotyping.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献