• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

复合杂合变异鉴定管道(CompoundHetVIP)。

CompoundHetVIP: Compound Heterozygous Variant Identification Pipeline.

机构信息

Department of Biology, Brigham Young University, Provo, UT, 84602, USA.

出版信息

F1000Res. 2020 Oct 8;9:1211. doi: 10.12688/f1000research.26848.2. eCollection 2020.

DOI:10.12688/f1000research.26848.2
PMID:33680433
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7905494/
Abstract

Compound Heterozygous ( ) variant identification requires distinguishing maternally from paternally derived nucleotides, a process that requires numerous computational tools. Using such tools often introduces unforeseen challenges such as installation procedures that are operating-system specific, software dependencies that must be installed, and formatting requirements for input files. To overcome these challenges, we developed Compound Heterozygous Variant Identification Pipeline (CompoundHetVIP), which uses a single Docker image to encapsulate commonly used software tools for file aggregation ( or ), VCF liftover ( ), joint-genotyping ( ), file conversion ( ), phasing ( , , and/or ), variant normalization ( tools), annotation ( ), relational database generation ( ), and identification of , homozygous alternate, and variants in a series of 13 steps. To begin using our tool, researchers need only install the Docker engine and download the CompoundHetVIP Docker image. The tools provided in CompoundHetVIP, subject to the limitations of the underlying software, can be applied to whole-genome, whole-exome, or targeted exome sequencing data of individual samples or trios (a child and both parents), using VCF or gVCF files as initial input. Each step of the pipeline produces an analysis-ready output file that can be further evaluated. To illustrate its use, we applied CompoundHetVIP to data from a publicly available Ashkenazim trio and identified two genes with a candidate variant and two genes with a candidate homozygous alternate variant after filtering based on user-set thresholds for global minor allele frequency, Combined Annotation Dependent Depletion, and Gene Damage Index. While this example uses genomic data from a healthy child, we anticipate that most researchers will use CompoundHetVIP to uncover missing heritability in human diseases and other phenotypes. CompoundHetVIP is open-source software and can be found at https://github.com/dmiller903/CompoundHetVIP; this repository also provides detailed, step-by-step examples.

摘要

复合杂合子 () 变异体的鉴定需要区分母源和父源核苷酸,这一过程需要大量的计算工具。使用这些工具通常会带来意想不到的挑战,例如特定于操作系统的安装程序、必须安装的软件依赖项以及输入文件的格式要求。为了克服这些挑战,我们开发了复合杂合子变异体鉴定管道 (CompoundHetVIP),该管道使用单个 Docker 镜像来封装常用的文件聚合软件工具 ( 或 )、VCF 提升 ( )、联合基因分型 ( )、文件转换 ( )、相位 ( 、 和/或 )、变异体标准化 ( 工具)、注释 ( )、关系数据库生成 ( )以及在一系列 13 个步骤中鉴定 、纯合替代和 变体。要开始使用我们的工具,研究人员只需安装 Docker 引擎并下载 CompoundHetVIP Docker 镜像。CompoundHetVIP 中提供的工具(受底层软件的限制)可以应用于个体样本或三体型(一个孩子和两个父母)的全基因组、全外显子或靶向外显子测序数据,使用 VCF 或 gVCF 文件作为初始输入。管道的每个步骤都会生成一个可进一步评估的分析就绪输出文件。为了说明其用途,我们将 CompoundHetVIP 应用于公开可用的阿什肯纳兹三人组的数据,并在基于用户设置的全局次要等位基因频率、综合注释依赖耗竭和基因损伤指数的阈值对数据进行过滤后,鉴定出两个候选 变体基因和两个候选纯合替代变体基因。虽然此示例使用来自健康儿童的基因组数据,但我们预计大多数研究人员将使用 CompoundHetVIP 揭示人类疾病和其他表型中的遗传缺失。CompoundHetVIP 是开源软件,可以在 https://github.com/dmiller903/CompoundHetVIP 找到;该存储库还提供了详细的、逐步的示例。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ddb/7905566/ca0485078dae/f1000research-9-54440-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ddb/7905566/ca0485078dae/f1000research-9-54440-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ddb/7905566/ca0485078dae/f1000research-9-54440-g0000.jpg

相似文献

1
CompoundHetVIP: Compound Heterozygous Variant Identification Pipeline.复合杂合变异鉴定管道(CompoundHetVIP)。
F1000Res. 2020 Oct 8;9:1211. doi: 10.12688/f1000research.26848.2. eCollection 2020.
2
ILIAD: a suite of automated Snakemake workflows for processing genomic data for downstream applications.ILIAD:一套用于处理基因组数据以用于下游应用的自动化 Snakemake 工作流程套件。
BMC Bioinformatics. 2023 Nov 8;24(1):424. doi: 10.1186/s12859-023-05548-x.
3
123VCF: an intuitive and efficient tool for filtering VCF files.123VCF:一个直观且高效的 VCF 文件过滤工具。
BMC Bioinformatics. 2024 Feb 14;25(1):68. doi: 10.1186/s12859-024-05661-5.
4
trioPhaser: using Mendelian inheritance logic to improve genomic phasing of trios.trioPhaser:利用孟德尔遗传逻辑提高三体型的基因组相位。
BMC Bioinformatics. 2021 Nov 22;22(1):559. doi: 10.1186/s12859-021-04470-4.
5
The Allele Catalog Tool: a web-based interactive tool for allele discovery and analysis.等位基因目录工具:一个基于网络的等位基因发现和分析的交互式工具。
BMC Genomics. 2023 Mar 10;24(1):107. doi: 10.1186/s12864-023-09161-3.
6
Kuura-An automated workflow for analyzing WES and WGS data.Kuura—一种用于分析 WES 和 WGS 数据的自动化工作流程。
PLoS One. 2024 Jan 18;19(1):e0296785. doi: 10.1371/journal.pone.0296785. eCollection 2024.
7
Prioritizing disease-linked variants, genes, and pathways with an interactive whole-genome analysis pipeline.使用交互式全基因组分析流程对疾病相关变异、基因和通路进行优先级排序。
Hum Mutat. 2014 May;35(5):537-47. doi: 10.1002/humu.22520. Epub 2014 Mar 6.
8
A Survey of Compound Heterozygous Variants in Pediatric Cancers and Structural Birth Defects.小儿癌症和结构性出生缺陷中的复合杂合变异体调查
Front Genet. 2021 Mar 22;12:640242. doi: 10.3389/fgene.2021.640242. eCollection 2021.
9
VarGenius executes cohort-level DNA-seq variant calling and annotation and allows to manage the resulting data through a PostgreSQL database.VarGenius 执行队列级别的 DNA 测序变异调用和注释,并允许通过 PostgreSQL 数据库管理产生的数据。
BMC Bioinformatics. 2018 Dec 12;19(1):477. doi: 10.1186/s12859-018-2532-4.
10
WEP: a high-performance analysis pipeline for whole-exome data.WEP:一种用于全外显子组数据的高性能分析管道。
BMC Bioinformatics. 2013;14 Suppl 7(Suppl 7):S11. doi: 10.1186/1471-2105-14-S7-S11. Epub 2013 Apr 22.

引用本文的文献

1
trioPhaser: using Mendelian inheritance logic to improve genomic phasing of trios.trioPhaser:利用孟德尔遗传逻辑提高三体型的基因组相位。
BMC Bioinformatics. 2021 Nov 22;22(1):559. doi: 10.1186/s12859-021-04470-4.
2
Toward a methodology for evaluating DNA variants in nuclear families.评估核型家族中 DNA 变体的方法学研究
PLoS One. 2021 Oct 8;16(10):e0258375. doi: 10.1371/journal.pone.0258375. eCollection 2021.
3
A Survey of Compound Heterozygous Variants in Pediatric Cancers and Structural Birth Defects.小儿癌症和结构性出生缺陷中的复合杂合变异体调查
Front Genet. 2021 Mar 22;12:640242. doi: 10.3389/fgene.2021.640242. eCollection 2021.