Var2GO: a web-based tool for gene variants selection.

Authors

Granata Ilaria, Sangiovanni Mara, Maiorano Francesco, Miele Marco, Guarracino Mario Rosario

Affiliation

High Performance Computing and Networking Institute, National Research Council of Italy, Via P. Castellino, 111, Napoli, 80131, Italy.

Publication

BMC Bioinformatics. 2016 Nov 8;17(Suppl 12):376. doi: 10.1186/s12859-016-1197-0.

DOI:10.1186/s12859-016-1197-0
PMID:28185576
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5123234/
Abstract

BACKGROUND

One of the most challenging issues in the variant calling process is handling the resulting data and filtering the genes so that only those strictly related to the topic of interest are retained. Several tools can gather annotations at different levels of complexity for the detected genes and group them according to the pathways and/or processes they belong to. However, this can be a time-consuming and frustrating task, partly because of the size of the file, which might contain many thousands of genes, and partly because searching for associated variants requires gene-by-gene investigation and annotation. As a consequence, the initial gene list is often reduced by exploiting knowledge of variant effect, novelty and genotype, at the risk of losing meaningful information.

RESULTS

Here we present Var2GO, a new web-based tool to support the annotation and filtering of variants and genes obtained from variant calling of high-throughput sequencing data. Var2GO accepts either the unprocessed Variant Call Format (VCF) file or a table containing the annotated variants. Raw data first undergo variant annotation with the SnpEff tool and are converted to a table format. The table is then loaded into a database generated on the fly. Genes associated with the variants are automatically annotated with the corresponding Gene Ontology (GO) terms covering the three GO domains. Through the web interface, it is then possible to filter and extract, from the whole list, the genes that have annotations in the domain of interest by simply specifying filtering parameters and one or more keywords. The relevance of this tool is demonstrated on exome sequencing data.
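The workflow described above (annotated VCF to table, table to a database created on the fly, GO annotation, keyword filtering) can be sketched in a few lines. This is only an illustration under stated assumptions, not Var2GO's actual implementation: the gene names, variants and GO assignments are invented toy data, the function names `parse_vcf_line`, `build_db` and `select_genes` are hypothetical, and an in-memory SQLite database stands in for whatever database the tool really uses. The only format detail relied on is that SnpEff's `ANN` field is pipe-separated, with the effect, impact and gene name in the second, third and fourth positions.

```python
import sqlite3

# Toy VCF records with SnpEff-style ANN fields (all values are invented).
VCF_LINES = [
    "1\t1000\t.\tA\tG\t50\tPASS\tANN=G|missense_variant|MODERATE|GENE_A|id1|transcript|T1|protein_coding",
    "2\t2000\t.\tC\tT\t60\tPASS\tANN=T|synonymous_variant|LOW|GENE_B|id2|transcript|T2|protein_coding",
    "3\t3000\t.\tG\tA\t70\tPASS\tANN=A|stop_gained|HIGH|GENE_C|id3|transcript|T3|protein_coding",
]

# Hypothetical gene -> (GO domain, GO term) assignments, for illustration only.
GO_ANNOTATIONS = [
    ("GENE_A", "biological_process", "glycogen biosynthetic process"),
    ("GENE_A", "cellular_component", "cytoplasm"),
    ("GENE_B", "biological_process", "glycogen catabolic process"),
    ("GENE_C", "molecular_function", "DNA binding"),
]


def parse_vcf_line(line):
    """Turn one annotated VCF record into a flat table row."""
    chrom, pos, _id, ref, alt, _qual, _filter, info = line.split("\t")[:8]
    # Take the ANN entry from the INFO column and keep its first annotation.
    ann = next(f[4:] for f in info.split(";") if f.startswith("ANN="))
    fields = ann.split(",")[0].split("|")
    effect, impact, gene = fields[1], fields[2], fields[3]
    return (chrom, int(pos), ref, alt, effect, impact, gene)


def build_db(vcf_lines, go_annotations):
    """Create an in-memory database holding the variant and GO tables."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE variants (chrom, pos, ref, alt, effect, impact, gene)")
    db.execute("CREATE TABLE go (gene, domain, term)")
    db.executemany(
        "INSERT INTO variants VALUES (?, ?, ?, ?, ?, ?, ?)",
        [parse_vcf_line(line) for line in vcf_lines],
    )
    db.executemany("INSERT INTO go VALUES (?, ?, ?)", go_annotations)
    return db


def select_genes(db, keyword, domain, impacts):
    """Return genes whose GO terms in `domain` contain `keyword`
    and that carry at least one variant with an impact in `impacts`."""
    placeholders = ",".join("?" * len(impacts))
    query = f"""
        SELECT DISTINCT v.gene
        FROM variants v JOIN go g ON g.gene = v.gene
        WHERE g.domain = ? AND g.term LIKE ? AND v.impact IN ({placeholders})
        ORDER BY v.gene
    """
    rows = db.execute(query, [domain, f"%{keyword}%", *impacts])
    return [row[0] for row in rows]


db = build_db(VCF_LINES, GO_ANNOTATIONS)
print(select_genes(db, "glycogen", "biological_process", ["MODERATE", "HIGH"]))
```

With the toy data, searching the biological process domain for the keyword "glycogen" while keeping only moderate- and high-impact variants returns `GENE_A` alone; `GENE_B` matches the keyword but is dropped because its only variant is low impact. Combining the topic keyword with the variant filters in a single query is what lets the gene list be narrowed by topic rather than by variant properties alone.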

CONCLUSIONS

Var2GO is a novel tool that implements a topic-based approach, expressly designed to help biologists narrow the search for relevant genes resulting from variant calling analysis. Its main purpose is to support non-bioinformaticians in handling and processing raw variant calling data through an intuitive web interface. Furthermore, Var2GO offers a complete pipeline that, starting from the raw VCF file, annotates both variants and associated genes and supports the extraction of relevant biological knowledge.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/769a/5123234/bbbd89d742a5/12859_2016_1197_Fig1_HTML.jpg
