Laboratoire de Génétique médicale, UMR_S INSERM U1112, IGMA, Faculté de Médecine FMTS, Université de Strasbourg , Strasbourg , France ; IGBMC, CNRS UMR 7104/INSERM U964/Université de Strasbourg , Illkirch Cedex , France.
IGBMC, CNRS UMR 7104/INSERM U964/Université de Strasbourg , Illkirch Cedex , France.
PeerJ. 2015 Mar 3;3:e796. doi: 10.7717/peerj.796. eCollection 2015.
Background. Most genetic disorders are caused by single nucleotide variations (SNVs) or small insertion/deletions (indels). High throughput sequencing has broadened the catalogue of human variation, including common polymorphisms, rare variations or disease causing mutations. However, identifying one variation among hundreds or thousands of others is still a complex task for biologists, geneticists and clinicians. Results. We have developed VaRank, a command-line tool for the ranking of genetic variants detected by high-throughput sequencing. VaRank scores and prioritizes variants annotated either by Alamut Batch or SnpEff. A barcode allows users to quickly view the presence/absence of variants (with homozygote/heterozygote status) in analyzed samples. VaRank supports the commonly used VCF input format for variants analysis thus allowing it to be easily integrated into NGS bioinformatics analysis pipelines. VaRank has been successfully applied to disease-gene identification as well as to molecular diagnostics setup for several hundred patients. Conclusions. VaRank is implemented in Tcl/Tk, a scripting language which is platform-independent but has been tested only on Unix environment. The source code is available under the GNU GPL, and together with sample data and detailed documentation can be downloaded from http://www.lbgi.fr/VaRank/.
背景:大多数遗传疾病是由单核苷酸变异(SNVs)或小的插入/缺失(indels)引起的。高通量测序拓宽了人类变异的目录,包括常见的多态性、罕见的变异或致病突变。然而,在数百或数千个其他变异中识别一个变异对于生物学家、遗传学家和临床医生来说仍然是一项复杂的任务。
结果:我们开发了 VaRank,这是一种用于对高通量测序检测到的遗传变异进行排序的命令行工具。VaRank 对 Alamut Batch 或 SnpEff 注释的变异进行评分和优先级排序。条形码允许用户快速查看分析样本中变异的存在/缺失(具有纯合子/杂合子状态)。VaRank 支持常用的 VCF 变异分析输入格式,因此可以轻松集成到 NGS 生物信息学分析管道中。VaRank 已成功应用于疾病基因识别以及数百名患者的分子诊断设置。
结论:VaRank 是用 Tcl/Tk 实现的,这是一种独立于平台的脚本语言,但仅在 Unix 环境下进行了测试。源代码遵循 GNU GPL 许可,可与示例数据和详细文档一起从 http://www.lbgi.fr/VaRank/ 下载。
J Med Genet. 2012-9-28
Mol Genet Genomic Med. 2019-5-24
BMC Genomics. 2021-6-3
Bioinformatics. 2018-10-15
BMC Bioinformatics. 2016-11-8
BMC Bioinformatics. 2017-12-15
Nucleic Acids Res. 2017-7-3
Front Physiol. 2023-5-9
Int J Mol Sci. 2023-2-9
Nucleic Acids Res. 2014-12-16
Genome Med. 2014-3-31
Bioinformatics. 2014-4-15