• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

网络基因查询工具(webGQT):用于基于模型的变异过滤的基因型查询工具的闪亮服务器。

webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering.

作者信息

Arumilli Meharji, Layer Ryan M, Hytönen Marjo K, Lohi Hannes

机构信息

Department of Veterinary Biosciences, Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.

Genetics Research Program, The Folkhälsan Research Center, Helsinki, Finland.

出版信息

Front Genet. 2020 Mar 3;11:152. doi: 10.3389/fgene.2020.00152. eCollection 2020.

DOI:10.3389/fgene.2020.00152
PMID:32194629
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7063093/
Abstract

SUMMARY

Genotype Query Tools (GQT) were developed to discover disease-causing variations from billions of genotypes and millions of genomes, processes data at substantially higher speed over other existing methods. While GQT has been available to a wide audience as command-line software, the difficulty of constructing queries among non-IT or non-bioinformatics researchers has limited its applicability. To overcome this limitation, we developed webGQT, an easy-to-use tool with a graphical user interface. With pre-built queries across three modules, webGQT allows for pedigree analysis, case-control studies, and population frequency studies. As a package, webGQT allows researchers with less or no applied bioinformatics/IT experience to mine potential disease-causing variants from billions.

RESULTS

webGQT offers a flexible and easy-to-use interface for model-based candidate variant filtering for Mendelian diseases from thousands to millions of genomes at a reduced computation time. Additionally, webGQT provides adjustable parameters to reduce false positives and rescue missing genotypes across all modules. Using a case study, we demonstrate the applicability of webGQT to query non-human genomes. In addition, we demonstrate the scalability of webGQT on large data sets by implementing complex population-specific queries on the 1000 Genomes Project Phase 3 data set, which includes 8.4 billion variants from 2504 individuals across 26 different populations. Furthermore, webGQT supports filtering single-nucleotide variants, short insertions/deletions, copy number or any other variant genotypes supported by the VCF specification. Our results show that webGQT can be used as an online web service, or deployed on personal computers or local servers within research groups.

AVAILABILITY

webGQT is made available to the users in three forms: 1) as a webserver available at https://vm1138.kaj.pouta.csc.fi/webgqt/, 2) as an R package to install on personal computers, and 3) as part of the same R package to configure on the user's own servers. The application is available for installation at https://github.com/arumds/webgqt.

摘要

摘要

基因型查询工具(GQT)旨在从数十亿个基因型和数百万个基因组中发现致病变异,其处理数据的速度比其他现有方法快得多。虽然GQT作为命令行软件已被广泛使用,但非信息技术或非生物信息学研究人员构建查询的难度限制了其适用性。为克服这一限制,我们开发了webGQT,这是一个带有图形用户界面的易用工具。通过三个模块中的预建查询,webGQT可进行系谱分析、病例对照研究和群体频率研究。作为一个软件包,webGQT使几乎没有或没有应用生物信息学/信息技术经验的研究人员能够从数十亿个数据中挖掘潜在的致病变异。

结果

webGQT为基于模型的候选变异筛选提供了一个灵活且易用的界面,用于从数千到数百万个基因组中筛选孟德尔疾病的变异,同时减少计算时间。此外,webGQT提供了可调整的参数,以减少所有模块中的假阳性并挽救缺失的基因型。通过一个案例研究,我们展示了webGQT查询非人类基因组的适用性。此外,我们通过对1000基因组计划第三阶段数据集(其中包括来自26个不同群体的2504个个体的84亿个变异)实施复杂的群体特异性查询,展示了webGQT在大数据集上的可扩展性。此外,webGQT支持筛选单核苷酸变异、短插入/缺失、拷贝数或VCF规范支持的任何其他变异基因型。我们的结果表明,webGQT既可以用作在线网络服务,也可以部署在研究小组内的个人计算机或本地服务器上。

可用性

webGQT以三种形式提供给用户:1)作为一个网络服务器,可在https://vm1138.kaj.pouta.csc.fi/webgqt/上使用;2)作为一个R包,可安装在个人计算机上;3)作为同一个R包的一部分,可在用户自己的服务器上进行配置。该应用程序可在https://github.com/arumds/webgqt上进行安装。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1ef/7063093/f734b731ea63/fgene-11-00152-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1ef/7063093/cc65a97e225f/fgene-11-00152-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1ef/7063093/f734b731ea63/fgene-11-00152-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1ef/7063093/cc65a97e225f/fgene-11-00152-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1ef/7063093/f734b731ea63/fgene-11-00152-g002.jpg

相似文献

1
webGQT: A Shiny Server for Genotype Query Tools for Model-Based Variant Filtering.网络基因查询工具(webGQT):用于基于模型的变异过滤的基因型查询工具的闪亮服务器。
Front Genet. 2020 Mar 3;11:152. doi: 10.3389/fgene.2020.00152. eCollection 2020.
2
VCF-Server: A web-based visualization tool for high-throughput variant data mining and management.VCF-Server:一个基于网络的高通量变异数据挖掘和管理的可视化工具。
Mol Genet Genomic Med. 2019 Jul;7(7):e00641. doi: 10.1002/mgg3.641. Epub 2019 May 24.
3
VCF-Explorer: filtering and analysing whole genome VCF files.VCF-Explorer:过滤和分析全基因组 VCF 文件。
Bioinformatics. 2017 Nov 1;33(21):3468-3470. doi: 10.1093/bioinformatics/btx422.
4
Variant graph craft (VGC): a comprehensive tool for analyzing genetic variation and identifying disease-causing variants.变体图工艺(VGC):一种全面的分析遗传变异和识别致病变异的工具。
BMC Bioinformatics. 2024 Sep 3;25(1):288. doi: 10.1186/s12859-024-05875-7.
5
VCFshiny: an R/Shiny application for interactively analyzing and visualizing genetic variants.VCFshiny:一款用于交互式分析和可视化基因变异的R/Shiny应用程序。
Bioinform Adv. 2023 Aug 26;3(1):vbad107. doi: 10.1093/bioadv/vbad107. eCollection 2023.
6
VCF-Miner: GUI-based application for mining variants and annotations stored in VCF files.VCF-Miner:用于挖掘存储在VCF文件中的变异和注释的基于图形用户界面的应用程序。
Brief Bioinform. 2016 Mar;17(2):346-51. doi: 10.1093/bib/bbv051. Epub 2015 Jul 25.
7
Huvariome: a web server resource of whole genome next-generation sequencing allelic frequencies to aid in pathological candidate gene selection.Huvariome:一个用于辅助病理候选基因选择的全基因组下一代测序等位基因频率的网络服务器资源。
J Clin Bioinforma. 2012 Nov 19;2(1):19. doi: 10.1186/2043-9113-2-19.
8
Efficient genotype compression and analysis of large genetic-variation data sets.大型基因变异数据集的高效基因型压缩与分析
Nat Methods. 2016 Jan;13(1):63-5. doi: 10.1038/nmeth.3654. Epub 2015 Nov 9.
9
123VCF: an intuitive and efficient tool for filtering VCF files.123VCF:一个直观且高效的 VCF 文件过滤工具。
BMC Bioinformatics. 2024 Feb 14;25(1):68. doi: 10.1186/s12859-024-05661-5.
10
PSReliP: an integrated pipeline for analysis and visualization of population structure and relatedness based on genome-wide genetic variant data.PSReliP:一个基于全基因组遗传变异数据的分析和可视化群体结构及亲缘关系的集成分析工具。
BMC Bioinformatics. 2023 Apr 5;24(1):135. doi: 10.1186/s12859-023-05169-4.

引用本文的文献

1
IP3 receptor depletion in a spontaneous canine model of Charcot-Marie-Tooth disease 1J with amelogenesis imperfecta.伴有牙釉质发育不全的夏科-马里-图斯病1J型自发性犬模型中的肌醇三磷酸受体缺失
PLoS Genet. 2025 Jan 13;21(1):e1011328. doi: 10.1371/journal.pgen.1011328. eCollection 2025 Jan.
2
Naturally occurring canine laminopathy leading to a dilated and fibrosing cardiomyopathy in the Nova Scotia Duck Tolling Retriever.一种导致新斯科舍诱鸭寻回猎犬扩张型和纤维性心肌病的天然发生的犬层状蛋白病。
Sci Rep. 2023 Nov 4;13(1):19077. doi: 10.1038/s41598-023-46601-2.
3
A loss-of-function variant in canine GLRA1 associates with a neurological disorder resembling human hyperekplexia.

本文引用的文献

1
A novel KRT71 variant in curly-coated dogs.卷毛犬中一种新的角蛋白71变体。
Anim Genet. 2019 Feb;50(1):101-104. doi: 10.1111/age.12746. Epub 2018 Nov 19.
2
The UK's 100,000 Genomes Project: manifesting policymakers' expectations.英国的“十万基因组计划”:兑现政策制定者的期望。
New Genet Soc. 2017 Sep 6;36(4):336-353. doi: 10.1080/14636778.2017.1370671. eCollection 2017.
3
VCF-Explorer: filtering and analysing whole genome VCF files.VCF-Explorer:过滤和分析全基因组 VCF 文件。
犬 GLRA1 中的功能丧失变异与一种类似于人类肌阵挛性癫痫的神经紊乱有关。
Hum Genet. 2023 Aug;142(8):1221-1230. doi: 10.1007/s00439-023-02571-z. Epub 2023 May 24.
4
Missense variant in LOXHD1 is associated with canine nonsyndromic hearing loss.LOXHD1 中的错义变异与犬非综合征性听力损失有关。
Hum Genet. 2021 Nov;140(11):1611-1618. doi: 10.1007/s00439-021-02286-z. Epub 2021 May 13.
5
In-frame deletion in canine PITRM1 is associated with a severe early-onset epilepsy, mitochondrial dysfunction and neurodegeneration.犬 PITRM1 框内缺失与严重早发性癫痫、线粒体功能障碍和神经退行性变有关。
Hum Genet. 2021 Nov;140(11):1593-1609. doi: 10.1007/s00439-021-02279-y. Epub 2021 Apr 9.
6
A missense variant in IFT122 associated with a canine model of retinitis pigmentosa.IFT122 中的错义变异与视网膜色素变性的犬模型相关。
Hum Genet. 2021 Nov;140(11):1569-1579. doi: 10.1007/s00439-021-02266-3. Epub 2021 Feb 19.
7
Intronic variant in POU1F1 associated with canine pituitary dwarfism.POU1F1 内含子变异与犬垂体性侏儒症相关。
Hum Genet. 2021 Nov;140(11):1553-1562. doi: 10.1007/s00439-021-02259-2. Epub 2021 Feb 6.
8
GCViT: a method for interactive, genome-wide visualization of resequencing and SNP array data.GCViT:一种用于交互式、全基因组重测序和 SNP 数组数据可视化的方法。
BMC Genomics. 2020 Nov 23;21(1):822. doi: 10.1186/s12864-020-07217-2.
9
A novel genomic region on chromosome 11 associated with fearfulness in dogs.与犬类恐惧相关的 11 号染色体上新的基因组区域。
Transl Psychiatry. 2020 May 28;10(1):169. doi: 10.1038/s41398-020-0849-z.
10
Recessive missense LAMP3 variant associated with defect in lamellar body biogenesis and fatal neonatal interstitial lung disease in dogs.隐性错义 LAMP3 变异与犬层状小体生物发生缺陷和致命性新生儿间质性肺病有关。
PLoS Genet. 2020 Mar 9;16(3):e1008651. doi: 10.1371/journal.pgen.1008651. eCollection 2020 Mar.
Bioinformatics. 2017 Nov 1;33(21):3468-3470. doi: 10.1093/bioinformatics/btx422.
4
Analysis of protein-coding genetic variation in 60,706 humans.对60706名人类的蛋白质编码基因变异进行分析。
Nature. 2016 Aug 18;536(7616):285-91. doi: 10.1038/nature19057.
5
vcfr: a package to manipulate and visualize variant call format data in R.vcfr:一个用于在R中处理和可视化变异调用格式数据的软件包。
Mol Ecol Resour. 2017 Jan;17(1):44-53. doi: 10.1111/1755-0998.12549. Epub 2016 Jul 12.
6
Efficient genotype compression and analysis of large genetic-variation data sets.大型基因变异数据集的高效基因型压缩与分析
Nat Methods. 2016 Jan;13(1):63-5. doi: 10.1038/nmeth.3654. Epub 2015 Nov 9.
7
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
8
VCF-Miner: GUI-based application for mining variants and annotations stored in VCF files.VCF-Miner:用于挖掘存储在VCF文件中的变异和注释的基于图形用户界面的应用程序。
Brief Bioinform. 2016 Mar;17(2):346-51. doi: 10.1093/bib/bbv051. Epub 2015 Jul 25.
9
Big Data: Astronomical or Genomical?大数据:天文学的还是基因组学的?
PLoS Biol. 2015 Jul 7;13(7):e1002195. doi: 10.1371/journal.pbio.1002195. eCollection 2015 Jul.
10
CanvasDB: a local database infrastructure for analysis of targeted- and whole genome re-sequencing projects.CanvasDB:一种用于分析靶向和全基因组重测序项目的本地数据库基础设施。
Database (Oxford). 2014 Oct 3;2014. doi: 10.1093/database/bau098. Print 2014.