Suppr超能文献

GLANET:基因组位点注释和富集工具。

GLANET: genomic loci annotation and enrichment tool.

机构信息

Department of Computer Engineering, Middle East Technical University, 06800, Ankara, Turkey.

Department of Computer Engineering, Bilkent University, 06800, Ankara, Turkey.

出版信息

Bioinformatics. 2017 Sep 15;33(18):2818-2828. doi: 10.1093/bioinformatics/btx326.

Abstract

MOTIVATION

Genomic studies identify genomic loci representing genetic variations, transcription factor (TF) occupancy, or histone modification through next generation sequencing (NGS) technologies. Interpreting these loci requires evaluating them with known genomic and epigenomic annotations.

RESULTS

We present GLANET as a comprehensive annotation and enrichment analysis tool which implements a sampling-based enrichment test that accounts for GC content and/or mappability biases, jointly or separately. GLANET annotates and performs enrichment analysis on these loci with a rich library. We introduce and perform novel data-driven computational experiments for assessing the power and Type-I error of its enrichment procedure which show that GLANET has attained high statistical power and well-controlled Type-I error rate. As a key feature, users can easily extend its library with new gene sets and genomic intervals. Other key features include assessment of impact of single nucleotide variants (SNPs) on TF binding sites and regulation based pathway enrichment analysis.

AVAILABILITY AND IMPLEMENTATION

GLANET can be run using its GUI or on command line. GLANET's source code is available at https://github.com/burcakotlu/GLANET . Tutorials are provided at https://glanet.readthedocs.org .

CONTACT

burcak@ceng.metu.edu.tr or oznur.tastan@cs.bilkent.edu.tr.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

基因组研究通过下一代测序 (NGS) 技术识别代表遗传变异、转录因子 (TF) 占据或组蛋白修饰的基因组位置。解释这些位置需要使用已知的基因组和表观基因组注释来评估它们。

结果

我们提出了 GLANET,它是一种综合注释和富集分析工具,实现了基于抽样的富集测试,可分别或联合考虑 GC 含量和/或可映射性偏差。GLANET 使用丰富的库对这些位置进行注释和富集分析。我们引入并进行了新的基于数据的计算实验,以评估其富集过程的功效和 I 型错误率,结果表明 GLANET 具有较高的统计功效和良好控制的 I 型错误率。作为一个关键功能,用户可以轻松地用新的基因集和基因组间隔来扩展其库。其他关键功能包括评估单核苷酸变异 (SNP) 对 TF 结合位点的影响和基于通路的调控富集分析。

可用性和实现

GLANET 可以使用其 GUI 或命令行运行。GLANET 的源代码可在 https://github.com/burcakotlu/GLANET 上获得。教程可在 https://glanet.readthedocs.org 上找到。

联系人

burcak@ceng.metu.edu.troznur.tastan@cs.bilkent.edu.tr

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
GLANET: genomic loci annotation and enrichment tool.
Bioinformatics. 2017 Sep 15;33(18):2818-2828. doi: 10.1093/bioinformatics/btx326.
2
annotatr: genomic regions in context.
Bioinformatics. 2017 Aug 1;33(15):2381-2383. doi: 10.1093/bioinformatics/btx183.
3
Functional annotation of genomic variants in studies of late-onset Alzheimer's disease.
Bioinformatics. 2018 Aug 15;34(16):2724-2731. doi: 10.1093/bioinformatics/bty177.
4
BinDNase: a discriminatory approach for transcription factor binding prediction using DNase I hypersensitivity data.
Bioinformatics. 2015 Sep 1;31(17):2852-9. doi: 10.1093/bioinformatics/btv294. Epub 2015 May 7.
5
ePIANNO: ePIgenomics ANNOtation tool.
PLoS One. 2016 Feb 9;11(2):e0148321. doi: 10.1371/journal.pone.0148321. eCollection 2016.
6
AGORA: organellar genome annotation from the amino acid and nucleotide references.
Bioinformatics. 2018 Aug 1;34(15):2661-2663. doi: 10.1093/bioinformatics/bty196.
7
fcScan: a versatile tool to cluster combinations of sites using genomic coordinates.
BMC Bioinformatics. 2020 May 19;21(1):194. doi: 10.1186/s12859-020-3536-4.
8
NGSphy: phylogenomic simulation of next-generation sequencing data.
Bioinformatics. 2018 Jul 15;34(14):2506-2507. doi: 10.1093/bioinformatics/bty146.
9
In-depth annotation of SNPs arising from resequencing projects using NGS-SNP.
Bioinformatics. 2011 Aug 15;27(16):2300-1. doi: 10.1093/bioinformatics/btr372. Epub 2011 Jun 22.
10
Discovery and genotyping of novel sequence insertions in many sequenced individuals.
Bioinformatics. 2017 Jul 15;33(14):i161-i169. doi: 10.1093/bioinformatics/btx254.

引用本文的文献

1
Demystifying non-coding GWAS variants: an overview of computational tools and methods.
Hum Mol Genet. 2022 Oct 20;31(R1):R73-R83. doi: 10.1093/hmg/ddac198.
3
NoRCE: non-coding RNA sets cis enrichment tool.
BMC Bioinformatics. 2021 Jun 2;22(1):294. doi: 10.1186/s12859-021-04112-9.
4
Analytical Approaches for ATAC-seq Data Analysis.
Curr Protoc Hum Genet. 2020 Jun;106(1):e101. doi: 10.1002/cphg.101.
5
Non-canonical RNA-DNA differences and other human genomic features are enriched within very short tandem repeats.
PLoS Comput Biol. 2020 Jun 8;16(6):e1007968. doi: 10.1371/journal.pcbi.1007968. eCollection 2020 Jun.
6
AnnoGen: annotating genome-wide pragmatic features.
Bioinformatics. 2020 May 1;36(9):2899-2901. doi: 10.1093/bioinformatics/btaa027.
7
JOA: Joint Overlap Analysis of multiple genomic interval sets.
BMC Bioinformatics. 2019 Mar 8;20(1):121. doi: 10.1186/s12859-019-2698-4.
8
Coloc-stats: a unified web interface to perform colocalization analysis of genomic features.
Nucleic Acids Res. 2018 Jul 2;46(W1):W186-W193. doi: 10.1093/nar/gky474.

本文引用的文献

1
Integromic analysis of genetic variation and gene expression identifies networks for cardiovascular disease phenotypes.
Circulation. 2015 Feb 10;131(6):536-49. doi: 10.1161/CIRCULATIONAHA.114.010696. Epub 2014 Dec 22.
2
A role for H3K4 monomethylation in gene repression and partitioning of chromatin readers.
Mol Cell. 2014 Mar 20;53(6):979-92. doi: 10.1016/j.molcel.2014.02.032.
3
PANOGA: a web server for identification of SNP-targeted pathways from genome-wide association study data.
Bioinformatics. 2014 May 1;30(9):1287-9. doi: 10.1093/bioinformatics/btt743. Epub 2014 Jan 11.
4
Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments.
Nucleic Acids Res. 2014 Mar;42(5):2976-87. doi: 10.1093/nar/gkt1249. Epub 2013 Dec 13.
5
JASPAR 2014: an extensively expanded and updated open-access database of transcription factor binding profiles.
Nucleic Acids Res. 2014 Jan;42(Database issue):D142-7. doi: 10.1093/nar/gkt997. Epub 2013 Nov 4.
6
GAT: a simulation framework for testing the association of genomic intervals.
Bioinformatics. 2013 Aug 15;29(16):2046-8. doi: 10.1093/bioinformatics/btt343. Epub 2013 Jun 18.
7
Relating genes to function: identifying enriched transcription factors using the ENCODE ChIP-Seq significance tool.
Bioinformatics. 2013 Aug 1;29(15):1922-4. doi: 10.1093/bioinformatics/btt316. Epub 2013 Jun 3.
8
Effects of GC bias in next-generation-sequencing data on de novo genome assembly.
PLoS One. 2013 Apr 29;8(4):e62856. doi: 10.1371/journal.pone.0062856. Print 2013.
9
An integrated map of genetic variation from 1,092 human genomes.
Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验