• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

rGREAT:一个用于基因组区域功能富集的 R/bioconductor 包。

rGREAT: an R/bioconductor package for functional enrichment on genomic regions.

机构信息

Molecular Precision Oncology Program, National Center for Tumor Diseases (NCT), Heidelberg 69120, Germany.

Heidelberg Institute of Stem Cell Technology and Experimental Medicine (HI-STEM), Heidelberg 69120, Germany.

出版信息

Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac745.

DOI:10.1093/bioinformatics/btac745
PMID:36394265
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9805586/
Abstract

SUMMARY

GREAT (Genomic Regions Enrichment of Annotations Tool) is a widely used tool for functional enrichment on genomic regions. However, as an online tool, it has limitations of outdated annotation data, small numbers of supported organisms and gene set collections, and not being extensible for users. Here, we developed a new R/Bioconductorpackage named rGREAT which implements the GREAT algorithm locally. rGREAT by default supports more than 600 organisms and a large number of gene set collections, as well as self-provided gene sets and organisms from users. Additionally, it implements a general method for dealing with background regions.

AVAILABILITY AND IMPLEMENTATION

The package rGREAT is freely available from the Bioconductor project: https://bioconductor.org/packages/rGREAT/. The development version is available at https://github.com/jokergoo/rGREAT. Gene Ontology gene sets for more than 600 organisms retrieved from Ensembl BioMart are presented in an R package BioMartGOGeneSets which is available at https://github.com/jokergoo/BioMartGOGeneSets.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

摘要

GREAT(基因组区域注释富集工具)是一个广泛应用于基因组区域功能富集的工具。然而,作为一个在线工具,它存在注释数据过时、支持的生物种类和基因集数量有限以及不可扩展等问题。在这里,我们开发了一个名为 rGREAT 的新的 R/Bioconductor 包,它在本地实现了 GREAT 算法。rGREAT 默认支持超过 600 种生物和大量的基因集集,以及用户提供的自定义基因集和生物。此外,它还实现了一种处理背景区域的通用方法。

可用性和实现

rGREAT 包可从 Bioconductor 项目免费获得:https://bioconductor.org/packages/rGREAT/。开发版本可在 https://github.com/jokergoo/rGREAT 获得。从 Ensembl BioMart 检索到的超过 600 种生物的基因本体基因集以 R 包 BioMartGOGeneSets 的形式呈现,该 R 包可在 https://github.com/jokergoo/BioMartGOGeneSets 获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf1e/9805586/59bc68566354/btac745f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf1e/9805586/59bc68566354/btac745f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf1e/9805586/59bc68566354/btac745f1.jpg

相似文献

1
rGREAT: an R/bioconductor package for functional enrichment on genomic regions.rGREAT:一个用于基因组区域功能富集的 R/bioconductor 包。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac745.
2
PanViz: interactive visualization of the structure of functionally annotated pangenomes.PanViz:功能注释泛基因组结构的交互式可视化
Bioinformatics. 2017 Apr 1;33(7):1081-1082. doi: 10.1093/bioinformatics/btw761.
3
Using meshes for MeSH term enrichment and semantic analyses.使用网格进行 MeSH 术语富集和语义分析。
Bioinformatics. 2018 Nov 1;34(21):3766-3767. doi: 10.1093/bioinformatics/bty410.
4
GenVisR: Genomic Visualizations in R.GenVisR:R语言中的基因组可视化
Bioinformatics. 2016 Oct 1;32(19):3012-4. doi: 10.1093/bioinformatics/btw325. Epub 2016 Jun 10.
5
monaLisa: an R/Bioconductor package for identifying regulatory motifs.monaLisa:用于识别调控基序的 R/Bioconductor 包。
Bioinformatics. 2022 Apr 28;38(9):2624-2625. doi: 10.1093/bioinformatics/btac102.
6
microbiomeMarker: an R/Bioconductor package for microbiome marker identification and visualization.microbiomeMarker:一个用于微生物组标记物识别和可视化的 R/Bioconductor 包。
Bioinformatics. 2022 Aug 10;38(16):4027-4029. doi: 10.1093/bioinformatics/btac438.
7
ensembldb: an R package to create and use Ensembl-based annotation resources.ensembldb:一个用于创建和使用基于Ensembl注释资源的R包。
Bioinformatics. 2019 Sep 1;35(17):3151-3153. doi: 10.1093/bioinformatics/btz031.
8
DNAshapeR: an R/Bioconductor package for DNA shape prediction and feature encoding.DNAshapeR:一个用于DNA形状预测和特征编码的R/Bioconductor软件包。
Bioinformatics. 2016 Apr 15;32(8):1211-3. doi: 10.1093/bioinformatics/btv735. Epub 2015 Dec 14.
9
Synaptome.db: a bioconductor package for synaptic proteomics data.Synaptome.db:一个用于突触蛋白质组学数据的生物导体包。
Bioinform Adv. 2022 Nov 12;2(1):vbac086. doi: 10.1093/bioadv/vbac086. eCollection 2022.
10
Spathial: an R package for the evolutionary analysis of biological data.Spathial:用于生物数据进化分析的 R 包。
Bioinformatics. 2020 Nov 1;36(17):4664-4667. doi: 10.1093/bioinformatics/btaa273.

引用本文的文献

1
Complex genetic architecture underlies human hand and foot evolution.复杂的遗传结构是人类手足进化的基础。
Res Sq. 2025 Aug 18:rs.3.rs-7124496. doi: 10.21203/rs.3.rs-7124496/v1.
2
Mammalian conservation of endogenous G-quadruplex reveals their associations with complex traits.哺乳动物内源性G-四链体的保守性揭示了它们与复杂性状的关联。
Genome Biol. 2025 Sep 1;26(1):262. doi: 10.1186/s13059-025-03750-z.
3
A stem cell differentiation model reveals two alternative fates in CBFA2T3::GLIS2-driven acute megakaryoblastic leukemia initiation.

本文引用的文献

1
GENCODE 2021.GENCODE 2021.
Nucleic Acids Res. 2021 Jan 8;49(D1):D916-D923. doi: 10.1093/nar/gkaa1087.
2
Mind the gaps: overlooking inaccessible regions confounds statistical testing in genome analysis.注意空白:在基因组分析中忽略难以获取的区域会使统计检验产生偏差。
BMC Bioinformatics. 2018 Dec 14;19(1):481. doi: 10.1186/s12859-018-2438-1.
3
LOLA: enrichment analysis for genomic region sets and regulatory elements in R and Bioconductor.LOLA:R和Bioconductor中基因组区域集和调控元件的富集分析。
一种干细胞分化模型揭示了在CBFA2T3::GLIS2驱动的急性巨核细胞白血病起始过程中的两种不同命运。
Commun Biol. 2025 Aug 27;8(1):1289. doi: 10.1038/s42003-025-08730-4.
4
NanoTag is an improved method to map interactions between DNA and proteins not requiring IgG.纳米标签是一种改进的方法,用于绘制DNA与蛋白质之间的相互作用,无需免疫球蛋白。
Sci Rep. 2025 Aug 6;15(1):28672. doi: 10.1038/s41598-025-13316-5.
5
IT-scC&T-seq streamlines scalable, parallel profiling of protein-DNA interactions in single cells.IT-scC&T-seq简化了单细胞中蛋白质与DNA相互作用的可扩展并行分析。
Genome Biol. 2025 Jul 7;26(1):196. doi: 10.1186/s13059-025-03661-z.
6
Single-cell long-read Hi-C, scNanoHi-C2, details 3D genome reorganization in embryonic-stage germ cells.单细胞长读长Hi-C技术,即scNanoHi-C2,揭示了胚胎期生殖细胞中的三维基因组重组细节。
Nat Struct Mol Biol. 2025 Jul 4. doi: 10.1038/s41594-025-01604-7.
7
Decoding DNA sequence-driven evolution of the human brain epigenome at cellular resolution.在细胞分辨率下解码人类大脑表观基因组的DNA序列驱动进化。
Nat Commun. 2025 Jul 1;16(1):5625. doi: 10.1038/s41467-025-60665-w.
8
Macrophages with different origins proliferate and do not lose their core intrinsic features.不同来源的巨噬细胞会增殖,且不会丧失其核心固有特征。
iScience. 2025 May 12;28(6):112635. doi: 10.1016/j.isci.2025.112635. eCollection 2025 Jun 20.
9
Accumulation of Epigenetic Noise in the Aging Corneal Epithelium and Its Possible Mechanism.衰老角膜上皮中表观遗传噪声的积累及其可能机制。
FASEB J. 2025 Jun 15;39(11):e70699. doi: 10.1096/fj.202500954R.
10
Discovery of candidate functional non-coding mutations in acute myeloid leukemia using single-cell chromatin accessibility sequencing.利用单细胞染色质可及性测序发现急性髓系白血病中的候选功能性非编码突变
Commun Biol. 2025 May 26;8(1):808. doi: 10.1038/s42003-025-08257-8.
Bioinformatics. 2016 Feb 15;32(4):587-9. doi: 10.1093/bioinformatics/btv612. Epub 2015 Oct 27.
4
Ensembl BioMarts: a hub for data retrieval across taxonomic space.Ensembl BioMarts:跨分类空间检索数据的中心枢纽。
Database (Oxford). 2011 Jul 23;2011:bar030. doi: 10.1093/database/bar030. Print 2011.
5
Molecular signatures database (MSigDB) 3.0.分子特征数据库(MSigDB)3.0。
Bioinformatics. 2011 Jun 15;27(12):1739-40. doi: 10.1093/bioinformatics/btr260. Epub 2011 May 5.
6
GREAT improves functional interpretation of cis-regulatory regions.GREAT 提高了顺式调控区域的功能解释。
Nat Biotechnol. 2010 May;28(5):495-501. doi: 10.1038/nbt.1630. Epub 2010 May 2.
7
BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis.生物数据库与芯片分析的强大纽带——生物集市与生物导体
Bioinformatics. 2005 Aug 15;21(16):3439-40. doi: 10.1093/bioinformatics/bti525.
8
Ontological analysis of gene expression data: current tools, limitations, and open problems.基因表达数据的本体分析:当前工具、局限性及开放问题
Bioinformatics. 2005 Sep 15;21(18):3587-95. doi: 10.1093/bioinformatics/bti565. Epub 2005 Jun 30.