• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

OGRE:计算、可视化和分析基因组输入区域与公共注释之间的重叠。

OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations.

机构信息

Centre of Reproductive Medicine and Andrology, University of Münster, Albert-Schweitzer-Campus 1 Building D11, 48149, Munster, Germany.

Institute of Medical Informatics, University of Münster, Albert-Schweitzer-Campus 1 Building A11, 48149, Munster, Germany.

出版信息

BMC Bioinformatics. 2023 Jul 26;24(1):300. doi: 10.1186/s12859-023-05422-w.

DOI:10.1186/s12859-023-05422-w
PMID:37496002
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10369718/
Abstract

BACKGROUND

Modern genome sequencing leads to an ever-growing collection of genomic annotations. Combining these elements with a set of input regions (e.g. genes) would yield new insights in genomic associations, such as those involved in gene regulation. The required data are scattered across different databases making a manual approach tiresome, unpractical, and prone to error. Semi-automatic approaches require programming skills in data parsing, processing, overlap calculation, and visualization, which most biomedical researchers lack. Our aim was to develop an automated tool providing all necessary algorithms, benefiting both bioinformaticians and researchers without bioinformatic training.

RESULTS

We developed overlapping annotated genomic regions (OGRE) as a comprehensive tool to associate and visualize input regions with genomic annotations. It does so by parsing regions of interest, mining publicly available annotations, and calculating possible overlaps between them. The user can thus identify location, type, and number of associated regulatory elements. Results are presented as easy to understand visualizations and result tables. We applied OGRE to recent studies and could show high reproducibility and potential new insights. To demonstrate OGRE's performance in terms of running time and output, we have conducted a benchmark and compared its features with similar tools.

CONCLUSIONS

OGRE's functions and built-in annotations can be applied as a downstream overlap association step, which is compatible with most genomic sequencing outputs, and can thus enrich pre-existing analyses pipelines. Compared to similar tools, OGRE shows competitive performance, offers additional features, and has been successfully applied to two recent studies. Overall, OGRE addresses the lack of tools for automatic analysis, local genomic overlap calculation, and visualization by providing an easy to use, end-to-end solution for both biologists and computational scientists.

摘要

背景

现代基因组测序产生了越来越多的基因组注释。将这些元素与一组输入区域(例如基因)相结合,将产生新的基因组关联见解,例如参与基因调控的那些见解。所需的数据分散在不同的数据库中,使得手动方法繁琐、不切实际且容易出错。半自动方法需要在数据解析、处理、重叠计算和可视化方面具有编程技能,而大多数生物医学研究人员都缺乏这些技能。我们的目标是开发一种自动化工具,提供所有必要的算法,使生物信息学家和没有生物信息学培训的研究人员都受益。

结果

我们开发了重叠注释基因组区域(OGRE),作为一种综合工具,用于关联和可视化输入区域与基因组注释。它通过解析感兴趣的区域、挖掘公开可用的注释并计算它们之间的可能重叠来实现这一点。用户可以识别相关的调节元件的位置、类型和数量。结果以易于理解的可视化和结果表呈现。我们将 OGRE 应用于最近的研究中,能够显示出高度的可重复性和潜在的新见解。为了展示 OGRE 在运行时间和输出方面的性能,我们进行了基准测试,并将其功能与类似工具进行了比较。

结论

OGRE 的功能和内置注释可以作为下游重叠关联步骤应用,与大多数基因组测序输出兼容,因此可以丰富现有的分析管道。与类似工具相比,OGRE 具有竞争力的性能,提供了额外的功能,并已成功应用于最近的两项研究。总的来说,OGRE 通过提供一种易于使用的端到端解决方案,解决了缺乏自动分析、本地基因组重叠计算和可视化工具的问题,为生物学家和计算科学家都提供了帮助。

相似文献

1
OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations.OGRE:计算、可视化和分析基因组输入区域与公共注释之间的重叠。
BMC Bioinformatics. 2023 Jul 26;24(1):300. doi: 10.1186/s12859-023-05422-w.
2
zDB: bacterial comparative genomics made easy.zDB:轻松进行细菌比较基因组学研究。
mSystems. 2024 Jul 23;9(7):e0047324. doi: 10.1128/msystems.00473-24. Epub 2024 Jun 28.
3
4
CGKB: an annotation knowledge base for cowpea (Vigna unguiculata L.) methylation filtered genomic genespace sequences.CGKB:豇豆(Vigna unguiculata L.)甲基化过滤基因组基因空间序列的注释知识库。
BMC Bioinformatics. 2007 Apr 19;8:129. doi: 10.1186/1471-2105-8-129.
5
Statistical Viewer: a tool to upload and integrate linkage and association data as plots displayed within the Ensembl genome browser.统计查看器:一种用于上传和整合连锁与关联数据并将其作为图谱显示在Ensembl基因组浏览器中的工具。
BMC Bioinformatics. 2005 Apr 12;6:95. doi: 10.1186/1471-2105-6-95.
6
Mapping genomic features of tiling microarray data by TileMapper.使用TileMapper映射平铺微阵列数据的基因组特征。
Methods Mol Biol. 2013;1067:225-33. doi: 10.1007/978-1-62703-607-8_14.
7
GenomeQC: a quality assessment tool for genome assemblies and gene structure annotations.基因组 QC:基因组组装和基因结构注释的质量评估工具。
BMC Genomics. 2020 Mar 2;21(1):193. doi: 10.1186/s12864-020-6568-2.
8
ChromoMap: an R package for interactive visualization of multi-omics data and annotation of chromosomes.ChromoMap:一个用于多组学数据交互式可视化和染色体注释的 R 包。
BMC Bioinformatics. 2022 Jan 11;23(1):33. doi: 10.1186/s12859-021-04556-z.
9
Structural and functional-annotation of an equine whole genome oligoarray.马全基因组寡核苷酸芯片的结构和功能注释。
BMC Bioinformatics. 2009 Oct 8;10 Suppl 11(Suppl 11):S8. doi: 10.1186/1471-2105-10-S11-S8.
10
MILANO--custom annotation of microarray results using automatic literature searches.米兰——使用自动文献检索对微阵列结果进行定制注释。
BMC Bioinformatics. 2005 Jan 20;6:12. doi: 10.1186/1471-2105-6-12.

本文引用的文献

1
Single-cell RNA-seq unravels alterations of the human spermatogonial stem cell compartment in patients with impaired spermatogenesis.单细胞 RNA 测序揭示了生精功能障碍患者精原干细胞区室的改变。
Cell Rep Med. 2021 Sep 9;2(9):100395. doi: 10.1016/j.xcrm.2021.100395. eCollection 2021 Sep 21.
2
GENCODE 2021.GENCODE 2021.
Nucleic Acids Res. 2021 Jan 8;49(D1):D916-D923. doi: 10.1093/nar/gkaa1087.
3
The UCSC Genome Browser database: 2021 update.UCSC 基因组浏览器数据库:2021 年更新。
Nucleic Acids Res. 2021 Jan 8;49(D1):D1046-D1057. doi: 10.1093/nar/gkaa1070.
4
Epigenetics in Health and Disease.《健康与疾病中的表观遗传学》
Adv Exp Med Biol. 2020;1253:3-55. doi: 10.1007/978-981-15-3449-2_1.
5
Determinants of transcription factor regulatory range.转录因子调控范围的决定因素。
Nat Commun. 2020 May 18;11(1):2472. doi: 10.1038/s41467-020-16106-x.
6
Long walk to genomics: History and current approaches to genome sequencing and assembly.通往基因组学的漫长之路:基因组测序与组装的历史及当前方法
Comput Struct Biotechnol J. 2019 Nov 17;18:9-19. doi: 10.1016/j.csbj.2019.11.002. eCollection 2020.
7
JASPAR 2020: update of the open-access database of transcription factor binding profiles.JASPAR 2020:转录因子结合谱开放获取数据库的更新。
Nucleic Acids Res. 2020 Jan 8;48(D1):D87-D92. doi: 10.1093/nar/gkz1001.
8
Ensembl 2020.Ensembl 2020.
Nucleic Acids Res. 2020 Jan 8;48(D1):D682-D688. doi: 10.1093/nar/gkz966.
9
Overlapping protein-coding genes in human genome and their coincidental expression in tissues.人类基因组中重叠的蛋白编码基因及其在组织中的偶然表达。
Sci Rep. 2019 Sep 16;9(1):13377. doi: 10.1038/s41598-019-49802-w.
10
CiiiDER: A tool for predicting and analysing transcription factor binding sites.CiiiDER:一个用于预测和分析转录因子结合位点的工具。
PLoS One. 2019 Sep 4;14(9):e0215495. doi: 10.1371/journal.pone.0215495. eCollection 2019.