• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于高效基因组区间查询的工具的全面基准测试。

A comprehensive benchmark of tools for efficient genomic interval querying.

作者信息

Schäfer Richard A, Yang Rendong

机构信息

Department of Urology, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, United States.

Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, United States.

出版信息

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf379.

DOI:10.1093/bib/bbaf379
PMID:40728859
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12306436/
Abstract

Efficiently querying genomic intervals is fundamental to modern bioinformatics, enabling researchers to extract and analyze specific regions from large genomic datasets. While various tools have been developed for this purpose, there lacks a comprehensive comparison of their performance, memory usage, and practical utility. We present a systematic evaluation of genomic interval query tools using simulated datasets of varying sizes. Our benchmarking framework, segmeter, assesses both basic and complex interval queries, examining runtime performance, memory efficiency, and query precision across different tools. This comprehensive analysis provides insights into the strengths and limitations of different approaches to genomic interval querying, offering guidance for tool selection based on specific use cases and data requirements. The segmeter framework and all benchmark data are freely available, facilitating reproducibility and enabling researchers to conduct their own comparative analyses.

摘要

高效查询基因组区间是现代生物信息学的基础,使研究人员能够从大型基因组数据集中提取和分析特定区域。虽然已经为此开发了各种工具,但缺乏对它们的性能、内存使用情况和实际效用的全面比较。我们使用不同大小的模拟数据集对基因组区间查询工具进行了系统评估。我们的基准测试框架segmeter评估基本和复杂的区间查询,检查不同工具的运行时性能、内存效率和查询精度。这种全面分析深入了解了基因组区间查询不同方法的优缺点,为根据特定用例和数据要求选择工具提供了指导。segmeter框架和所有基准数据均可免费获取,便于重复使用,并使研究人员能够进行自己的比较分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/4d05879a0130/bbaf379f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/e48aad0d3c6f/bbaf379f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/5146e288356a/bbaf379f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/c7ce547dcef1/bbaf379f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/393b7f6d3c76/bbaf379f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/16e7e598c784/bbaf379f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/4d05879a0130/bbaf379f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/e48aad0d3c6f/bbaf379f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/5146e288356a/bbaf379f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/c7ce547dcef1/bbaf379f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/393b7f6d3c76/bbaf379f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/16e7e598c784/bbaf379f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9257/12306436/4d05879a0130/bbaf379f6.jpg

相似文献

1
A comprehensive benchmark of tools for efficient genomic interval querying.用于高效基因组区间查询的工具的全面基准测试。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf379.
2
A Comprehensive Benchmark of Tools for Efficient Genomic Interval Querying.高效基因组区间查询工具的综合基准测试
bioRxiv. 2025 Feb 26:2025.02.19.639167. doi: 10.1101/2025.02.19.639167.
3
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.
4
Towards a standard benchmark for phenotype-driven variant and gene prioritisation algorithms: PhEval - Phenotypic inference Evaluation framework.迈向用于表型驱动的变异体和基因优先级排序算法的标准基准:PhEval - 表型推断评估框架。
BMC Bioinformatics. 2025 Mar 22;26(1):87. doi: 10.1186/s12859-025-06105-4.
5
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
6
Neuraminidase inhibitors for preventing and treating influenza in healthy adults and children.用于预防和治疗健康成人及儿童流感的神经氨酸酶抑制剂。
Cochrane Database Syst Rev. 2012 Jan 18;1:CD008965. doi: 10.1002/14651858.CD008965.pub3.
7
Influence of Sequencing Technology on Pangenome-level Analysis and Detection of Antimicrobial Resistance Genes in ESKAPE Pathogens.测序技术对泛基因组水平分析及ESKAPE病原体中抗菌药物耐药基因检测的影响
bioRxiv. 2025 Jan 10:2025.01.08.631980. doi: 10.1101/2025.01.08.631980.
8
Benchmarking transcription factor binding site prediction models: a comparative analysis on synthetic and biological data.基准测试转录因子结合位点预测模型:对合成数据和生物数据的比较分析
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf363.
9
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
10
Genomic investigations of successful invasions: the picture emerging from recent studies.成功入侵的基因组学研究:近期研究呈现的图景
Biol Rev Camb Philos Soc. 2025 Jun;100(3):1396-1418. doi: 10.1111/brv.70005. Epub 2025 Feb 16.

本文引用的文献

1
ScanNeo2: a comprehensive workflow for neoantigen detection and immunogenicity prediction from diverse genomic and transcriptomic alterations.ScanNeo2:一种从多种基因组和转录组改变中检测新抗原和预测免疫原性的综合工作流程。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad659.
2
Genetic associations of protein-coding variants in human disease.人类疾病相关蛋白编码变异的遗传关联。
Nature. 2022 Mar;603(7899):95-102. doi: 10.1038/s41586-022-04394-w. Epub 2022 Feb 23.
3
RNAnue: efficient data analysis for RNA-RNA interactomics.
RNAnue:用于RNA - RNA相互作用组学的高效数据分析
Nucleic Acids Res. 2021 Jun 4;49(10):5493-5501. doi: 10.1093/nar/gkab340.
4
Comparative analysis of commonly used peak calling programs for ChIP-Seq analysis.用于ChIP-Seq分析的常用峰检测程序的比较分析。
Genomics Inform. 2020 Dec;18(4):e42. doi: 10.5808/GI.2020.18.4.e42. Epub 2020 Dec 14.
5
IGD: high-performance search for large-scale genomic interval datasets.IGD:对大规模基因组区间数据集进行高性能搜索。
Bioinformatics. 2021 Apr 9;37(1):118-120. doi: 10.1093/bioinformatics/btaa1062.
6
Bedtk: finding interval overlap with implicit interval tree.贝德克:使用隐式区间树查找区间重叠。
Bioinformatics. 2021 Jun 9;37(9):1315-1316. doi: 10.1093/bioinformatics/btaa827.
7
GLASSgo in Galaxy: high-throughput, reproducible and easy-to-integrate prediction of sRNA homologs.GLASSgo 在 Galaxy 中的应用:高通量、可重复且易于整合的 sRNA 同源物预测。
Bioinformatics. 2020 Aug 1;36(15):4357-4359. doi: 10.1093/bioinformatics/btaa556.
8
Augmented Interval List: a novel data structure for efficient genomic interval search.增强型区间列表:一种用于高效基因组区间搜索的新型数据结构。
Bioinformatics. 2019 Dec 1;35(23):4907-4911. doi: 10.1093/bioinformatics/btz407.
9
GLASSgo - Automated and Reliable Detection of sRNA Homologs From a Single Input Sequence.GLASSgo——从单一输入序列中自动且可靠地检测小RNA同源物
Front Genet. 2018 Apr 17;9:124. doi: 10.3389/fgene.2018.00124. eCollection 2018.
10
GIGGLE: a search engine for large-scale integrated genome analysis.GIGGLE:一个用于大规模综合基因组分析的搜索引擎。
Nat Methods. 2018 Feb;15(2):123-126. doi: 10.1038/nmeth.4556. Epub 2018 Jan 8.