• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因组变异基准:如果无法衡量,就无法改进。

Genomic variant benchmark: if you cannot measure it, you cannot improve it.

机构信息

Department of Computational Biology, University of Lausanne, 1015, Lausanne, Switzerland.

SIB Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland.

出版信息

Genome Biol. 2023 Oct 5;24(1):221. doi: 10.1186/s13059-023-03061-1.

DOI:10.1186/s13059-023-03061-1
PMID:37798733
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10552390/
Abstract

Genomic benchmark datasets are essential to driving the field of genomics and bioinformatics. They provide a snapshot of the performances of sequencing technologies and analytical methods and highlight future challenges. However, they depend on sequencing technology, reference genome, and available benchmarking methods. Thus, creating a genomic benchmark dataset is laborious and highly challenging, often involving multiple sequencing technologies, different variant calling tools, and laborious manual curation. In this review, we discuss the available benchmark datasets and their utility. Additionally, we focus on the most recent benchmark of genes with medical relevance and challenging genomic complexity.

摘要

基因组基准数据集对于推动基因组学和生物信息学领域至关重要。它们提供了测序技术和分析方法性能的快照,并突出了未来的挑战。然而,它们依赖于测序技术、参考基因组和可用的基准测试方法。因此,创建基因组基准数据集是一项费力且极具挑战性的工作,通常涉及多种测序技术、不同的变异调用工具以及繁琐的人工整理。在这篇综述中,我们讨论了可用的基准数据集及其用途。此外,我们还重点介绍了最近具有医学相关性和挑战性基因组复杂性的基因基准数据集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2d6/10552390/87e6220c9ada/13059_2023_3061_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2d6/10552390/633812121d2d/13059_2023_3061_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2d6/10552390/bbd32bdd93c8/13059_2023_3061_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2d6/10552390/673ffd7f5bdd/13059_2023_3061_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2d6/10552390/87e6220c9ada/13059_2023_3061_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2d6/10552390/633812121d2d/13059_2023_3061_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2d6/10552390/bbd32bdd93c8/13059_2023_3061_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2d6/10552390/673ffd7f5bdd/13059_2023_3061_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2d6/10552390/87e6220c9ada/13059_2023_3061_Fig4_HTML.jpg

相似文献

1
Genomic variant benchmark: if you cannot measure it, you cannot improve it.基因组变异基准:如果无法衡量,就无法改进。
Genome Biol. 2023 Oct 5;24(1):221. doi: 10.1186/s13059-023-03061-1.
2
SMaSH: a benchmarking toolkit for human genome variant calling.SMaSH:一种用于人类基因组变异检测的基准测试工具包。
Bioinformatics. 2014 Oct;30(19):2787-95. doi: 10.1093/bioinformatics/btu345. Epub 2014 Jun 3.
3
An open resource for accurately benchmarking small variant and reference calls.用于准确基准测试小型变体和参考调用的开放资源。
Nat Biotechnol. 2019 May;37(5):561-566. doi: 10.1038/s41587-019-0074-6. Epub 2019 Apr 1.
4
Variant calling and benchmarking in an era of complete human genome sequences.全基因组序列时代的变异调用和基准测试。
Nat Rev Genet. 2023 Jul;24(7):464-483. doi: 10.1038/s41576-023-00590-0. Epub 2023 Apr 14.
5
NCBench: providing an open, reproducible, transparent, adaptable, and continuous benchmark approach for DNA-sequencing-based variant calling.NCBench:提供一种开放、可重复、透明、可适应和持续的基于 DNA 测序的变异调用基准方法。
F1000Res. 2024 Sep 12;12:1125. doi: 10.12688/f1000research.140344.1. eCollection 2023.
6
Leveraging long read sequencing from a single individual to provide a comprehensive resource for benchmarking variant calling methods.利用来自单个个体的长读长测序技术,为变异检测方法的基准测试提供全面资源。
Sci Rep. 2015 Sep 28;5:14493. doi: 10.1038/srep14493.
7
Benchmarking multi-platform sequencing technologies for human genome assembly.多平台测序技术在人类基因组组装中的基准测试。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad300.
8
Integrated approach to generate artificial samples with low tumor fraction for somatic variant calling benchmarking.综合方法生成低肿瘤分数的人工样本用于体细胞变异calling 基准测试。
BMC Bioinformatics. 2024 May 8;25(1):180. doi: 10.1186/s12859-024-05793-8.
9
Extend the benchmarking indel set by manual review using the individual cell line sequencing data from the Sequencing Quality Control 2 (SEQC2) project.使用来自测序质量控制 2(SEQC2)项目的个体细胞系测序数据,通过手动审查扩展基准插入缺失集。
Sci Rep. 2024 Mar 25;14(1):7028. doi: 10.1038/s41598-024-57439-7.
10
Medical implications of technical accuracy in genome sequencing.基因组测序技术准确性的医学意义。
Genome Med. 2016 Mar 2;8(1):24. doi: 10.1186/s13073-016-0269-0.

引用本文的文献

1
Performance comparison of germline variant calling tools in sporadic disease cohorts.散发性疾病队列中种系变异检测工具的性能比较
Mol Genet Genomics. 2025 Sep 6;300(1):90. doi: 10.1007/s00438-025-02292-0.
2
The Platinum Pedigree: a long-read benchmark for genetic variants.铂金谱系:遗传变异的长读长基准
Nat Methods. 2025 Aug;22(8):1669-1676. doi: 10.1038/s41592-025-02750-y. Epub 2025 Aug 4.
3
Interlaboratory evaluation of high molecular weight DNA extraction methods for long-read sequencing and structural variant analysis.

本文引用的文献

1
Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes.泛基因组的多尺度分析能够改善对重复和临床相关基因的基因组多样性的表示。
Nat Methods. 2023 Aug;20(8):1213-1221. doi: 10.1038/s41592-023-01914-y. Epub 2023 Jun 26.
2
A draft human pangenome reference.人类泛基因组参考草图。
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.
3
Recombination between heterologous human acrocentric chromosomes.异源人类近端着丝粒染色体之间的重组。
用于长读长测序和结构变异分析的高分子量DNA提取方法的实验室间评估。
BMC Genomics. 2025 Jul 28;26(1):698. doi: 10.1186/s12864-025-11792-7.
4
Accurate, Scalable Structural Variant Genotyping in Complex Genomes at Population Scales.群体规模下复杂基因组中准确、可扩展的结构变异基因分型
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf180.
5
A Hitchhiker's Guide to long-read genomic analysis.长读长基因组分析指南
Genome Res. 2025 Apr 14;35(4):545-558. doi: 10.1101/gr.279975.124.
6
Analytical validation of germline small variant detection using long-read HiFi genome sequencing.使用长读长HiFi基因组测序进行种系小变异检测的分析验证
Genome Res. 2025 Jun 2;35(6):1391-1399. doi: 10.1101/gr.278836.123.
7
Closing the gaps, and improving somatic structural variant analysis and benchmarking using CHM13-T2T.缩小差距,并利用CHM13-T2T改进体细胞结构变异分析和基准测试。
Genome Res. 2025 Apr 14;35(4):621-631. doi: 10.1101/gr.279352.124.
8
GREGoR: Accelerating Genomics for Rare Diseases.GREGoR:加速罕见病基因组学研究
ArXiv. 2024 Dec 18:arXiv:2412.14338v1.
9
Comprehensive genome analysis and variant detection at scale using DRAGEN.使用DRAGEN进行大规模的全基因组分析和变异检测。
Nat Biotechnol. 2024 Oct 25. doi: 10.1038/s41587-024-02382-1.
10
The GIAB genomic stratifications resource for human reference genomes.GIAB 基因组分层资源用于人类参考基因组。
Nat Commun. 2024 Oct 19;15(1):9029. doi: 10.1038/s41467-024-53260-y.
Nature. 2023 May;617(7960):335-343. doi: 10.1038/s41586-023-05976-y. Epub 2023 May 10.
4
Increased mutation and gene conversion within human segmental duplications.人类片段重复序列中突变和基因转换的增加。
Nature. 2023 May;617(7960):325-334. doi: 10.1038/s41586-023-05895-y. Epub 2023 May 10.
5
FixItFelix: improving genomic analysis by fixing reference errors.FixItFelix:通过修复参考错误来改进基因组分析。
Genome Biol. 2023 Feb 21;24(1):31. doi: 10.1186/s13059-023-02863-7.
6
A multi-platform reference for somatic structural variation detection.用于体细胞结构变异检测的多平台参考资料。
Cell Genom. 2022 Jun 8;2(6):100139. doi: 10.1016/j.xgen.2022.100139.
7
Truvari: refined structural variant comparison preserves allelic diversity.特鲁瓦里:精细化结构变异比较保留等位基因多样性。
Genome Biol. 2022 Dec 27;23(1):271. doi: 10.1186/s13059-022-02840-6.
8
Benchmarking challenging small variants with linked and long reads.使用连锁读段和长读段对具有挑战性的小变异进行基准测试。
Cell Genom. 2022 May;2(5). doi: 10.1016/j.xgen.2022.100128.
9
Library adaptors with integrated reference controls improve the accuracy and reliability of nanopore sequencing.带有集成参考控制的文库接头可提高纳米孔测序的准确性和可靠性。
Nat Commun. 2022 Oct 28;13(1):6437. doi: 10.1038/s41467-022-34028-8.
10
An evaluation of RNA-seq differential analysis methods.RNA-seq 差异分析方法评估。
PLoS One. 2022 Sep 16;17(9):e0264246. doi: 10.1371/journal.pone.0264246. eCollection 2022.