• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

来自X和Y染色体完整组装的小变异基准。

Small variant benchmark from a complete assembly of X and Y chromosomes.

作者信息

Wagner Justin, Olson Nathan D, McDaniel Jennifer, Harris Lindsay, Pinto Brendan J, Jáspez David, Muñoz-Barrera Adrián, Rubio-Rodríguez Luis A, Lorenzo-Salazar José M, Flores Carlos, Sahraeian Sayed Mohammad Ebrahim, Narzisi Giuseppe, Byrska-Bishop Marta, Evani Uday S, Xiao Chunlin, Lake Juniper A, Fontana Peter, Greenberg Craig, Freed Donald, Mootor Mohammed Faizal Eeman, Boutros Paul C, Murray Lisa, Shafin Kishwar, Carroll Andrew, Sedlazeck Fritz J, Wilson Melissa, Zook Justin M

机构信息

Material Measurement Laboratory, National Institute of Standards and Technology, 100 Bureau Dr., Gaithersburg, MD, USA.

Center for Evolution & Medicine and School of Life Sciences, Arizona State University, Tempe, AZ 85281 USA - Department of Zoology, Milwaukee Public Museum, Milwaukee, WI, USA.

出版信息

Nat Commun. 2025 Jan 8;16(1):497. doi: 10.1038/s41467-024-55710-z.

DOI:10.1038/s41467-024-55710-z
PMID:39779690
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11711550/
Abstract

The sex chromosomes contain complex, important genes impacting medical phenotypes, but differ from the autosomes in their ploidy and large repetitive regions. To enable technology developers along with research and clinical laboratories to evaluate variant detection on male sex chromosomes X and Y, we create a small variant benchmark set with 111,725 variants for the Genome in a Bottle HG002 reference material. We develop an active evaluation approach to demonstrate the benchmark set reliably identifies errors in challenging genomic regions and across short and long read callsets. We show how complete assemblies can expand benchmarks to difficult regions, but highlight remaining challenges benchmarking variants in long homopolymers and tandem repeats, complex gene conversions, copy number variable gene arrays, and human satellites.

摘要

性染色体包含影响医学表型的复杂且重要的基因,但在倍性和大的重复区域方面与常染色体不同。为了使技术开发者以及研究和临床实验室能够评估男性性染色体X和Y上的变异检测,我们为“瓶中基因组”HG002参考材料创建了一个包含111,725个变异的小变异基准集。我们开发了一种主动评估方法,以证明该基准集能够可靠地识别具有挑战性的基因组区域以及短读长和长读长调用集中的错误。我们展示了完整的组装如何将基准扩展到困难区域,但也强调了在长同聚物和串联重复序列、复杂基因转换、拷贝数可变基因阵列以及人类卫星序列中对变异进行基准测试时仍然存在的挑战。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e489/11711550/8045f6d5a11e/41467_2024_55710_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e489/11711550/ab1c7aed9496/41467_2024_55710_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e489/11711550/77d1acadd464/41467_2024_55710_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e489/11711550/40c7f0364eac/41467_2024_55710_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e489/11711550/8045f6d5a11e/41467_2024_55710_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e489/11711550/ab1c7aed9496/41467_2024_55710_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e489/11711550/77d1acadd464/41467_2024_55710_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e489/11711550/40c7f0364eac/41467_2024_55710_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e489/11711550/8045f6d5a11e/41467_2024_55710_Fig4_HTML.jpg

相似文献

1
Small variant benchmark from a complete assembly of X and Y chromosomes.来自X和Y染色体完整组装的小变异基准。
Nat Commun. 2025 Jan 8;16(1):497. doi: 10.1038/s41467-024-55710-z.
2
A robust benchmark for detecting low-frequency variants in the HG002 Genome In A Bottle NIST reference material.用于检测基因组在瓶 NIST 参考材料 HG002 中低频变异的强大基准。
bioRxiv. 2024 Dec 5:2024.12.02.625685. doi: 10.1101/2024.12.02.625685.
3
Analysis and benchmarking of small and large genomic variants across tandem repeats.串联重复序列中小的和大的基因组变异的分析与基准测试。
Nat Biotechnol. 2025 Mar;43(3):431-442. doi: 10.1038/s41587-024-02225-z. Epub 2024 Apr 26.
4
Improved epigenetic age prediction models by combining sex chromosome and autosomal markers.通过结合性染色体和常染色体标记改进表观遗传年龄预测模型。
Epigenetics Chromatin. 2025 Jul 15;18(1):45. doi: 10.1186/s13072-025-00606-5.
5
NCBench: providing an open, reproducible, transparent, adaptable, and continuous benchmark approach for DNA-sequencing-based variant calling.NCBench:提供一种开放、可重复、透明、可适应和持续的基于 DNA 测序的变异调用基准方法。
F1000Res. 2024 Sep 12;12:1125. doi: 10.12688/f1000research.140344.1. eCollection 2023.
6
Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies.从长读序列组装和短读基因组测序技术中检测结构变异的预期和盲点。
Am J Hum Genet. 2021 May 6;108(5):919-928. doi: 10.1016/j.ajhg.2021.03.014. Epub 2021 Mar 30.
7
Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA?一项初步评估。
Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.
8
Transposable elements and heterochromatic regions are enriched for structural variation and sequence divergence in the genome of wild-type Caenorhabditis elegans.在野生型秀丽隐杆线虫的基因组中,转座元件和异染色质区域富含结构变异和序列差异。
G3 (Bethesda). 2025 Jul 9;15(7). doi: 10.1093/g3journal/jkaf092.
9
Best practices for improving alignment and variant calling on human sex chromosomes.
bioRxiv. 2025 Apr 30:2025.04.29.651297. doi: 10.1101/2025.04.29.651297.
10
Transposable element expression is associated with sex chromosome number in humans.转座元件的表达与人类性染色体数量相关。
PLoS Genet. 2025 Jun 26;21(6):e1011668. doi: 10.1371/journal.pgen.1011668. eCollection 2025 Jun.

引用本文的文献

1
Unraveling the hidden complexity of cancer through long-read sequencing.通过长读长测序揭示癌症隐藏的复杂性。
Genome Res. 2025 Apr 14;35(4):599-620. doi: 10.1101/gr.280041.124.
2
Sex Differences in Cancer Functional Genomics: Gene Dependency and Drug Sensitivity.癌症功能基因组学中的性别差异:基因依赖性与药物敏感性
bioRxiv. 2025 Mar 21:2025.02.05.636540. doi: 10.1101/2025.02.05.636540.
3
Metapipeline-DNA: A Comprehensive Germline & Somatic Genomics Nextflow Pipeline.Metapipeline-DNA:一个全面的种系和体细胞基因组学Nextflow流程。

本文引用的文献

1
Exploring gene content with pangene graphs.利用泛基因图探索基因内容。
Bioinformatics. 2024 Jul 23;40(7). doi: 10.1093/bioinformatics/btae456.
2
The complete sequence of a human Y chromosome.人类 Y 染色体的完整序列。
Nature. 2023 Sep;621(7978):344-354. doi: 10.1038/s41586-023-06457-y. Epub 2023 Aug 23.
3
Assembly of 43 human Y chromosomes reveals extensive complexity and variation.43 个人类 Y 染色体的组装揭示了广泛的复杂性和变异性。
bioRxiv. 2025 Apr 25:2024.09.04.611267. doi: 10.1101/2024.09.04.611267.
Nature. 2023 Sep;621(7978):355-364. doi: 10.1038/s41586-023-06425-6. Epub 2023 Aug 23.
4
Multiscale analysis of pangenomes enables improved representation of genomic diversity for repetitive and clinically relevant genes.泛基因组的多尺度分析能够改善对重复和临床相关基因的基因组多样性的表示。
Nat Methods. 2023 Aug;20(8):1213-1221. doi: 10.1038/s41592-023-01914-y. Epub 2023 Jun 26.
5
Sequencing by avidity enables high accuracy with low reagent consumption.高亲和力测序可实现高精度和低试剂消耗。
Nat Biotechnol. 2024 Jan;42(1):132-138. doi: 10.1038/s41587-023-01750-7. Epub 2023 May 25.
6
A draft human pangenome reference.人类泛基因组参考草图。
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.
7
Variant calling and benchmarking in an era of complete human genome sequences.全基因组序列时代的变异调用和基准测试。
Nat Rev Genet. 2023 Jul;24(7):464-483. doi: 10.1038/s41576-023-00590-0. Epub 2023 Apr 14.
8
Benchmarking challenging small variants with linked and long reads.使用连锁读段和长读段对具有挑战性的小变异进行基准测试。
Cell Genom. 2022 May;2(5). doi: 10.1016/j.xgen.2022.100128.
9
Semi-automated assembly of high-quality diploid human reference genomes.半自动组装高质量的二倍体人类参考基因组。
Nature. 2022 Nov;611(7936):519-531. doi: 10.1038/s41586-022-05325-5. Epub 2022 Oct 19.
10
Chromosome-scale haplotype-resolved pangenomics.染色体级单倍型解析泛基因组学
Trends Genet. 2022 Nov;38(11):1103-1107. doi: 10.1016/j.tig.2022.06.011. Epub 2022 Jul 8.