• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TT-Mars:基于单倍型解析组装的结构变异评估。

TT-Mars: structural variants assessment based on haplotype-resolved assemblies.

机构信息

Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.

出版信息

Genome Biol. 2022 May 6;23(1):110. doi: 10.1186/s13059-022-02666-2.

DOI:10.1186/s13059-022-02666-2
PMID:35524317
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9077962/
Abstract

Variant benchmarking is often performed by comparing a test callset to a gold standard set of variants. In repetitive regions of the genome, it may be difficult to establish what is the truth for a call, for example, when different alignment scoring metrics provide equally supported but different variant calls on the same data. Here, we provide an alternative approach, TT-Mars, that takes advantage of the recent production of high-quality haplotype-resolved genome assemblies by providing false discovery rates for variant calls based on how well their call reflects the content of the assembly, rather than comparing calls themselves.

摘要

变异基准测试通常通过将测试调用集与变异的黄金标准集进行比较来完成。在基因组的重复区域,确定调用的真实性可能很困难,例如,当不同的比对评分指标在相同的数据上提供同样支持但不同的变异调用时。在这里,我们提供了一种替代方法 TT-Mars,它利用了最近产生的高质量单倍型解析基因组组装,通过基于调用反映组装内容的程度而不是比较调用本身来为变异调用提供错误发现率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d65/9077962/f90966626c07/13059_2022_2666_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d65/9077962/02439136d92d/13059_2022_2666_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d65/9077962/3ea8f3e8fa56/13059_2022_2666_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d65/9077962/29fc57cf71e9/13059_2022_2666_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d65/9077962/eda24c7ff561/13059_2022_2666_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d65/9077962/f90966626c07/13059_2022_2666_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d65/9077962/02439136d92d/13059_2022_2666_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d65/9077962/3ea8f3e8fa56/13059_2022_2666_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d65/9077962/29fc57cf71e9/13059_2022_2666_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d65/9077962/eda24c7ff561/13059_2022_2666_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2d65/9077962/f90966626c07/13059_2022_2666_Fig5_HTML.jpg

相似文献

1
TT-Mars: structural variants assessment based on haplotype-resolved assemblies.TT-Mars:基于单倍型解析组装的结构变异评估。
Genome Biol. 2022 May 6;23(1):110. doi: 10.1186/s13059-022-02666-2.
2
Robust Benchmark Structural Variant Calls of An Asian Using State-of-the-art Long-read Sequencing Technologies.利用最先进的长读测序技术对亚洲个体进行稳健的基准结构变异调用。
Genomics Proteomics Bioinformatics. 2022 Feb;20(1):192-204. doi: 10.1016/j.gpb.2020.10.006. Epub 2021 Mar 2.
3
svclassify: a method to establish benchmark structural variant calls.svclassify:一种建立基准结构变异调用的方法。
BMC Genomics. 2016 Jan 16;17:64. doi: 10.1186/s12864-016-2366-2.
4
Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies.清除单倍型:三代二倍体基因组组装的等位基因 contig 重新分配。
BMC Bioinformatics. 2018 Nov 29;19(1):460. doi: 10.1186/s12859-018-2485-7.
5
Best practices for benchmarking germline small-variant calls in human genomes.人类基因组中小变异calls 的基准测试最佳实践。
Nat Biotechnol. 2019 May;37(5):555-560. doi: 10.1038/s41587-019-0054-x. Epub 2019 Mar 11.
6
Haplotype-resolved assemblies and variant benchmark of a Chinese Quartet.单体型解析组装与中国四重奏个体的变异基准
Genome Biol. 2023 Dec 4;24(1):277. doi: 10.1186/s13059-023-03116-3.
7
Systematic benchmark of state-of-the-art variant calling pipelines identifies major factors affecting accuracy of coding sequence variant discovery.系统基准测试最先进的变异调用管道,确定影响编码序列变异发现准确性的主要因素。
BMC Genomics. 2022 Feb 22;23(1):155. doi: 10.1186/s12864-022-08365-3.
8
Structural variant-based pangenome construction has low sensitivity to variability of haplotype-resolved bovine assemblies.基于结构变异的泛基因组构建对单倍型解析牛组装体的变异性的敏感性较低。
Nat Commun. 2022 May 31;13(1):3012. doi: 10.1038/s41467-022-30680-2.
9
geck: trio-based comparative benchmarking of variant calls.geck:基于 trio 的变异调用比较基准测试。
Bioinformatics. 2018 Oct 15;34(20):3488-3495. doi: 10.1093/bioinformatics/bty415.
10
MsPAC: a tool for haplotype-phased structural variant detection.MsPAC:一种用于单体型相位结构变异检测的工具。
Bioinformatics. 2020 Feb 1;36(3):922-924. doi: 10.1093/bioinformatics/btz618.

引用本文的文献

1
ASVBM: Structural variant benchmarking with local joint analysis for multiple callsets.ASVBM:通过对多个数据集进行局部联合分析的结构变异基准测试
Comput Struct Biotechnol J. 2025 Jun 29;27:2851-2862. doi: 10.1016/j.csbj.2025.06.045. eCollection 2025.
2
Comprehensive evaluation and guidance of structural variation detection tools in chicken whole genome sequence data.鸡全基因组序列数据中结构变异检测工具的综合评估和指导
BMC Genomics. 2024 Oct 16;25(1):970. doi: 10.1186/s12864-024-10875-1.
3
Analysis and benchmarking of small and large genomic variants across tandem repeats.

本文引用的文献

1
Mako: A Graph-based Pattern Growth Approach to Detect Complex Structural Variants.Mako:一种基于图的模式生长方法,用于检测复杂结构变异。
Genomics Proteomics Bioinformatics. 2022 Feb;20(1):205-218. doi: 10.1016/j.gpb.2021.03.007. Epub 2021 Jul 3.
2
lra: A long read aligner for sequences and contigs.lra:一种用于序列和重叠群的长读比对工具。
PLoS Comput Biol. 2021 Jun 21;17(6):e1009078. doi: 10.1371/journal.pcbi.1009078. eCollection 2021 Jun.
3
Samplot: a platform for structural variant visual validation and automated filtering.
串联重复序列中小的和大的基因组变异的分析与基准测试。
Nat Biotechnol. 2025 Mar;43(3):431-442. doi: 10.1038/s41587-024-02225-z. Epub 2024 Apr 26.
4
Advances in the discovery and analyses of human tandem repeats.人类串联重复序列的发现和分析进展。
Emerg Top Life Sci. 2023 Dec 14;7(3):361-381. doi: 10.1042/ETLS20230074.
5
Genomic variant benchmark: if you cannot measure it, you cannot improve it.基因组变异基准:如果无法衡量,就无法改进。
Genome Biol. 2023 Oct 5;24(1):221. doi: 10.1186/s13059-023-03061-1.
6
Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation.可扩展的纳米孔测序技术对人类基因组进行测序,提供了全面的单倍型分辨率变异和甲基化视图。
Nat Methods. 2023 Oct;20(10):1483-1492. doi: 10.1038/s41592-023-01993-x. Epub 2023 Sep 14.
7
A survey of algorithms for the detection of genomic structural variants from long-read sequencing data.长读测序数据中基因组结构变异检测算法研究综述。
Nat Methods. 2023 Aug;20(8):1143-1158. doi: 10.1038/s41592-023-01932-w. Epub 2023 Jun 29.
8
Structural variant-based pangenome construction has low sensitivity to variability of haplotype-resolved bovine assemblies.基于结构变异的泛基因组构建对单倍型解析牛组装体的变异性的敏感性较低。
Nat Commun. 2022 May 31;13(1):3012. doi: 10.1038/s41467-022-30680-2.
Samplot:用于结构变异可视化验证和自动过滤的平台。
Genome Biol. 2021 May 25;22(1):161. doi: 10.1186/s13059-021-02380-5.
4
Expectations and blind spots for structural variation detection from long-read assemblies and short-read genome sequencing technologies.从长读序列组装和短读基因组测序技术中检测结构变异的预期和盲点。
Am J Hum Genet. 2021 May 6;108(5):919-928. doi: 10.1016/j.ajhg.2021.03.014. Epub 2021 Mar 30.
5
Haplotype-resolved diverse human genomes and integrated analysis of structural variation.单体型解析的多样化人类基因组和结构变异的综合分析。
Science. 2021 Apr 2;372(6537). doi: 10.1126/science.abf7117. Epub 2021 Feb 25.
6
Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm.使用带有 hifiasm 的相定装配图进行单体型解析从头组装。
Nat Methods. 2021 Feb;18(2):170-175. doi: 10.1038/s41592-020-01056-5. Epub 2021 Feb 1.
7
A diploid assembly-based benchmark for variants in the major histocompatibility complex.基于二倍体组装的主要组织相容性复合体变异基准
Nat Commun. 2020 Sep 22;11(1):4794. doi: 10.1038/s41467-020-18564-9.
8
A robust benchmark for detection of germline large deletions and insertions.一种用于检测种系大片段缺失和插入的稳健基准
Nat Biotechnol. 2020 Nov;38(11):1347-1355. doi: 10.1038/s41587-020-0538-8. Epub 2020 Jun 15.
9
A structural variation reference for medical and population genetics.医学和人群遗传学的结构变异参考
Nature. 2020 May;581(7809):444-451. doi: 10.1038/s41586-020-2287-8. Epub 2020 May 27.
10
Mapping and characterization of structural variation in 17,795 human genomes.人类基因组 17795 号结构变异的定位与特征分析。
Nature. 2020 Jul;583(7814):83-89. doi: 10.1038/s41586-020-2371-0. Epub 2020 May 27.