• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用变异图进行跨变异类和长度的精确基因分型。

Accurate genotyping across variant classes and lengths using variant graphs.

机构信息

The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen, Denmark.

出版信息

Nat Genet. 2018 Jul;50(7):1054-1059. doi: 10.1038/s41588-018-0145-5. Epub 2018 Jun 18.

DOI:10.1038/s41588-018-0145-5
PMID:29915429
Abstract

Genotype estimates from short-read sequencing data are typically based on the alignment of reads to a linear reference, but reads originating from more complex variants (for example, structural variants) often align poorly, resulting in biased genotype estimates. This bias can be mitigated by first collecting a set of candidate variants across discovery methods, individuals and databases, and then realigning the reads to the variants and reference simultaneously. However, this realignment problem has proved computationally difficult. Here, we present a new method (BayesTyper) that uses exact alignment of read k-mers to a graph representation of the reference and variants to efficiently perform unbiased, probabilistic genotyping across the variation spectrum. We demonstrate that BayesTyper generally provides superior variant sensitivity and genotyping accuracy relative to existing methods when used to integrate variants across discovery approaches and individuals. Finally, we demonstrate that including a 'variation-prior' database containing already known variants significantly improves sensitivity.

摘要

基于短读测序数据的基因型估计通常基于将读取与线性参考进行比对,但源自更复杂变体(例如结构变体)的读取通常比对效果不佳,导致基因型估计存在偏差。通过首先使用各种发现方法、个体和数据库收集一组候选变体,然后同时将读取重新对齐到变体和参考上,可以减轻这种偏差。然而,这种重新对齐问题在计算上被证明是困难的。在这里,我们提出了一种新的方法(BayesTyper),它使用读取 k-mer 与参考和变体的图形表示的精确比对,以有效地在整个变异谱中进行无偏、概率基因型检测。我们证明,当用于整合来自不同发现方法和个体的变体时,BayesTyper 通常相对于现有方法提供更高的变体敏感性和基因型准确性。最后,我们证明包含包含已知变体的“变体先验”数据库显著提高了敏感性。

相似文献

1
Accurate genotyping across variant classes and lengths using variant graphs.使用变异图进行跨变异类和长度的精确基因分型。
Nat Genet. 2018 Jul;50(7):1054-1059. doi: 10.1038/s41588-018-0145-5. Epub 2018 Jun 18.
2
Fast read alignment with incorporation of known genomic variants.快速读取与已知基因组变异的整合。
BMC Med Inform Decis Mak. 2019 Dec 19;19(Suppl 6):265. doi: 10.1186/s12911-019-0960-3.
3
Pangenome-based genome inference allows efficient and accurate genotyping across a wide spectrum of variant classes.基于泛基因组的基因组推断可在广泛的变异类别中实现高效、准确的基因分型。
Nat Genet. 2022 Apr;54(4):518-525. doi: 10.1038/s41588-022-01043-w. Epub 2022 Apr 11.
4
Enhancing SNV identification in whole-genome sequencing data through the incorporation of known genetic variants into the minimap2 index.通过将已知遗传变异纳入 minimap2 索引来提高全基因组测序数据中 SNV 的识别能力。
BMC Bioinformatics. 2024 Jul 13;25(1):238. doi: 10.1186/s12859-024-05862-y.
5
Calling known variants and identifying new variants while rapidly aligning sequence data.在快速对齐序列数据的同时,调用已知变异体并识别新变异体。
J Dairy Sci. 2019 Apr;102(4):3216-3229. doi: 10.3168/jds.2018-15172. Epub 2019 Feb 14.
6
Fast alignment of reads to a variation graph with application to SNP detection.快速将读取内容与变异图谱对齐,应用于 SNP 检测。
J Integr Bioinform. 2021 Nov 16;18(4):20210032. doi: 10.1515/jib-2021-0032.
7
On the critical evaluation and confirmation of germline sequence variants identified using massively parallel sequencing.关于使用大规模平行测序鉴定的种系序列变异的关键评估和确认。
J Biotechnol. 2019 Jun 10;298:64-75. doi: 10.1016/j.jbiotec.2019.04.013. Epub 2019 Apr 15.
8
Sensitive alignment using paralogous sequence variants improves long-read mapping and variant calling in segmental duplications.利用直系同源序列变异进行敏感比对可提高大片段重复区域的长读长序列比对和变异calling 效率。
Nucleic Acids Res. 2020 Nov 4;48(19):e114. doi: 10.1093/nar/gkaa829.
9
Demonstrating the utility of flexible sequence queries against indexed short reads with FlexTyper.使用 FlexTyper 对索引短读取进行灵活序列查询的实用性展示。
PLoS Comput Biol. 2021 Mar 22;17(3):e1008815. doi: 10.1371/journal.pcbi.1008815. eCollection 2021 Mar.
10
Bovine breed-specific augmented reference graphs facilitate accurate sequence read mapping and unbiased variant discovery.牛种特异性增强参考图谱有助于准确的序列读取映射和无偏的变异发现。
Genome Biol. 2020 Jul 27;21(1):184. doi: 10.1186/s13059-020-02105-0.

引用本文的文献

1
Plant graph-based pangenomics: techniques, applications, and challenges.基于植物图谱的泛基因组学:技术、应用与挑战。
aBIOTECH. 2025 Mar 28;6(2):361-376. doi: 10.1007/s42994-025-00206-7. eCollection 2025 Jun.
2
Pooled, Long-read Sequencing for Structural Variant Characterization in Schistosome Populations.用于血吸虫种群结构变异特征分析的合并长读长测序
Genome Biol Evol. 2025 Jul 3;17(7). doi: 10.1093/gbe/evaf127.
3
Benchmarking Genomic Variant Calling Tools in Inbred Mouse Strains: Recommendations and Considerations.近交系小鼠品系中基因组变异检测工具的基准测试:建议与注意事项
bioRxiv. 2025 May 31:2025.05.28.656711. doi: 10.1101/2025.05.28.656711.
4
Pangenome-aware DeepVariant.全基因组感知深度变异体
bioRxiv. 2025 Jun 6:2025.06.05.657102. doi: 10.1101/2025.06.05.657102.
5
SVLearn: a dual-reference machine learning approach enables accurate cross-species genotyping of structural variants.SVLearn:一种双参考机器学习方法可实现结构变异的准确跨物种基因分型。
Nat Commun. 2025 Mar 11;16(1):2406. doi: 10.1038/s41467-025-57756-z.
6
The landscape of structural variation in aye-ayes ().指猴的结构变异图谱()。 (括号内容原文缺失,译文根据已有内容补全括号形式)
bioRxiv. 2024 Nov 11:2024.11.08.622672. doi: 10.1101/2024.11.08.622672.
7
Integer programming framework for pangenome-based genome inference.基于泛基因组的基因组推断的整数规划框架
bioRxiv. 2024 Oct 29:2024.10.27.620212. doi: 10.1101/2024.10.27.620212.
8
A stepwise guide for pangenome development in crop plants: an alfalfa (Medicago sativa) case study.作物泛基因组开发的分步指南:以紫花苜蓿(Medicago sativa)为例。
BMC Genomics. 2024 Oct 31;25(1):1022. doi: 10.1186/s12864-024-10931-w.
9
GGTyper: genotyping complex structural variants using short-read sequencing data.GGTyper:使用短读测序数据进行基因分型复杂结构变异。
Bioinformatics. 2024 Sep 1;40(Suppl 2):ii11-ii19. doi: 10.1093/bioinformatics/btae391.
10
Telomere-to-telomere Citrullus super-pangenome provides direction for watermelon breeding.端粒到端粒的西瓜超级泛基因组为西瓜育种提供了方向。
Nat Genet. 2024 Aug;56(8):1750-1761. doi: 10.1038/s41588-024-01823-6. Epub 2024 Jul 8.