K-mer analysis of long-read alignment pileups for structural variant genotyping.

Authors

English Adam C, Cunial Fabio, Metcalf Ginger A, Gibbs Richard A, Sedlazeck Fritz J

Affiliations

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Publication information

bioRxiv. 2024 Oct 25:2024.10.22.619642. doi: 10.1101/2024.10.22.619642.

Abstract

Accurately genotyping structural variant (SV) alleles is crucial to genomics research. We present a novel method (kanpig) for genotyping SVs that leverages variant graphs and k-mer vectors to rapidly generate accurate SV genotypes. We benchmark kanpig against the latest SV benchmarks and show single-sample genotyping concordance of 82.1%, compared with an average of 66.3% for existing genotypers. We explore kanpig's applicability to multi-sample projects by benchmarking project-level VCFs containing 47 genetically diverse samples and find that kanpig accurately genotypes complex loci (e.g., SVs neighboring other SVs), achieving much higher genotyping concordance than other tools. Kanpig requires only 43 seconds to process a single sample's 20x long reads and can be run on PacBio or ONT long reads.
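The abstract does not spell out how k-mer vectors are built or compared, so the following is only a minimal sketch of the general idea, not kanpig's implementation. It assumes a k-mer vector is a count of every overlapping k-mer in a sequence and that two vectors are compared by cosine similarity; the value of `k`, the similarity measure, and the function names `kmer_vector`, `cosine_similarity`, and `best_matching_allele` are all hypothetical choices made for illustration.

```python
from collections import Counter
from math import sqrt


def kmer_vector(seq: str, k: int = 4) -> Counter:
    """Count every overlapping k-mer in seq (k=4 is an arbitrary choice)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two k-mer count vectors (assumed metric)."""
    dot = sum(count * b[kmer] for kmer, count in a.items())
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a sequence shorter than k has no k-mers
    return dot / (norm_a * norm_b)


def best_matching_allele(read_seq: str, alleles: dict, k: int = 4) -> str:
    """Return the name of the candidate allele most similar to the read."""
    read_vec = kmer_vector(read_seq, k)
    return max(
        alleles,
        key=lambda name: cosine_similarity(read_vec, kmer_vector(alleles[name], k)),
    )


# Toy inserted sequences; these are made-up strings, not real data.
candidate_alleles = {
    "ins_A": "ACGTACGTACGTTTGA",
    "ins_B": "GGGCCCGGGCCCAAAT",
}
# A read-derived sequence that differs from ins_A by one base.
read_insertion = "ACGTACGTACGATTGA"
matched = best_matching_allele(read_insertion, candidate_alleles)
```

Comparing count vectors rather than aligning sequences base by base tolerates small sequencing errors, since a single mismatch only disturbs the few k-mers that overlap it.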

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40aa/11526963/427dfa7a501f/nihpp-2024.10.22.619642v1-f0001.jpg
