Suppr超能文献

少即是多:基因组学中的最小化器草图。

When less is more: sketching with minimizers in genomics.

机构信息

Department of Fundamental Microbiology, UNIL, Lausanne, Switzerland.

Department of Computational Biology, UNIL, Lausanne, Switzerland.

出版信息

Genome Biol. 2024 Oct 14;25(1):270. doi: 10.1186/s13059-024-03414-4.

Abstract

The exponential increase in sequencing data calls for conceptual and computational advances to extract useful biological insights. One such advance, minimizers, allows for reducing the quantity of data handled while maintaining some of its key properties. We provide a basic introduction to minimizers, cover recent methodological developments, and review the diverse applications of minimizers to analyze genomic data, including de novo genome assembly, metagenomics, read alignment, read correction, and pangenomes. We also touch on alternative data sketching techniques including universal hitting sets, syncmers, or strobemers. Minimizers and their alternatives have rapidly become indispensable tools for handling vast amounts of data.

摘要

测序数据的指数级增长要求在提取有用的生物学见解方面取得概念和计算上的进展。其中一种进展是 minimizers,它可以在保持数据关键属性的同时减少处理的数据量。我们提供了 minimizers 的基本介绍,涵盖了最近的方法学发展,并回顾了 minimizers 在分析基因组数据中的多种应用,包括从头基因组组装、宏基因组学、读对齐、读校正和泛基因组。我们还涉及了替代数据草图技术,包括通用命中集、syncmers 或 strobe rs。Minimizers 及其替代品已迅速成为处理大量数据不可或缺的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d6d/11472564/2bde933dbfd3/13059_2024_3414_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验