Suppr超能文献

Graphite:使用彩色德布鲁因图绘制基因组

Graphite: painting genomes using a colored de Bruijn graph.

作者信息

Beeloo Rick, Zomer Aldert L, Deorowicz Sebastian, Dutilh Bas E

机构信息

Theoretical Biology and Bioinformatics, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands.

Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, 3584 Utrecht, The Netherlands.

出版信息

NAR Genom Bioinform. 2024 Oct 23;6(4):lqae142. doi: 10.1093/nargab/lqae142. eCollection 2024 Sep.

Abstract

The recent growth of microbial sequence data allows comparisons at unprecedented scales, enabling the tracking of strains, mobile genetic elements, or genes. Querying a genome against a large reference database can easily yield thousands of matches that are tedious to interpret and pose computational challenges. We developed Graphite that uses a colored de Bruijn graph (cDBG) to paint query genomes, selecting the local best matches along the full query length. By focusing on the best genomic match of each query region, Graphite reduces the number of matches while providing the most promising leads for sequence tracking or genomic forensics. When applied to hundreds of genomes we found extensive gene sharing, including a previously undetected plasmid that matched a chromosome. Together, genome painting using cDBGs as enabled by Graphite, can reveal new biological phenomena by mitigating computational hurdles.

摘要

近期微生物序列数据的增长使得能够以前所未有的规模进行比较,从而实现对菌株、移动遗传元件或基因的追踪。将一个基因组与大型参考数据库进行比对,很容易产生数千个匹配结果,这些结果解读起来很繁琐,并且带来了计算方面的挑战。我们开发了Graphite,它使用彩色德布鲁因图(cDBG)对查询基因组进行描绘,沿着整个查询长度选择局部最佳匹配。通过专注于每个查询区域的最佳基因组匹配,Graphite减少了匹配数量,同时为序列追踪或基因组法医鉴定提供了最有前景的线索。当应用于数百个基因组时,我们发现了广泛的基因共享,包括一个之前未检测到的与一条染色体匹配的质粒。总之,借助Graphite实现的使用cDBG进行基因组描绘,可以通过克服计算障碍揭示新的生物学现象。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4905/11497850/2300abc77c0c/lqae142figgra1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验