• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Mumemto:跨泛基因组的高效最大匹配

Mumemto: efficient maximal matching across pangenomes.

作者信息

Shivakumar Vikram S, Langmead Ben

机构信息

Department of Computer Science, Johns Hopkins University, Baltimore, USA.

出版信息

Genome Biol. 2025 Jun 17;26(1):169. doi: 10.1186/s13059-025-03644-0.

DOI:10.1186/s13059-025-03644-0
PMID:40528225
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12172372/
Abstract

Aligning genomes into common coordinates is central to pangenome construction, though computationally expensive. Multi-sequence maximal unique matches (multi-MUMs) help to frame and solve the multiple alignment problem. We introduce Mumemto, a tool that computes multi-MUMs and other match types across large pangenomes. Mumemto allows for visualization of synteny, reveals aberrant assemblies and scaffolds, and highlights pangenome conservation and structural variation. Mumemto computes multi-MUMs across 320 human assemblies (960GB) in 25.7 h with 800 GB of memory and hundreds of fungal assemblies in minutes. Mumemto is implemented in C++ and Python and available open-source at https://github.com/vikshiv/mumemto (v1.1.1 at doi.org/10.5281/zenodo.15053447 ).

摘要

将基因组比对到共同的坐标是构建泛基因组的核心,尽管计算成本高昂。多序列最大唯一匹配(multi-MUMs)有助于构建和解决多重比对问题。我们引入了Mumemto,这是一种可在大型泛基因组中计算多MUMs和其他匹配类型的工具。Mumemto允许对共线性进行可视化,揭示异常组装和支架,并突出显示泛基因组的保守性和结构变异。Mumemto使用800GB内存,在25.7小时内可对320个人类基因组组装(960GB)计算多MUMs,对数百个真菌基因组组装只需几分钟。Mumemto用C++和Python实现,可在https://github.com/vikshiv/mumemto(doi.org/10.5281/zenodo.15053447上的v1.1.1版本)开源获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/1b196526a93b/13059_2025_3644_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/cc3deb4fdd1b/13059_2025_3644_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/6aaa06140981/13059_2025_3644_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/cf324b39e2b2/13059_2025_3644_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/a7b54c6697cb/13059_2025_3644_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/fb5e0bf7ebcd/13059_2025_3644_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/1b196526a93b/13059_2025_3644_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/cc3deb4fdd1b/13059_2025_3644_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/6aaa06140981/13059_2025_3644_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/cf324b39e2b2/13059_2025_3644_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/a7b54c6697cb/13059_2025_3644_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/fb5e0bf7ebcd/13059_2025_3644_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49a8/12172372/1b196526a93b/13059_2025_3644_Figa_HTML.jpg

相似文献

1
Mumemto: efficient maximal matching across pangenomes.Mumemto:跨泛基因组的高效最大匹配
Genome Biol. 2025 Jun 17;26(1):169. doi: 10.1186/s13059-025-03644-0.
2
Mumemto: efficient maximal matching across pangenomes.Mumemto:跨全基因组的高效最大匹配
bioRxiv. 2025 Jan 5:2025.01.05.631388. doi: 10.1101/2025.01.05.631388.
3
Partitioned Multi-MUM finding for scalable pangenomics.用于可扩展全基因组学的分区多MUM查找
bioRxiv. 2025 May 25:2025.05.20.654611. doi: 10.1101/2025.05.20.654611.
4
SVbyEye: A visual tool to characterize structural variation among whole-genome assemblies.SVbyEye:一种用于表征全基因组组装之间结构变异的可视化工具。
Bioinformatics. 2025 Jun 6. doi: 10.1093/bioinformatics/btaf332.
5
Pairwise graph edit distance characterizes the impact of the construction method on pangenome graphs.成对图编辑距离表征了构建方法对泛基因组图的影响。
Bioinformatics. 2025 May 9. doi: 10.1093/bioinformatics/btaf291.
6
SAKit: An all-in-one analysis pipeline for identifying novel proteins resulting from variant events at both large and small scales.SAKit:一种用于鉴定由大尺度和小尺度变异事件产生的新型蛋白质的一体化分析管道。
J Bioinform Comput Biol. 2024 Oct;22(5):2450022. doi: 10.1142/S0219720024500227. Epub 2024 Oct 1.
7
dsOMOP: Bridging OMOP CDM and DataSHIELD for Secure Federated Analysis of Standardized Clinical Data.dsOMOP:连接OMOP通用数据模型(CDM)和DataSHIELD以实现标准化临床数据的安全联合分析
Bioinformatics. 2025 May 6. doi: 10.1093/bioinformatics/btaf286.
8
HSDSnake: a user-friendly SnakeMake pipeline for analysis of duplicate genes in eukaryotic genomes.HSDSnake:一个用于分析真核生物基因组中重复基因的用户友好型SnakeMake工作流程。
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf325.
9
Fast and accurate antigen typing with Kaptive 3.使用Kaptive 3进行快速准确的抗原分型。
Microb Genom. 2025 Jun;11(6). doi: 10.1099/mgen.0.001428.
10
DNA sequences alignment method using sparse index on pan-genome graph.基于泛基因组图的稀疏索引的 DNA 序列比对方法。
J Bioinform Comput Biol. 2024 Aug;22(4):2450019. doi: 10.1142/S0219720024500197. Epub 2024 Aug 31.

引用本文的文献

1
Partitioned Multi-MUM finding for scalable pangenomics.用于可扩展全基因组学的分区多MUM查找
bioRxiv. 2025 May 25:2025.05.20.654611. doi: 10.1101/2025.05.20.654611.

本文引用的文献

1
SVbyEye: A visual tool to characterize structural variation among whole-genome assemblies.SVbyEye:一种用于表征全基因组组装之间结构变异的可视化工具。
Bioinformatics. 2025 Jun 6. doi: 10.1093/bioinformatics/btaf332.
2
Leveraging a phased pangenome for haplotype design of hybrid potato.利用分阶段的泛基因组进行杂交马铃薯的单倍型设计。
Nature. 2025 Apr;640(8058):408-417. doi: 10.1038/s41586-024-08476-9. Epub 2025 Jan 22.
3
Movi: A fast and cache-efficient full-text pangenome index.Movi:一种快速且缓存高效的全基因组索引。
iScience. 2024 Nov 27;27(12):111464. doi: 10.1016/j.isci.2024.111464. eCollection 2024 Dec 20.
4
BWT construction and search at the terabase scale.万亿碱基规模下的BWT构建与搜索。
Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae717.
5
Building a pangenome alignment index via recursive prefix-free parsing.通过递归无前缀解析构建泛基因组比对索引。
iScience. 2024 Sep 12;27(10):110933. doi: 10.1016/j.isci.2024.110933. eCollection 2024 Oct 18.
6
Parsnp 2.0: scalable core-genome alignment for massive microbial datasets.Parsnp 2.0:适用于大规模微生物数据集的可扩展核心基因组比对工具。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae311.
7
Telomere-to-telomere assemblies of 142 strains characterize the genome structural landscape in Saccharomyces cerevisiae.142 株酿酒酵母的端粒到端粒组装描绘了基因组结构景观。
Nat Genet. 2023 Aug;55(8):1390-1399. doi: 10.1038/s41588-023-01459-y. Epub 2023 Jul 31.
8
Genomic sketching with multiplicities and locality-sensitive hashing using Dashing 2.使用 Dashing 2 进行多重性和位置敏感哈希的基因组草图绘制。
Genome Res. 2023 Jul;33(7):1218-1227. doi: 10.1101/gr.277655.123. Epub 2023 Jul 6.
9
SPUMONI 2: improved classification using a pangenome index of minimizer digests.SPUMONI 2:使用最小化消化物的泛基因组指数进行改进分类。
Genome Biol. 2023 May 18;24(1):122. doi: 10.1186/s13059-023-02958-1.
10
A draft human pangenome reference.人类泛基因组参考草图。
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.