• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Mumemto:跨全基因组的高效最大匹配

Mumemto: efficient maximal matching across pangenomes.

作者信息

Shivakumar Vikram S, Langmead Ben

机构信息

Department of Computer Science, Johns Hopkins University.

出版信息

bioRxiv. 2025 Jan 5:2025.01.05.631388. doi: 10.1101/2025.01.05.631388.

DOI:10.1101/2025.01.05.631388
PMID:39803467
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11722392/
Abstract

Aligning genomes into common coordinates is central to pangenome analysis and construction, but it is also computationally expensive. Multi-sequence maximal unique matches (multi-MUMs) are guideposts for core genome alignments, helping to frame and solve the multiple alignment problem. We introduce Mumemto, a tool that computes multi-MUMs and other match types across large pangenomes. Mumemto allows for visualization of synteny, reveals aberrant assemblies and scaffolds, and highlights pangenome conservation and structural variation. Mumemto computes multi-MUMs across 320 human genome assemblies (960GB) in 25.7 hours with under 800 GB of memory, and over hundreds of fungal genome assemblies in minutes. Mumemto is implemented in C++ and Python and available open-source at https://github.com/vikshiv/mumemto.

摘要

将基因组比对到共同的坐标是泛基因组分析和构建的核心,但计算成本也很高。多序列最大唯一匹配(multi-MUMs)是核心基因组比对的路标,有助于构建和解决多重比对问题。我们引入了Mumemto,这是一种可在大型泛基因组中计算multi-MUMs和其他匹配类型的工具。Mumemto允许对共线性进行可视化,揭示异常组装和支架,并突出泛基因组的保守性和结构变异。Mumemto在25.7小时内使用不到800GB的内存就能在320个人类基因组组装(960GB)中计算multi-MUMs,并且能在几分钟内完成数百个真菌基因组组装的计算。Mumemto用C++和Python实现,可在https://github.com/vikshiv/mumemto上开源获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7647/11722392/add309b89873/nihpp-2025.01.05.631388v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7647/11722392/07b12c3f226b/nihpp-2025.01.05.631388v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7647/11722392/5c51eaddc820/nihpp-2025.01.05.631388v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7647/11722392/f24334a19676/nihpp-2025.01.05.631388v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7647/11722392/b3ccb18fb7d3/nihpp-2025.01.05.631388v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7647/11722392/add309b89873/nihpp-2025.01.05.631388v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7647/11722392/07b12c3f226b/nihpp-2025.01.05.631388v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7647/11722392/5c51eaddc820/nihpp-2025.01.05.631388v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7647/11722392/f24334a19676/nihpp-2025.01.05.631388v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7647/11722392/b3ccb18fb7d3/nihpp-2025.01.05.631388v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7647/11722392/add309b89873/nihpp-2025.01.05.631388v1-f0005.jpg

相似文献

1
Mumemto: efficient maximal matching across pangenomes.Mumemto:跨全基因组的高效最大匹配
bioRxiv. 2025 Jan 5:2025.01.05.631388. doi: 10.1101/2025.01.05.631388.
2
Mumemto: efficient maximal matching across pangenomes.Mumemto:跨泛基因组的高效最大匹配
Genome Biol. 2025 Jun 17;26(1):169. doi: 10.1186/s13059-025-03644-0.
3
Partitioned Multi-MUM finding for scalable pangenomics.用于可扩展全基因组学的分区多MUM查找
bioRxiv. 2025 May 25:2025.05.20.654611. doi: 10.1101/2025.05.20.654611.
4
Initial arch wires used in orthodontic treatment with fixed appliances.固定矫治器正畸治疗中使用的初始弓丝。
Cochrane Database Syst Rev. 2018 Jul 31;7(7):CD007859. doi: 10.1002/14651858.CD007859.pub4.
5
Antidepressants for pain management in adults with chronic pain: a network meta-analysis.抗抑郁药治疗成人慢性疼痛的疼痛管理:一项网络荟萃分析。
Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948.
6
Topical antibiotics with steroids for chronic suppurative otitis media.用于慢性化脓性中耳炎的含类固醇局部用抗生素
Cochrane Database Syst Rev. 2025 Jun 9;6(6):CD013054. doi: 10.1002/14651858.CD013054.pub3.
7
Lumbar sympathectomy versus prostanoids for critical limb ischaemia due to non-reconstructable peripheral arterial disease.腰交感神经切除术与前列腺素类药物治疗因不可重建的外周动脉疾病导致的严重肢体缺血的比较
Cochrane Database Syst Rev. 2018 Apr 16;4(4):CD009366. doi: 10.1002/14651858.CD009366.pub2.
8
SAKit: An all-in-one analysis pipeline for identifying novel proteins resulting from variant events at both large and small scales.SAKit:一种用于鉴定由大尺度和小尺度变异事件产生的新型蛋白质的一体化分析管道。
J Bioinform Comput Biol. 2024 Oct;22(5):2450022. doi: 10.1142/S0219720024500227. Epub 2024 Oct 1.
9
Exercise versus airway clearance techniques for people with cystic fibrosis.运动与气道廓清技术治疗囊性纤维化。
Cochrane Database Syst Rev. 2022 Jun 22;6(6):CD013285. doi: 10.1002/14651858.CD013285.pub2.
10
Perioperative medications for preventing temporarily increased intraocular pressure after laser trabeculoplasty.用于预防激光小梁成形术后眼压暂时升高的围手术期药物。
Cochrane Database Syst Rev. 2017 Feb 23;2(2):CD010746. doi: 10.1002/14651858.CD010746.pub2.

本文引用的文献

1
SVbyEye: A visual tool to characterize structural variation among whole-genome assemblies.SVbyEye:一种用于表征全基因组组装之间结构变异的可视化工具。
Bioinformatics. 2025 Jun 6. doi: 10.1093/bioinformatics/btaf332.
2
Movi: A fast and cache-efficient full-text pangenome index.Movi:一种快速且缓存高效的全基因组索引。
iScience. 2024 Nov 27;27(12):111464. doi: 10.1016/j.isci.2024.111464. eCollection 2024 Dec 20.
3
BWT construction and search at the terabase scale.万亿碱基规模下的BWT构建与搜索。
Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae717.
4
Building a pangenome alignment index via recursive prefix-free parsing.通过递归无前缀解析构建泛基因组比对索引。
iScience. 2024 Sep 12;27(10):110933. doi: 10.1016/j.isci.2024.110933. eCollection 2024 Oct 18.
5
Parsnp 2.0: scalable core-genome alignment for massive microbial datasets.Parsnp 2.0:适用于大规模微生物数据集的可扩展核心基因组比对工具。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae311.
6
A pan-genome of 69 Arabidopsis thaliana accessions reveals a conserved genome structure throughout the global species range.69 个拟南芥品系的泛基因组揭示了全球物种范围内的保守基因组结构。
Nat Genet. 2024 May;56(5):982-991. doi: 10.1038/s41588-024-01715-9. Epub 2024 Apr 11.
7
A call for spatial omics submissions.征集空间组学投稿。
Nat Genet. 2024 Jan;56(1):1. doi: 10.1038/s41588-023-01621-6.
8
Telomere-to-telomere assemblies of 142 strains characterize the genome structural landscape in Saccharomyces cerevisiae.142 株酿酒酵母的端粒到端粒组装描绘了基因组结构景观。
Nat Genet. 2023 Aug;55(8):1390-1399. doi: 10.1038/s41588-023-01459-y. Epub 2023 Jul 31.
9
Genomic sketching with multiplicities and locality-sensitive hashing using Dashing 2.使用 Dashing 2 进行多重性和位置敏感哈希的基因组草图绘制。
Genome Res. 2023 Jul;33(7):1218-1227. doi: 10.1101/gr.277655.123. Epub 2023 Jul 6.
10
SPUMONI 2: improved classification using a pangenome index of minimizer digests.SPUMONI 2:使用最小化消化物的泛基因组指数进行改进分类。
Genome Biol. 2023 May 18;24(1):122. doi: 10.1186/s13059-023-02958-1.