ModDotPlot: rapid and interactive visualization of tandem repeats.

Author affiliations

Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, United States.

Department of Computer Science, Johns Hopkins University, Baltimore, MD 21211, United States.

Publication information

Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae493.

Abstract

MOTIVATION

A common method for analyzing genomic repeats is to produce a sequence similarity matrix visualized via a dot plot. Innovative approaches such as StainedGlass have improved upon this classic visualization by rendering dot plots as a heatmap of sequence identity, enabling researchers to better visualize multi-megabase tandem repeat arrays within centromeres and other heterochromatic regions of the genome. However, computing the similarity estimates for heatmaps requires high computational overhead and can suffer from decreasing accuracy.

RESULTS

In this work, we introduce ModDotPlot, an interactive and alignment-free dot plot viewer. By approximating average nucleotide identity via a k-mer-based containment index, ModDotPlot produces accurate plots orders of magnitude faster than StainedGlass. We accomplish this through the use of a hierarchical modimizer scheme that can visualize the full 128 Mb genome of Arabidopsis thaliana in under 5 min on a laptop. ModDotPlot is bundled with a graphical user interface supporting real-time interactive navigation of entire chromosomes.
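
The idea of estimating identity from a k-mer containment index over mod-sampled k-mers can be illustrated with a minimal Python sketch. This is not ModDotPlot's actual implementation: all function names are hypothetical, the asymmetric definition |A ∩ B| / |A| is one common convention, the conversion C^(1/k) is the usual Mash-style approximation and is assumed here, and the hierarchical part of the scheme is not shown.

```python
import hashlib


def kmers(seq: str, k: int) -> set[str]:
    """Return the set of all k-mers in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def stable_hash(kmer: str) -> int:
    """Deterministic 64-bit hash (Python's built-in hash() is salted per process)."""
    digest = hashlib.blake2b(kmer.encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def modimizers(seq: str, k: int, s: int) -> set[str]:
    """Keep only k-mers whose hash is divisible by s (roughly 1/s of them)."""
    return {km for km in kmers(seq, k) if stable_hash(km) % s == 0}


def containment(a: set[str], b: set[str]) -> float:
    """Containment index of a in b: |a & b| / |a| (assumed convention)."""
    return len(a & b) / len(a) if a else 0.0


def identity_estimate(c: float, k: int) -> float:
    """Assumed Mash-style conversion from containment to identity."""
    return c ** (1.0 / k)


def identity_matrix(seq: str, window: int, k: int, s: int = 1) -> list[list[float]]:
    """Estimated identity between all pairs of non-overlapping windows."""
    sketches = [
        modimizers(seq[i:i + window], k, s)
        for i in range(0, len(seq) - window + 1, window)
    ]
    return [
        [identity_estimate(containment(a, b), k) for b in sketches]
        for a in sketches
    ]


if __name__ == "__main__":
    # A perfect tandem repeat: every window holds the same k-mers.
    repeat = "ACGTTGCA" * 8
    for row in identity_matrix(repeat, window=16, k=4):
        print(" ".join(f"{value:.2f}" for value in row))
```

Each cell of the resulting matrix corresponds to one cell of a heatmap-style dot plot. Raising `s` shrinks every sketch, trading some accuracy for speed and memory.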

AVAILABILITY AND IMPLEMENTATION

ModDotPlot is available at https://github.com/marbl/ModDotPlot.
